Skip to main content
Frontiers in Plant Science logoLink to Frontiers in Plant Science
. 2022 Feb 23;13:823190. doi: 10.3389/fpls.2022.823190

Highly Resolved Papilionoid Legume Phylogeny Based on Plastid Phylogenomics

In-Su Choi 1,*, Domingos Cardoso 2,*, Luciano P de Queiroz 3, Haroldo C de Lima 4, Chaehee Lee 5, Tracey A Ruhlman 5, Robert K Jansen 5,6, Martin F Wojciechowski 1
PMCID: PMC8905342  PMID: 35283880

Abstract

Comprising 501 genera and around 14,000 species, Papilionoideae is not only the largest subfamily of Fabaceae (Leguminosae; legumes), but also one of the most extraordinarily diverse clades among angiosperms. Papilionoids are a major source of food and forage, are ecologically successful in all major biomes, and display dramatic variation in both floral architecture and plastid genome (plastome) structure. Plastid DNA-based phylogenetic analyses have greatly improved our understanding of relationships among the major groups of Papilionoideae, yet the backbone of the subfamily phylogeny remains unresolved. In this study, we sequenced and assembled 39 new plastomes that are covering key genera representing the morphological diversity in the subfamily. From 244 total taxa, we produced eight datasets for maximum likelihood (ML) analyses based on entire plastomes and/or concatenated sequences of 77 protein-coding sequences (CDS) and two datasets for multispecies coalescent (MSC) analyses based on individual gene trees. We additionally produced a combined nucleotide dataset comprising CDS plus matK gene sequences only, in which most papilionoid genera were sampled. A ML tree based on the entire plastome maximally supported all of the deep and most recent divergences of papilionoids (223 out of 236 nodes). The Swartzieae, ADA (Angylocalyceae, Dipterygeae, and Amburaneae), Cladrastis, Andira, and Exostyleae clades formed a grade to the remainder of the Papilionoideae, concordant with nine ML and two MSC trees. Phylogenetic relationships among the remaining five papilionoid lineages (Vataireoid, Dermatophyllum, Genistoid s.l., Dalbergioid s.l., and Baphieae + Non-Protein Amino Acid Accumulating or NPAAA clade) remained uncertain, because of insufficient support and/or conflicting relationships among trees. Our study fully resolved most of the deep nodes of Papilionoideae, however, some relationships require further exploration. More genome-scale data and rigorous analyses are needed to disentangle phylogenetic relationships among the five remaining lineages.

Keywords: deep evolution, Meso-Papilionoideae, plastid genome, Papilionoideae, Leguminosae

Introduction

Advances in next-generation sequencing and computational resources have enabled unparalleled phylogenomic analyses. These studies have deepened our understanding of evolutionary relationships across many branches in the plant Tree of Life, from the most recalcitrant deep relationships at and within the family level (e.g., Xi et al., 2012; Goremykin et al., 2015; Duvall et al., 2020; Koenen et al., 2020a; Yang et al., 2020; Antonelli et al., 2021; Orton et al., 2021; Schneider et al., 2021; Serna-Sánchez et al., 2021) to long, unresolved radiations at the species level (e.g., Nicholls et al., 2015; Welch et al., 2016; Villaverde et al., 2018; Thode et al., 2020; Pereira et al., 2021). While massive amounts of plastid genome (plastome) sequence data have filled the family level sampling gap for angiosperms (e.g., Li et al., 2021), infra-family levels remain less well covered. This is particularly true of the economically important, ecologically successful, morphologically diverse, species-rich legume family Fabaceae (Leguminosae), from which the plastomes of only 319 species in 184 genera have been deposited in the GenBank database1 (Accessed Sep. 09, 2021) thus far, of the more than 22,000 species and 770 genera in six subfamilies (LPWG, 2017, 2021).

Fabaceae is one of the most spectacular examples of diversification among flowering plants. Many legumes are not only ecologically dominant across major tropical and subtropical biomes (Schrire et al., 2005; DRYFLOR, 2016; LPWG, 2017) but symbiotically associated with nitrogen-fixing bacteria via root nodules (Sprent et al., 2017), amplifying the family’s importance for food security, sustainable agriculture, and ecosystem function (Food and Agriculture Organization2; LPWG, 2013; Yahara et al., 2013). The successful radiation of legumes is thought to be associated with plant defense strategies against herbivores, diverse, intimate ecological interactions involving ant-housing domatia and ant-feeding extrafloral nectaries (Janzen, 1966; Marazzi and Sanderson, 2010; Chomicki et al., 2015; Marazzi et al., 2019), an extraordinary range of floral forms (Tucker, 2003a) and pollination mechanisms (Arroyo, 1981). The family provides a wide diversity of secondary metabolites (alkaloids, flavonoids, lignans, tannins, terpenoids, benzofuranoids, and non-proteinogenic amino acids such as canavanine; Bisby et al., 1994; Kursar et al., 2009; Wink, 2013). The family is also an excellent model to reveal the patterns and processes of plastome structural evolution, since its taxa have undergone several dramatic rearrangements involving inversions of large blocks of sequence, contraction, loss, and regain of the inverted repeat (IR), gene/intron loss and repeat accumulation (e.g., Palmer et al., 1987; Lavin et al., 1990; Doyle et al., 1996; Jansen et al., 2008; Martin et al., 2014; Schwarz et al., 2015; Choi and Choi, 2017; Choi et al., 2019; Zhang et al., 2020; Charboneau et al., 2021; Lee et al., 2021). One of the most striking examples is a 50-kb inversion situated in the plastome large single-copy region (LSC) that is shared by the vast majority of subfamily Papilionoideae (papilionoids) (Doyle et al., 1996; Pennington et al., 2001; Wojciechowski et al., 2004; Cardoso et al., 2012a; LPWG, 2013). The 50 kb-inversion was long considered an unequivocal molecular synapomorphy for this clade, but recently at least three species of Sesbania Adans. were shown to have completely reverted the 50-kb sequence, resulting in essentially the same gene order as found in the earliest-diverging papilionoids (Lee et al., 2021).

In addition to providing a model for plastome structural rearrangements, Papilionoideae, the largest legume subfamily with an estimated 501 genera and 14,000 species (LPWG, 2021), also exhibits an impressive morphological diversity (Lewis et al., 2005; LPWG, 2017). For example, the early diversification of the Papilionoideae is marked by multiple evolutionary shifts in floral architecture (Ireland et al., 2000; Pennington et al., 2001; Cardoso et al., 2012a, 2013a,b; Klitgård et al., 2013; Ramos et al., 2016; Castellanos et al., 2017). Genera that were traditionally classified in the “primitive” tribes Sophoreae and Swartzieae (e.g., Cowan, 1981; Polhill, 1981a) are now phylogenetically scattered among the early-branching lineages of Papilionoideae. Their flowers are morphologically variable, from actinomorphic (radial or polysymmetric) with five undifferentiated petals to zygomorphic (bilateral or monosymmetric) with the petals poorly differentiated, but also absent or restricted to just the adaxial standard petal, and with free, often numerous stamens (Pennington et al., 2000; Cardoso et al., 2012a). The main phylogenetic outcome from early, single molecular locus Papilionoideae phylogenies (Doyle et al., 1997; Pennington et al., 2001; Wojciechowski et al., 2004) was that the papilionate-flowered ancestors experienced high evolutionary lability during the initial diversification of the subfamily, refuting the notion that non-papilionate flowers represented signatures of antiquity (e.g., Arroyo, 1981; Polhill, 1981a; Tucker and Douglas, 1994).

The extraordinary evolutionary and ecological success of legumes during their more than 60 million years of diversification history (Lavin et al., 2005; Bruneau et al., 2008; Koenen et al., 2021) may be related to the macroevolutionary stability of the highly specialized papilionate flower (Cardoso et al., 2013a), beneficial associations with nodulating symbiotic bacteria (Sprent et al., 2017), ant feeding via extrafloral nectaries (Marazzi et al., 2012, 2019) and/or secondary metabolite accumulation (Wink, 2013). Determining the emergence and influence of these phenomena in the evolutionary history of legumes requires a well-sampled and fully resolved phylogeny.

Previous phylogenetic studies using the coding sequences of the plastid rbcL (e.g., Doyle et al., 1997; Kajita et al., 2001) and matK genes (e.g., Wojciechowski et al., 2004; Cardoso et al., 2012a, 2013a, 2015; Ramos et al., 2016; Castellanos et al., 2017; LPWG, 2017; Queiroz et al., 2017), as well as a supermatrix approach (McMahon and Sanderson, 2006), sampled densely across the Papilionoideae, revealed many new clades, unexpected generic re-alignments, and placed several taxonomically orphan genera. However, these studies left the Papilionoideae backbone phylogeny and the placement of several evolutionary key genera largely unresolved. Such is the case with the small temperate North American genus Dermatophyllum Scheele. Apart from the Genistoid s.l. clade, Dermatophyllum is the only lineage of legumes known to accumulate a variety of quinolizidine alkaloids (QA), a class of alkaloids mainly distributed in these two papilionoid lineages with some phylogenetically scattered occurrences within other angiosperm families (Bisby et al., 1994; Kite and Pennington, 2003; Lee et al., 2013; Wink, 2013). Determining the sister relationship of this enigmatic, isolated genus with respect to the genistoids is fundamental to answer whether the evolution of such an important secondary metabolite in papilionoids had a single or multiple origins. Also unresolved is the polytomy within the large 50-kb-inversion clade involving the species-rich Dalbergioid s.l. and Non-Protein Amino Acid Accumulating (NPAAA) clades, as well as the florally heterogeneous Andira, Exostyleae, and Vataireoid clades. Previous phylogenies with broad taxon sampling resolved the earliest divergences of the Papilionoideae involving the swartzioids and the ADA clade (Angylocalyceae, Dipterygeae, and Amburaneae), albeit with low support values. These clades are often interchangeably shown as sister to the remainder of the subfamily (Cardoso et al., 2012a, 2013a, 2015; Ramos et al., 2016; Zhang et al., 2020; Zhao et al., 2021).

Recent advances in deep- and genus-level legume phylogenomics using both plastid and nuclear genes (Cannon et al., 2015; LPWG, 2017; Vatanparast et al., 2018; Ojeda et al., 2019; Koenen et al., 2020a,b, 2021; Oyebanji et al., 2020; Zhang et al., 2020; Zhao et al., 2021) have contributed to the resolution of some of the obscure deep relationships in the Papilionoideae. Still, many morphological key genera in the early-diverging lineages outside the agriculturally important NPAAA clade (Cardoso et al., 2012a, 2013a) have not been evaluated in any phylogenomic study. Among the legume phylogenomic analyses that more thoroughly investigated the Papilionoideae (Koenen et al., 2020a; Zhang et al., 2020; Zhao et al., 2021), only Zhao et al. (2021) have greatly improved taxon sampling within the early-diverging clades outside the NPAAA clade. In that study, more than 1,500 nuclear genes from transcriptome and genome assemblies of 217 papilionoid genera were explored. Previous plastome-inferred legume phylogenies have sampled just 32 (Koenen et al., 2020a) and 48 (Zhang et al., 2020) papilionoid genera. These studies all left important gaps, for example, regarding the placement of Dermatophyllum, either because no representative species were sampled or because statistical support for its sister relationship was relatively low. Thus, we still lack a clearer, more focused picture of Papilionoideae evolutionary history from comprehensively sampled plastome data.

In this study, we used a denser taxon sampling of plastome sequence data with a focus on the early-branching Papilionoideae. Despite the fact that plastome data may produce conflicting topologies (e.g., Gonçalves et al., 2019; Walker et al., 2019; Koenen et al., 2020a), examining them thoroughly can help to clarify the phylogenetic signal in concatenated analyses of a large number of plastid loci (Gonçalves et al., 2019; Zhang et al., 2020). For example, the inclusion of protein-coding genes only, removal of ambiguously aligned regions, or the use of coalescent methods (e.g., Gonçalves et al., 2019) may be helpful to arrive at phylogenetically accurate topologies. Here, we aimed to explore phylogenetic signals across diverse plastome-derived datasets together with cross-genomic comparisons of mitochondrial and nuclear data and shed light on key evolutionary relationships related to a number of questions that have persisted in the subfamily.

Materials and Methods

Plant Material Sampling

Our taxon sampling strategy was designed to cover the major papilionoid clades recognized by Cardoso et al. (2012a, 2013a), especially those in need of additional investigation. Thirty-eight taxa were collected from Brazil and the United States of America. Seeds of Phaseolus acutifolius A. Gray were acquired from the Desert Legume Program (University of Arizona, Tucson) and grown in the University of Texas at Austin (UT-Austin) greenhouse. Leaf samples were collected, flash-frozen in liquid nitrogen and stored at −80°C before DNA isolation except the silica-gel dried leaf tissues of Astragalus canadensis var. brevidens (Gand.) Barneby. Voucher specimens are housed at the Billie L. Turner Plant Resource Center at UT-Austin (TEX-LL), the Arizona State University (ASU), the Universidade Estadual de Feira de Santana (HUEFS), and the Jardim Botânico do Rio de Janeiro (RB) herbaria. Collection information for a total of 39 taxa is available in Supplementary Table 1.

Next-Generation Sequencing and Plastid Genome Completion

Total genomic DNAs were extracted by the hexadecyltrimethylammonium bromide protocol, described in Doyle and Doyle (1987), or using the NucleoSpin Plant II, Mini Kit for DNA from plants (Macherey-Nagel, Düren, Germany). Next-generation sequencing (NGS) reads (2 × 150 bp) from ca. 300 bp insert libraries for 38 taxa were generated by the Beijing Genomics Institute (BGI; Shenzhen, China) using the MGI DNBseq™ platform (MGI Tech Co., Shenzhen, China). Total genomic DNA of A. canadensis var. brevidens was sequenced using the Illumina NextSeq 500 at the Arizona State University Genomics Facility (2 × 151 bp; ca. 300 bp insert size). Plastomes were assembled and annotated by the methods described in Choi et al. (2019).

Plastid Phylogenomics

In addition to the complete plastomes generated in this study, we obtained the plastome sequence from at least a single species of each papilionoid genus currently available in GenBank (see text footnote 1) (Accessed May 24, 2021) or Dryad3 (Accessed Aug. 24, 2020). In total, 244 taxa, with 237 papilionoids (174 genera) and seven non-papilionoids were collected as ingroup and outgroup, respectively (Supplementary Table 2). We produced nine sequence alignments as data sets for maximum likelihood (ML) analysis and two gene tree sets for species tree estimation based on the multispecies coalescent (MSC) approach (Table 1 and Supplementary Table 3).

TABLE 1.

Descriptions for 11 datasets used for plastid phylogenomic analyses in this study.

Tree number Dataset name Description
1 WP Whole plastid genome alignment
2 WP_nogap WP dataset with all indels removed
3 WP_gb WP dataset with only conserved sequence blocks
4 CD_nt Concatenation of 77 CDS nucleotide alignments
5 CD_nt_gb CD_nt dataset with only conserved sequence blocks
6 CD_nt_dg CD_nt dataset with degenerated synonymous substitutions sites
7 CD_aa Concatenation of 77 CDS amino acid alignments
8 CD_aa_gb CD_aa dataset with only conserved sequence blocks
9 CD_matK_cb Combined dataset of CD_nt and matK-only dataset
10 77 gene trees All 77 individual gene trees
11 26 gene trees 26 gene trees with a mean bootstrap value higher than 85

Datasets 1—3 (WP, WP_nogap, and WP_gb; see Table 1) were prepared based on whole plastome alignments. For the legume taxa with two copies of the large IR, one copy was deleted. To align plastomes with different gene order, Mauve 2.3.1 (Darling et al., 2010) was used to detect locally colinear blocks (LCBs) relative to Cercis canadensis L. (KF856619). Sequence blocks of all taxa were rearranged to be colinear with C. canadensis. To avoid introducing non-orthologous sequences during the rearrangements, non-genic edges of LCBs were deleted. The intergenic regions that coincide with the end points of the 50-kb inversion and an adjacent gene encoding rps16, pseudogenized in many papilionoids (Schwarz et al., 2015; Choi and Choi, 2017; Lee et al., 2021), were deleted from all taxa. Complete plastomes were aligned by MAFFT v7.450 (Katoh and Standley, 2013) in Geneious Prime 2021.0.34 using default options. This raw alignment is designated as WP (dataset 1). The WP dataset was further refined by two different strategies. The WP_nogap (dataset 2) was prepared by deleting all indel regions using the “mask alignment” tool in Geneious Prime, as described in Orton et al. (2021). The WP_gb (dataset 3) was prepared from the WP dataset using Gblocks 0.91b (Castresana, 2000) using default options allowing selection of conserved sequence blocks without indel regions exclusively.

Datasets 4—8 (CD_nt, CD_nt_gb, CD_nt_dg, CD_aa, and CD_aa_gb; see Table 1) were prepared based on the protein-coding sequences (CDS) for 77 genes (Supplementary Table 4) extracted from each plastome. The 77 CDS included putatively pseudogenized genes with few mutations that could represent heteroplasmic variations (Park et al., 2020). In plastomes where the intactness of rps16 was questionable, the pseudogene was unsampled. A single nucleotide (A, T, C, G, or N) was introduced or deleted to fit the reading frame when a gene included a premature stop codon due to indel polymorphism. Nucleotide sequences of each CDS dataset were aligned by MAFFT using the translation align option available in Geneious Prime. Finally, poorly aligned regions of each CDS alignment were manually adjusted or deleted. A concatenated aligned matrix of 77 CDS sequences was produced using the R package catGenes5. This concatenated nucleotide alignment (dataset 4) was designated CD_nt. This data set was further refined by Gblocks using default options resulting in CD_nt_gb (dataset 5). To avoid the problem of nucleotide compositional heterogeneity among taxa, the synonymous substitution sites in the CD_nt (dataset 4) were degenerated using Degen ver. 1.4.6 (Regier et al., 2010) and the result was designated CD_nt_dg (dataset 6). Datasets 7 and 8 were prepared based on translated amino acid sequences (AA) of CD_nt (dataset 4): Dataset 7 (CD_aa) was aligned without trimming and dataset 8 (CD_aa_gb) was trimmed using Gblocks with default options.

To more broadly examine the phylogenetic relationships of papilionoid legumes we included taxa that were not sampled in our plastome datasets but were previously sequenced for matK. We produced dataset 9 (CD_matK_cb; see Table 1) by combining CD_nt (dataset 4) and a matK-only dataset of 534 nucleotide sequences. For this alignment the taxa lacking plastome-scale data were represented by the matK coding region while their 76 CDS were coded as missing data. In total, dataset 9 included 771 papilionoids (478 genera) (Supplementary Table 5). The dataset included matK sequences used in LPWG (2017) and excluded duplications and short sequence fragments (< 800 bp).

For each dataset (1—9), ML analysis was conducted using IQ-TREE 1.6.12 (Nguyen et al., 2015) with 1,000 bootstrap (BS) replications, and a best-fit nucleotide substitution model was automatically selected based on the Bayesian information criterion.

Datasets 10 and 11 (see Table 1) consisted of sets of individual ML trees based on each CDS for species tree estimation with MSC approach using ASTRAL-III (Zhang et al., 2018). The ML analyses for each CDS were conducted as described above. To decrease error rate, branches with BS value lower than 10 were contracted as suggested in Zhang et al. (2018). Dataset 10 included all 77 individual trees. The individual trees with strong phylogenetic signals (average BS value > 85) were selected and this set of trees was designated dataset 11. The local posterior probability (LPP) and quartet score (QS) were calculated for all nodes.

Visualization and editing of phylogenetic trees were conducted using Interactive Tree Of Life (iTOL; Letunic and Bork, 2021), ggtree package (Yu et al., 2017), and custom R scripts.

Results

In total, we included plastomes from 244 species (Supplementary Table 2), 39 of which were sequenced for this study (Supplementary Table 1), and matK sequences from 534 species (Supplementary Table 5). This taxon sampling scheme includes representatives of all 22 main, early-diverging lineages (19 in plastome-only sampling, without Amphimas Pierre ex Harms, Aldina Endl., and Clathrotropis macrocarpa Ducke), as recognized by Cardoso et al. (2012a), and spans a diversity of taxa exhibiting radially symmetrical to bilaterally symmetrical floral architecture (Figure 1).

FIGURE 1.

FIGURE 1

Broad variation in floral architecture across the papilionoid legumes (Papilionoideae). The selected taxa represent genera from the early-branching lineages (A–M) outside the NPAAA (non-protein amino acid accumulating) clade. (A) Swartzia acutifolia (Swartzieae); (B) Castanospermum australe [Angylocalyceae, ADA clade (Angylocalyceae, Dipterygeae, and Amburaneae)]; (C) Dipteryx magnifica (Dipterygeae, ADA clade); (D) Myrocarpus fastigiatus (Amburaneae, ADA clade); (E) Pickeringia montana (Cladrastis clade); (F) Harpalyce brasiliana (Brongniartieae, Genistoid s.l.); (G) Aldina latifolia (Andira clade); (H) Exostyles venusta (Exostyleae); (I) Dermatophyllum secundiflorum (unresolved); (J) Dalea mollis (Amorpheae); (K) Centrolobium microchaete (Dalbergieae); (L) Leptolobium dasycarpum (Leptolobieae, Genistoid s.l.); (M) Lupinus sericeus (Genisteae, Genistoid s.l.); (N) Robinia neomexicana (Robinieae, NPAAA); (O) Astragalus mollissimus (Astragalean, NPAAA); Photos by Domingos Cardoso (A–D,F–H,K–M) and Martin F. Wojciechowski (E,I,J,N,O).

In total, 39 papilionoid plastomes were assembled and included in our analysis (Supplementary Table 1). Plastomes varied from 123,013 (Astragalus canadensis var. brevidens) to 168,148 bp (Myrospermum sousanum A.Delgado & M. C.Johnst.) in unit length. The read depth varied from 409 to 6,213×. Together with previously sequenced plastid data, eight plastome-only and one combined (matK-only + CD_nt) datasets were prepared for our ML analyses of papilionoid legumes (Tables 1 and Supplementary Table 3). The nine datasets for ML analyses varied in the values for the alignment length, proportion of gaps/ambiguities, and number of parsimony-informative sites (PIS) (Figure 2A). The highest values of the alignment length and PIS were shown in the WP dataset with 313,335 bp (60.3% gaps/ambiguities) and 101,445 sites, respectively, values substantially higher than the rest of the datasets. The original concatenation of all CDS (CD_nt) showed higher values for the alignment length and number of PIS, but a trimmed version (CD_nt_gb) was similar to plastome alignments without indel regions (WP_nogap) or with only conserved sequence blocks (WP_gb). Both modification of CD_nt (CD_nt_dg) and the combination of CD_nt with the matK-only dataset (CD_matK_cb) resulted in changes in the number of the PIS and gaps/ambiguity (Supplementary Table 3).

FIGURE 2.

FIGURE 2

Comparison of datasets used for phylogenetic analyses and their resolution in resulting trees. Descriptions for these datasets provided in Table 1. (A) A scatter plot showing differences in number of parsimony-informative sites (PIS) and gap/ambiguity among datasets for maximum likelihood (ML) analysis. (B) A scatter plot showing PIS and mean bootstrap values (BS) of 77 individual gene trees that were used for the multispecies coalescent (MSC) approach. The top seven protein-coding regions (including matK) with PIS (>1,000) are indicated. (C) A comparison of tree topology resolutions. Values within the histograms indicate the number of nodes. Maximal support values are BS = 100 or local posterior probability = 1.0.

The datasets based on translated AAs showed the lowest values for alignment length and number of PIS. Datasets 10 and 11 (77 gene trees and 26 gene trees) employed for the MSC approach were prepared based on ML analyses for each individual CDS alignment (Supplementary Table 4). Length of each alignment varied from 90 bp (petN) to 11,175 (ycf1) bp. The lowest and highest mean BS values were shown from psbN (45.7) and ycf1 (97.9) (Figure 2B and Supplementary Table 4). There were 26 trees with average BS > 85 (i.e., dataset 11).

Based on 11 datasets, 11 trees were inferred and compared (Figure 2C). Tree 1 (WP dataset) resolved the most papilionoid nodes (223 out of 236, 93.5%). Trees 2—5 resolved a similar number of nodes with maximal support (BS = 100) (215—217, 91.1—91.9%). Trees 6—8 showed the lowest number of resolved nodes with maximal support (196—207, 83.1—87.7%) among ML trees. The many additional papilionoid nodes in Tree 9 (CDS and matK-only) resolved 580 out of 770 nodes (75.3%) with maximal support. Multispecies coalescence trees (10 and 11) resolved a similar number of nodes [192 (81.4%) and 193 (81.8%), respectively] with maximal support (LPP = 1.0).

Topological concordance with regard to 10 main papilionoid lineages (Swartzieae, ADA, Cladrastis, Andira, Exostyleae, Dermatophyllum, Vataireoid, Genistoid s.l., Dalbergioid s.l., and Baphieae + NPAAA clades) was assessed across the 11 trees (Figure 3). Topologies showing the five lineages Swartzieae, ADA, Cladrastis, Andira, and Exostyleae clades as successive sister groups to a monophyletic group comprising the remaining five papilionoid lineages (Dermatophyllum, Vataireoid, Genistoid s.l., Dalbergioid s.l., and Baphieae + NPAAA clades) were consistently retrieved from all trees. The three nodes related to the five remaining lineages resolved differently. The most frequently recovered relationship subdivided the group into two clades of (Dalbergioid s.l., Baphieae + NPAAA) and (Dermatophyllum, (Vataireoid, Genistoid s.l.)). Support and alternative topologies for these five lineages are shown in Figure 4. While all 11 trees resolved relationships, only the nodes inferred in Tree 1 had 100% BS support. Among the seven ML trees (except Tree 1) based on plastome datasets, Trees 7 and 8 from translated AA alignments showed slightly better support for the interrelationships of the five lineages. The MSC-based Tree 10 had only low LPP and QS for the main topology of the five lineages. The selection of 26 genes with high mean BS values as the input dataset (dataset 11) showed slightly increased LPP and QS for the monophyly of the five remaining lineages but does not substantially affect resolution of relationships among these lineages (Tree 11). To show specific relationships within and among the 10 key papilionoid lineages, Tree 5 along with support values of Trees 1 and 11 are presented as the main tree (Figure 5). In total, the 50 papilionoid genera that need further attention in plastid phylogenomics are highlighted in Figure 6 based on the most comprehensive taxon sampling (CD_matK_cb). These 50 genera are either still phylogenetically enigmatic, are morphological key genera, or represent monospecific lineages.

FIGURE 3.

FIGURE 3

The consensus of 11 phylogenetic trees of papilionoid legumes based on analyses of plastome-scale and matK sequence data. The number of topologies in which this clade was recovered in 11 trees is indicated at each node. Branches leading to taxa known to produce quinolizidine alkaloids are highlighted in brown. NPAAA: non-proteinogenic amino acid accumulating, ADA: Angylocalyceae, Dipterygeae, and Amburaneae, kb: kilobase, s.l.: sensu lato.

FIGURE 4.

FIGURE 4

Differences in topology and support values for the 11 trees with respect to resolving the phylogenetic relationships among the Vataireoid, Dermatophyllum, Genistoid s.l., Dalbergioid s.l., and Baphieae + NPAAA (non-proteinogenic amino acid accumulating) clades. Nodes with maximal bootstrap support (100) are denoted as black dots, while other nodes are marked with blue dots with values in nine trees (Trees 1–9). In Trees 10 and 11, based on the multispecies coalescence approach, pie charts representing quartet score values of topologies (black = main, orange = first alternative, blue = second alternative) and local posterior probability are presented at nodes.

FIGURE 5.

FIGURE 5

A maximum-likelihood tree based on a concatenated dataset of 77 plastome coding regions with Gblocks trimming (See Table 1). The figure is separated into five panels with a focus on the major clades. (A) Swartzieae, ADA (Angylocalyceae, Dipterygeae, and Amburaneae), Cladrastis, Andira, Exostyleae, Vataireoid, and Genistoid s.l. clades; (B) Dalbergioid s.l. clade; (C) Baphieae, Hypocalypteae, Mirbelioid, Indigofereae, Millettioid (part 1) clades; (D) Millettioid (part 2) clade and (E) Robinioid clade and inverted repeat-lacking clade (IRLC). Supporting values of three trees are visualized at nodes as rectangles. The rectangles are subdivided into three columns that correspond to Tree 1 (left, WP), Tree 5 (middle, CD_nt_gb), and Tree 11 (right, 26 gene trees). Colors in each column represent support values. Black [bootstrap support value (BS) = 100 or local posterior probability (LPP) = 1], Blue [BS (90–99) or LPP (0.76–0.99)], Orange [BS (80–89) or LPP (0.50–0.75)], Bluish green (BS < 80 or LPP 0.50). Scale indicates number of nucleotide substitutions per site. The colors for clades are not related to their support values. We generally followed the color scheme for early-branching papilionoid clades of Figures 2–5 in Cardoso et al. (2013a).

FIGURE 6.

FIGURE 6

A genus-level phylogeny of Papilionoideae as produced by the maximum likelihood analysis of the dataset 9 (CD_matK_cb), which combined genera with fully sequenced plastomes and the expanded sampling of genera with matK sequence data only. Black nodes are supported by bootstrap values (BS) = 100 and blue nodes by BS = 90–99, while orange nodes are supported by BS = 80–89 and cyan nodes by BS = 70–79. Genera highlighted in black have already been fully sequenced for plastomes. The 50 genera highlighted in cyan are either still phylogenetically enigmatic, are morphological key genera, or represent monospecific lineages, all of which should be the focus of future plastome research. The colors for clades are not related to their support values. We followed the color scheme for papilionoid clades of Figure 1 in Cardoso et al. (2013a). NPAAA: non-proteinogenic amino acid accumulating, ADA: Angylocalyceae, Dipterygeae, and Amburaneae.

Discussion

Plastid Phylogenomic Signals Across the Papilionoid Legumes

Maximum Likelihood analysis of the whole plastome alignment (Figure 4) resolved all deep relationships among major early-diverging clades of the Papilionoideae with maximal support values. This approach for phylogenetic analyses using the entire plastome as a single locus was rigorously tested in Poaceae (Duvall et al., 2020; Orton et al., 2021). Duvall et al. (2020) demonstrated how tree topology and support values could vary according to the level of gap removal from the whole plastome alignment, and recommended excluding any gapped positions from machine-generated alignments. Apart from concerns as to whether it is reasonable to recognize the plastome as a single locus (Gonçalves et al., 2020; Doyle, 2021), one of the main and practical criticisms about whole plastome alignment without further filtration is that most gaps are located in AT-rich and low complexity regions, which are prone to high-level length polymorphisms in simple sequence repeats and falsely aligned non-orthologous sequences (Duvall et al., 2020). The much longer whole plastome alignment of 244 legumes (313,335 bp) compared to the alignment lacking indels (64,099 bp) is not surprising when considering the dynamic gene, intron, intergenic, and repeat content across papilionoids (e.g., Milligan et al., 1989; Doyle et al., 1995; Bailey et al., 1997; Cai et al., 2008; Jansen et al., 2008; Sabir et al., 2014; Sveinsson and Cronk, 2014; Schwarz et al., 2015; Choi et al., 2019, 2020a; Jin et al., 2019; Oyebanji et al., 2020; Lee et al., 2021). The WP alignment extended well beyond the length of a typical plastome to accommodate for poorly aligned sequences. While the lability of papilionoid plastome intergenic regions may provide a much greater number of PIS, their inclusion can introduce error in the alignment and infer spurious relationships.

In the remaining ML trees based on datasets without highly divergent intergenic regions, three controversial nodes connected by very short branches were revealed within a monophyletic group that included the Vataireoid, Dermatophyllum, Genistoid s.l., Dalbergioid s.l., and Baphieae + NPAAA clades (see Figures 35). The trees that were derived from analyses of translated AA sequences showed slightly better support. Even though MSC methods were suggested as an alternative way to explore plastome data (Gonçalves et al., 2019, 2020), this approach also left problematic nodes with low support values and showed a high frequency of alternative tree topologies. The source of conflicting signals among plastid loci can be diverse, including systematic and stochastic errors (Walker et al., 2019). The vast majority of plastid genes are short (see Figure 2B and Supplementary Table 4) and often yield poorly resolved individual gene trees as input for the MSC approach (Walker et al., 2019; Doyle, 2021). The species tree estimated from the trees based on the highly informative 26 CDS loci that exhibited strong phylogenetic signals also showed a similar level of alternative topologies (Figure 4). A likely explanation for this phylogenetic uncertainty is that the time interval between the divergences was too short to achieve resolution even with such phylogenetically informative loci. This makes it challenging to avoid stochastic errors in a zone of rapid divergence at deep nodes, such as the case within the 50-kb inversion clade of the papilionoid phylogeny. Thus, comparison with a phylogeny based on other genomic data (e.g., Zhao et al., 2021) would be desirable.

So far, several plastid phylogenomic analyses at several taxonomic levels of Fabaceae have been conducted (LPWG, 2017; Koenen et al., 2020a; Oyebanji et al., 2020; Zhang et al., 2020; Aecyo et al., 2021), but none of these has paid particular attention to increasing taxonomic diversity to fully resolve the deep nodes of Papilionoideae. Our taxon sampling included one to several representatives of most evolutionary key lineages except for a few small groups (e.g., Cardoso et al., 2013a; LPWG, 2017). In the following sections, phylogenetic relationships of papilionoid lineages will be discussed in a systematic context with other relevant molecular and non-molecular evidence. Overall, our analyses have provided well-resolved and strongly supported phylogenies, which are in agreement with topologies retrieved from previous plastid sequence-based phylogenies, and essentially validate the resolving power of matK sequences alone for reconstructing papilionoid phylogenies (e.g., Hu et al., 2000; Lavin et al., 2001; McMahon and Hufford, 2004; Cardoso et al., 2013a). Compared to a recent nuclear phylogenomic study (Zhao et al., 2021), there were several notable relationships with various nodal support values that were not retrieved from plastid data alone. Sexual recombination is restricted or absent in wild-type plastomes, suggesting possible involvement of reticulation/introgression events in the formation of some ancestral papilionoid lineages.

Swartzieae, ADA, and Cladrastis clades

The phylogenetic analyses in this study consistently resolved the early-diverging Swartzieae, ADA (Angylocalyceae, Dipterygeae, and Amburaneae), and Cladrastis clades, which collectively form a grade to the remaining Papilionoideae. Most genera within this grade were previously treated as members of tribes Sophoreae and Swartzieae by Polhill (1981b, 1994). In that sense, the grade comprising the Swartzieae, ADA, and Cladrastis clades, is reminiscent of Polhill’s (1981b, Figure 4; 1994) traditional “supposed relationship of tribes,” which recognized several groups of caesalpinioid-like genera among the largely tropical and subtropical Swartzieae and Sophoreae as transitional between subfamilies Caesalpinioideae and Papilionoideae. This group was considered to form a basal assemblage of the subfamily leading to the more derived papilionoid tribes. The taxonomic composition of this grade was partly recognized from early surveys for the absence/presence of a 50-kb inversion in legume plastomes (Doyle et al., 1996) and nodulation ability (Sprent, 2000).

Molecular phylogenetic analyses to date (Cardoso et al., 2012a, 2015; Ramos et al., 2016; LPWG, 2017; Zhang et al., 2020; Zhao et al., 2021) have reached a consensus that there are three main monophyletic groups that diverged early at the base of Papilionoideae. Early plastid sequence-based phylogenetic studies (Doyle et al., 1996, 1997; Pennington et al., 2001; Wojciechowski et al., 2004) suggested that groups of certain genera from the Swartzieae, Sophoreae, and Dipterygeae of Polhill’s traditional classification (Polhill, 1981b, 1994) were outside of the 50-kb inversion clade in Papilionoideae. A Swartzieae-derived monophyletic group as the first diverging lineage sister to all Papilionoideae was initially recognized by Pennington et al. (2001), while Doyle et al. (1997) and Wojciechowski et al. (2004) showed groups of additional genera from these tribes among the early-diverging clades. A sister relationship between a clade comprising the genera Cladrastis Raf., Pickeringia Nutt. ex Torr. & A.Gray, and Styphnolobium Schott, and the more derived 50-kb inversion clade, was initially revealed by Wojciechowski et al. (2004). Subsequent studies by Cardoso et al. (2012a; 2013a; 2015), Wojciechowski (2013a), and Duan et al. (2019), with more comprehensive taxon sampling based upon analyses of both plastid matK and nuclear rDNA regions, further clarified the composition of the Swartzieae, ADA, and Cladrastis clades. Relationships among these three clades remain uncertain. Recent plastome-based phylogenetic analyses employing multi-locus (Ramos et al., 2016) and complete plastome (supported by all 11 trees, see Figure 3; Zhang et al., 2020) datasets retrieved the (Swartzieae,(ADA,(Cladrastis,50-kb inversion clade))) topology while a recent nuclear phylogenomic study recovered the (ADA(Swartzieae,(Cladrastis,50-kb inversion clade))) topology (Zhao et al., 2021).

Given these results, the phylogenetic relationship of the Swartzieae and ADA clades remain equivocal. Both clades are highly diverse in floral structure (Figure 1) where almost all constituent genera display a particular floral architecture (e.g., Tucker, 1993, 2003b; Cardoso et al., 2013a; Leite et al., 2014, 2015; Prenner et al., 2015; Sinjushin, 2018). In addition to being distinguished by bilaterally or radially symmetrical apetalous flowers with a profusion of free stamens (e.g., Cordyla, Swartzia Schreb.; Figure 1A) or by single-petal flowers (e.g., Amburana Schwacke and Taub., Swartzia, Trischidium Tul.), representatives in these clades may have tiny radially symmetrical flowers measuring up to 3 mm long (Myrocarpus Allem., Figure 1D) to hardy, bat-pollinated flowers larger than 10 cm long (e.g., Alexa Moq.). Also, these clades include representatives with bilaterally symmetrical papilionate flowers that are unique in the Papilionoideae, either because of the wing-like, enlarged calyx lobes (e.g., Dipteryx Schreb., Pterodon Vogel; Figure 1C) or the fimbriate-glandular wing petals (e.g., Petaladenium Ducke). Such dramatic floral diversity in the early stages of the diversification history of the Papilionoideae indicates that resolving the relationships of the Swartzieae and ADA clades is fundamental, not just to better reconstruct the most likely ancestral form of the flower of the subfamily, but also to understand why their evolutionary history has been marked by profound deviations in flower morphology, from the typical papilionate architecture (as exhibited by those in Figures 1F,I–K,M–O).

Meso-Papilionoideae (50-kb Inversion Clade)

Large inversions in the plastome were first identified in legumes (Palmer and Thompson, 1982; Bruneau et al., 1990) and subsequently in other plant groups (e.g., Asteraceae, Jansen and Palmer, 1987; Poaceae, Doyle et al., 1992). Early evidence for a monophyletic group of papilionoids marked by this apparently unique, derived synapomorphy (Doyle et al., 1996) has been confirmed by subsequent molecular phylogenetic studies (e.g., Doyle et al., 1997; Pennington et al., 2001; Wojciechowski et al., 2004). The presence/absence of the 50-kb inversion in sampled taxa (Doyle et al., 1996) not only clarified the positions of several generic groups of both Sophoreae and Swartzieae at the base of papilionoids or among more derived groups (Polhill, 1981b), but also revealed the existence of a large monophyletic group of higher papilionoids defined by this molecular synapomorphy that includes the vast majority of the subfamily. Indeed, this large core papilionoid group comprises 98% of Papilionoideae based upon current species diversity estimates (LPWG, 2021). With this fact in mind, and consistent with the principles of phylogenetic nomenclature (Cantino and de Queiroz, 2006; Cantino et al., 2007), we suggest the adoption of “Meso-Papilionoideae” for the 50-kb inversion clade as defined previously by Wojciechowski (2013b). The 50-kb inversion serves as a synapomorphy for this clade, supported by nuclear (Zhao et al., 2021), mitochondrial (Choi et al., 2021), and plastid (supported by all 11 trees, see Figure 3; Zhang et al., 2020) phylogenomic studies.

The recent discovery of three species of Sesbania, nested within the Hologalegina clade (Robinioid + IRLC; Figure 5; Wojciechowski et al., 2000), with plastomes that appear to have completely reverted the 50-kb inversion (Lee et al., 2021) does not diminish the phylogenetic significance of this large inversion early in papilionoids. The phylogenetic distribution of this plastome structure within Sesbania or in its close relatives in the Robinioid clade remains elusive.

Meso-Papilionoideae includes the species-rich Genistoid s.l., Dalbergioid s.l., and Baphieae + NPAAA clades, the small Andira, Exostyleae and Vataireoid clades, as well as the phylogenetically unresolved genera Dermatophyllum and the African Amphimas. Relationships among and within these clades will be discussed below except for Amphimas, from which complete plastome data is not yet available, but which has been placed in the 50-kb inversion clade based on matK sequence data (Figure 6; Cardoso et al., 2015).

Andira and Exostyleae Clades

In our study, the Andira and Exostyleae clades were consistently (supported by all 11 trees, see Figure 3) resolved as successive sister groups to the monophyletic group comprising the remainder of the Meso-Papilionoideae (maximal support values from all three trees, see Figure 5A). The plastid phylogenomic study of Zhang et al. (2020), which included one taxon from each clade, also inferred the same topology. However, nuclear phylogenomic analysis (Zhao et al., 2021) placed Andira inermis (W. Wright) Kunth ex DC. sister to a clade of Baphieae + NPAAA, while Exostyleae was resolved as the first diverging lineage within Meso-Papilionoideae with maximum support values in all seven coalescent trees. The Andira clade (sensu Ramos et al., 2016), which includes three genera (Aldina, Andira Lam., and Hymenolobium Benth.), is expanded from Cardoso et al. (2012a, 2013a) to include Aldina based primarily on plastid data with some morphological similarities and shared ecological and distributional preference for Amazonian tropical rain forests. The Andira clade contains a mixture of taxa with radially symmetrical flowers with a profusion of exposed free stamens (Aldina) (see Figure 1D) and truly papilionate floral architecture (Andira and Hymenolobium). This was interpreted as an additional example of the common, interlaced phylogenetic distribution of the heterogeneous floral morphologies in early-diverging clades of Papilionoideae (Ramos et al., 2016). Whether the unexpected, more derived placement of Andira in Zhao et al. (2021) represents a signature of incongruence between plastid and nuclear genomes deserves further investigation.

Genistoid s.l. and Dermatophyllum Relationships, and Implications for the Evolutionary Distribution of Quinolizidine Alkaloids in Legumes

The sister relationship between the Genistoid s.l. and Dermatophyllum lineages was retrieved from three ML analyses, where the highest support value (BS = 93) was found in Tree 8 (Figure 4). However, this relationship is equivocal because of conflicting alternative ML topologies based on analyses of the datasets derived from complete plastomes. In addition, the MSC tree estimations returned weakly supported relationships for these groups and inferred a high frequency of alternative topologies among individual CDS phylogenies.

Among legumes, QA production is restricted to most genistoid genera and to Dermatophyllum (Bisby et al., 1994; Kite and Pennington, 2003; Lee et al., 2013; Wink, 2013). This chemotaxonomic evidence has led to alternative hypotheses with respect to the controversial phylogenetic position of Dermatophyllum. One hypothesis suggests that the production of QAs is a derived characteristic defining a clade that includes the most recent common ancestor (MRCA) of the Genistoid s.l. and Dermatophyllum lineages, and thus forms strong evidence for a sister relationship of these two taxa (e.g., Cardoso et al., 2012a, 2013a; Kite et al., 2013; Lee et al., 2013). Alternatively, the genetic capacity for QA biosynthesis was established in the very early diversification of papilionoids but the genes remain silent (or were lost) in many descendant lineages (e.g., Wink et al., 2010). The Genistoid s.l. clade, as delimited in the phylogenetic analysis of matK sequences by Wojciechowski et al. (2004), included all known QA-accumulating genera except Dermatophyllum (as syn. Calia Terán and Berland.) and Ormosia Jacks. The Genistoid s.l. clade was subsequently expanded to accommodate the genus Ormosia and related genera of the Ormosieae, which has been consistently resolved as sister to the rest of the clade (Cardoso et al., 2012a, 2013a), suggesting that the only remaining QA-producing genus Dermatophyllum was likely sister to the Genistoid s.l. clade. The scenario of Wink et al. (2010) was based on an rbcL gene phylogeny that showed an early divergence of Dermatophyllum within a monophyletic group containing Myroxylon L. (Amburaneae) that does not produce QAs and is not a member of Meso-Papilionoideae. However, our study (Figures 35) and others (Zhang et al., 2020) with broad taxon sampling of papilionoid legumes resolved Dermatophyllum as a more derived lineage in the Meso-Papilionoideae. Wink et al. (2010) suggested that QA production was one of many herbivore defense strategies rendering it dispensable, as exemplified by loss or extreme reduction of QA production in some species within genistoid genera such as Crotalaria L., Lotononis (DC.) Eckl. and Zeyh., Ulex L., Calicotome Link, and Spartocytisus Webb and Berthel.

In light of our results, modified ancient gain and loss scenarios regarding QA production can be postulated according to the alternative phylogenetic positions of Dermatophyllum (supported by BS value > 90) relative to the Genistoid s.l., Vataireoid, Dalbergioid s.l., and Baphieae + NPAAA clades. One scenario posits that the QA biosynthesis pathway was present at least since the MRCA of all five lineages but was lost after the successive divergence of the Dermatophyllum and the Genistoid s.l. clades (Figure 4, Tree 1). Alternatively, QA production is ancestral in only three lineages (Dermatophyllum, Genistoid s.l. and Vataireoid clades) but was lost early in the diversification of the Vataireoid clade. Our study does not confidently identify the position of Dermatophyllum, but it reduces the number of possible solutions by resolving other early-diverging relationships within Meso-Papilionoideae. Nevertheless, the Dermatophyllum-Genistoid s.l. sister hypothesis remains the most parsimonious (single-step, excluding recent loss events in the genistoids) scenario. In order to shed light on the evolutionary pathway(s) leading to the biosynthesis of QAs, resolving the relationship of Dermatophyllum with respect to the remaining genistoids is a high priority in future nuclear-based phylogenomics of the early-diverging Papilionoideae.

Vataireoid Clade

The exclusively neotropical Vataireoid clade includes just 28 species, and is another example of an early-diverging papilionoid lineage outside the large NPAAA clade with heterogeneous floral morphology (Cardoso et al., 2013b), similar to the Andira clade (sensu Ramos et al., 2016). Monophyly of this morphologically heterogeneous group was highly supported in phylogenetic analyses based on single plastid genes or a few combined nuclear and plastid genes (Cardoso et al., 2012a, 2013a,b) to genome-scale data (Figure 5A; Zhao et al., 2021). There was no strongly supported sister relationship hypothesis for this clade except Zhao et al. (2021). The close affinity of the vataireoids to Dalbergieae based on a single-seeded samaroid fruit morphology was suggested by Lima (1980; 1982; 1990), but molecular systematic studies have not supported this relationship. The phylogenetic position of the clade is inconclusive based on our analyses, but a nuclear phylogenomic study (Zhao et al., 2021) resolved the clade as sister to a monophyletic group (including Dalbergioid s.l., Genistoid s.l., Andira, and Baphieae + NPAAA clades) that includes all known taxa with the ability to nodulate within Meso-Papilionoideae (Sprent et al., 2017; Ardley and Sprent, 2021).

Dalbergioid s.l. Clade

The Dalbergioid s.l. clade includes monophyletic groups of the pantropical Dalbergieae and the predominantly North American temperate Amorpheae (Wojciechowski et al., 2004). In our study, this clade most frequently grouped with the NPAAA + Baphieae clade, albeit with alternative positions and low support values (Figures 3, 4). A sister group relationship of the Andira clade with the Dalbergioid s.l. clade was once weakly supported (Wojciechowski et al., 2004), but further support for that relationship is lacking. A nuclear phylogenomic study (Zhao et al., 2021) grouped the Dalbergioid s.l. clade with the Genistoid s.l. clade but with weak support (highest BS value from seven coalescent trees was 77). As such, the sister relationship of the dalbergioids within Meso-Papilionoideae is still unclear.

While both nuclear phylogenomic (Zhao et al., 2021) and our plastid phylogenomic (Figures 35) analyses have failed to clarify the relationships of the Dalbergioid s.l. clade, they concur with previous comprehensively sampled matK phylogenies in strongly supporting the monophyly and interrelationships of three main subclades within the dalbergioids: the Adesmia, Dalbergia, and Pterocarpus clades (Lavin et al., 2001; Wojciechowski et al., 2004; Cardoso et al., 2012a, 2013a). Additionally, by resolving the radially symmetrical flowered genera Riedeliella Harms and Acosmium Schott in isolated positions within Dalbergieae, our analyses based on complete plastomes demonstrated yet again how the independent evolution of non-papilionate floral architecture has been so recurrent among the early-branching papilionoids (Lavin et al., 2001; Cardoso et al., 2012a,b).

Baphieae and Non-protein Amino Acid Accumulating Clades

A single origin of non-protein amino acid biosynthesis in papilionoids was hypothesized by Bell (1981) because canavanine, a close analog of arginine, was almost mutually exclusive of alkaloid accumulation, and restricted to 16 closely related papilionoid tribes. A monophyletic group containing all known taxa producing non-protein amino acids, the NPAAA clade (Cardoso et al., 2012b), is supported by all recent phylogenetic analyses (LPWG, 2017; Koenen et al., 2020b, 2021; Zhang et al., 2020; Choi et al., 2021; Zhao et al., 2021). This clade includes several of the most species-rich (ca. 11,000 spp.) and rapidly evolving legume lineages (Cardoso et al., 2013a; Koenen et al., 2013), as well as the largest flowering plant genus Astragalus L. (ca. 3000 spp.) and the most agriculturally important culinary pulses such as common beans, peas, lentils, and soybeans (Gepts et al., 2005). The clade also stands out among the main Papilionoideae lineages with respect to the almost universal evolutionary canalization of the specialized, strongly bilaterally symmetrical papilionate floral architecture. Thus far, there are no examples of reversion to radial floral symmetry or profusion of free stamens as found in earlier branching clades (Lewis et al., 2005; Cardoso et al., 2013a).

The NPAAA clade is sister to the small (c. 60 spp.), predominantly African Baphieae clade, which contains genera with both radially and bilaterally symmetrical flowers. Evolutionary transitions between polysymmetry and monosymmetry in floral architecture are common in angiosperms (Endress, 1996, 1999), such that the emergence of the core eudicots, a clade with more than 200,000 species, coincides with the fixation of pentamerous flowers, whorl organization, and a perianth often differentiated into sepals and petals (Specht and Bartlett, 2009). Such changes in floral architecture may have led to the recurrent evolution of bilateral symmetry (zygomorphy) from polysymmetric-flowered ancestors across angiosperms. This shift coincides with the co-diversification of megadiverse families and specialized pollinating insects (Cardinal and Danforth, 2013). These factors may have contributed to increased speciation rates (Sargent, 2004; Davis et al., 2014) and monosymmetry as a key innovation during the radiation of angiosperms (Sanderson and Donoghue, 1994; Bond and Opell, 1998). Likewise, the evolutionary maintenance of the papilionate flower may have sparked the explosive diversification of the large NPAAA clade, which includes almost 70% of both specific and generic diversity in Papilionoideae (Cardoso et al., 2012a, 2013b).

Here, two large clades within the NPAAA clade have been confirmed in agreement with all previous molecular phylogenetic studies (Figures 5C–E; Wojciechowski et al., 2004; Cardoso et al., 2013a; Queiroz et al., 2015; LPWG, 2017; Zhao et al., 2021): the Indigofereae + Millettioid and Hologalegina (Robinioid + IRLC) clades. Most of the relationships among the main lineages within the Millettioid clade concur with previous studies (Figures 5C,D; Queiroz et al., 2015; Zhao et al., 2021), however, a discordance was observed in the sister relationship of Mucuna Adans. (Phaseoleae) and Desmodieae. Mucuna was supported as the sister to Desmodieae in plastid phylogenies (Doyle et al., 2000; Kajita et al., 2001; Stefanović et al., 2009; Jin et al., 2019). This sister relationship is also marked by the shared loss of the plastid rpl2 intron, which appears to have been lost only a few times in legumes (Doyle et al., 1995; Bailey et al., 1997; Lai et al., 1997; Jin et al., 2019). While our analyses of complete plastome data showed Mucuna as sister (maximal support values from all three trees, see Figure 5D), or part of a sister clade (including Craspedolobium Harms and Haymondia A.N.Egan & B.Pan) to Desmodieae (Figure 6), Zhao et al.’s (2021) nuclear phylogenomic analyses resolved Mucuna as sister to Cochlianthus Benth., combining a Mucuna-Cochlianthus clade as sister to Apios Fabr., whereas Craspedolobium and Haymondia were successive sister groups of Desmodieae. Lackey (1981) considered the genera Apios, Cochlianthus, and Mucuna to be members of an artificial amalgamation that defined the Phaseoleae subtribe Erythrininae, but considered Apios and Cochlianthus to be a natural grouping or even congeneric, making the sister relationship between Cochlianthus and Mucuna largely unexpected. However, the key feature of Haymondia (i.e., an explosive flower tripping mechanism involving the upward movement of the reproductive column that remains fully reflexed from the wing and keel petals, and touching the standard petal; Egan and Pan, 2015) can also be found in some genera of Phaseoleae (Apios, Cochlianthus, and Mucuna) and Desmodieae (except the Lespedeza group) within the millettioids (Schrire et al., 2009). Due to its phylogenetically scattered distribution that includes Indigofereae and Medicago L. within the NPAAA clade, as well as in the genistoids Genista L., Harpalyce Moc. and Sessé ex DC., Spartium L., and Ulex L., and the dalbergioid Brya P. Browne (Arroyo, 1981; Lavin et al., 2001; Schrire et al., 2009), convergent evolution of this pollination-related morphological feature cannot be ruled out. The presence of a cryptic, shared morphological feature between two distantly related clades, including Apios and Desmodieae and the highly supported, yet conflicting positions of Mucuna in nuclear and plastid phylogenies supports putative ancestral hybridization in these groups (Egan et al., 2016).

The Hologalegina clade was first designated by Wojciechowski et al. (2000) to comprise the traditionally recognized “temperate herbaceous tribes” of Polhill (1981b, 1994). This large monophyletic group includes two main, well supported subclades; Robinoid (Sesbanieae, Loteae, and Robinieae) and the IRLC. Even though some relationships within the Robinioid clade and IRLC need further clarification, the monophyly of each group has been consistently supported in plastid-based phylogenies (maximal support values from all three trees, see Figure 5E; Wojciechowski et al., 2004; Lavin et al., 2005). The nuclear phylogenomics study of Zhao et al. (2021) did not support the monophyly of the Robinioids and resolved Sesbanieae + Loteae as sister to the IRLC, albeit with low support values (4 out of 7 trees > BS 70). Indeed, Loteae was once regarded as closer to members of the IRLC than to Robinieae, based on similarities in vegetative morphology, growth habit, and distribution (Polhill, 1981b, 1994). Similarly, the early nuclear rDNA-ITS-based phylogeny of Hu et al. (2000) and recent mitogenome-based phylogeny of Choi et al. (2021), with limited taxon sampling (without Sesbanieae), resolves Lotus L. as sister to the IRLC.

Within the Vicioid clade (IRLC), five genera of Trifolieae (Medicago, Melilotus (L.) Mill., Ononis L., Trifolium L., Trigonella L., excluding Parochetus Buch.-Ham. ex D.Don), have been resolved as paraphyletic, placing Trifolium as sister to the monophyletic Fabeae in plastid phylogenies (Wojciechowski et al., 2000, 2004; Steele and Wojciechowski, 2003; Schaefer et al., 2012). Our study also showed the sister relationship of Trifolium to Fabeae (maximal support values from two of three trees, see Figure 5E). However, the nuclear phylogenomic study resolved Trifolium as sister to a clade composed of Medicago, Melilotus, and Trigonella with maximal support values from all seven coalescent trees (Zhao et al., 2021). Similarly, but with very limited sampling, a mitochondrial phylogenomic study (Choi et al., 2021) also supported a monophyletic group based on Medicago and Trifolium as sister to Vicia (Fabeae). This grouping of four Trifolieae genera (Medicago, Melilotus, Trifolium, and Trigonella) is also marked by the loss of mitochondrial rps1 due to its functional, intracellular gene transfer to the nuclear genome (Hazle and Bonen, 2007; Choi et al., 2020b). In the case of Trifolieae, in which the nuclear and mitochondrial data produce topologies that agree closely with a classification based on gross morphology while the plastid data does not, a plastid capture scenario is worth considering.

Phylogenies based on plastid and mitochondrial genomes can produce different topologies for a lineage because of biparental organelle inheritance together with cytonuclear incompatibility, as exemplified in the IRLC genus Pisum L. (Bogdanova et al., 2021). Many IRLC taxa share the potential for biparental plastid inheritance (Corriveau and Coleman, 1988; Zhang et al., 2003). That a limited number of taxa have been tested for potential paternal transmission of organelle genomes and that the mitogenome-based phylogeny showed an alternative topology with regard to Lotus and Trifolium (Choi et al., 2021) warrants further investigation on the mode of organelle inheritance in Hologalegina. Within the IRLC, plastid capture scenarios have been suggested for various lineages based on conflicting results between nuclear rDNA and plastid data-based phylogenies (e.g., Ellison et al., 2006; Duan et al., 2021). Discordant taxon sampling across the three genomic datasets and the topological conflicts, which may have originated from the complex evolutionary behavior of repetitive nuclear rDNA (Álvarez and Wendel, 2003), hinder the detailed examination of those scenarios.

Future Research Directions in Plastid Phylogenomics of the Papilionoid Legumes

Our study provides well resolved and strongly supported plastid phylogenies for the papilionoid legumes. Nevertheless, there remain nodes lacking phylogenetic resolution. The inclusion of non-CDS regions of plastomes could add more phylogenetic signal but at the potential expense of introducing a great amount of homoplasy, which could either decrease phylogenetic support or introduce bias toward unreliable relationships. There are further limitations on the use of plastome data despite that they have served well as a fundamental source of phylogenetic information. This information has helped us to address complex evolutionary issues in the papilionoid legumes, such as the origin of their extremely diverse floral architectures and secondary metabolites. The ancient, rapid diversification and reticulation/introgression detected in plastome-based Papilionoideae phylogenies warrant the incorporation of nuclear and mitochondrial data from concordant, broad taxon sampling. Perception and reconciling of conflicting phylogenetic signals within and between the three genomes are in their primary stage, and more complex evolutionary patterns may be revealed from the total genomic loci across all levels of subdivision in the Papilionoideae. In Hologalegina in particular, where the transition of maternal to biparental plastid inheritance as well as reticulation/introgression(s) likely occurred, more conflicts are expected.

Closing the sampling gaps within and between matK-only and complete plastome datasets could shed light on future sampling directions in Papilionoideae phylogenomics. Since Cardoso et al. (2013a) estimated the number of early-branching papilionoid genera that were not sampled for matK sequence data (52 out of the 196), there has been great progress in filling the sampling gap by virtue of individual locus or whole plastome sequences (e.g., Cardoso et al., 2015, 2017; Swanepoel et al., 2015; LPWG, 2017; Queiroz et al., 2017; Zhang et al., 2020). Our matK-only + plastome combined data set (Supplementary Table 5) includes 39 genera that were previously unsampled by Cardoso et al. (2013a), however, 12 genera (one is synonymized, see Supplementary Table 6) still remain to be fully sequenced. Moreover, many genera are still not well resolved in phylogenies based on matK-only data, such as the African tree genus Amphimas (Cardoso et al., 2012a, 2013a, 2015). Further plastome research should focus on examining the positions of phylogenetically unresolved genera, as well as those that are morphological key groups or represent monospecific lineages (Figure 6). Next generation sequencing applied to museomics (Bakker et al., 2016; Johnson et al., 2019) has proven to be a feasible approach to collecting genomic data from taxa for which fresh tissues are unavailable.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: GenBank with accessions MZ725323, OL672849-OL672886. All sequence alignments and trees that were generated from this study are submitted to Dryad (https://doi.org/10.5061/dryad.sf7m0cg7m).

Author Contributions

MW, TR, RJ, DC, and I-SC: conception and experimental design. MW, RJ, and TR: acquisition of funds. DC, RJ, TR, I-SC, HL, LQ, and MW: field collections. I-SC, CL, and TR: nucleic acid isolation. CL and I-SC: plastome assembly and annotation. I-SC, DC, and MW: DNA alignments and phylogenetic analyses and data interpretation. I-SC and DC: production of figures and tables. I-SC, DC, MW, TR, and RJ: writing and revision of the manuscript. All authors read and commented on the manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We thank the TEX-LL, HUEFS, and RB herbaria for voucher deposition, the Desert Legume Program at the University of Arizona for seeds, and Alessandra Schnadelbach, Hedina Basile, Henrique Batalha Filho, and Paula Ristow for their lab support at UFBA. We also thank George Yatskievych (TEX/LL) for arranging a formal Material Transfer Agreement (Decree number 8772) under the SisGen Cadastro RDC6BE9, which facilitated research activities between our institutions. Finally, we wish to thank the two reviewers and the editor for their many helpful comments.

Footnotes

Funding

This work was supported by grants from the National Science Foundation (DEB-1853010 and DEB-1853024) to MW, RJ, and TR, the Texas Ecological Laboratory (EcoLab) to RJ, TR, and I-SC, the Sidney F. and Doris Blake Professorship in Systematic Botany to RJ, the CNPq (Research Productivity Fellowship no. 308244/2018-4; Universal no. 422325/2018-0), FAPESB (Universal no. APP0037/2016), and UFBA PROQUAD program to DC.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.823190/full#supplementary-material

References

  1. Aecyo P., Marques A., Huettel B., Silva A., Esposito T., Ribeiro E., et al. (2021). Plastome evolution in the Caesalpinia group (Leguminosae) and its application in phylogenomics and populations genetics. Planta 254:27. 10.1007/s00425-021-03655-8 [DOI] [PubMed] [Google Scholar]
  2. Álvarez I., Wendel J. F. (2003). Ribosomal ITS sequences and plant phylogenetic inference. Mol. Phylogenet. Evol. 29 417–434. 10.1016/S1055-7903(03)00208-2 [DOI] [PubMed] [Google Scholar]
  3. Antonelli A., Clarkson J. J., Kainulainen K., Maurin O., Brewer G. E., Davis A. P., et al. (2021). Settling a family feud: a high-level phylogenomic framework for the Gentianales based on 353 nuclear genes and partial plastomes. Am. J. Bot. 108 1143–1165. 10.1002/ajb2.1697 [DOI] [PubMed] [Google Scholar]
  4. Ardley J., Sprent J. (2021). Evolution and biogeography of actinorhizal plants and legumes: a comparison. J. Ecol. 109 1098–1121. 10.1111/1365-2745.13600 [DOI] [Google Scholar]
  5. Arroyo M. T. K. (1981). “Breeding systems and pollination biology in Leguminosae,” in Advances in Legume Systematics, Part2, eds Polhill R. M., Raven P. H. (Richmond, UK: Royal Botanic Gardens, Kew; ), 723–770. [Google Scholar]
  6. Bailey C. D., Doyle J. J., Kajita T., Nemoto T., Ohashi H. (1997). The chloroplast rpl2 intron and ORF184 as phylogenetic markers in the legume tribe Desmodieae. Syst. Bot. 22 133–138. 10.2307/2419681 [DOI] [Google Scholar]
  7. Bakker F. T., Lei D., Yu J., Mohammadin S., Wei Z., Van de Kerke S., et al. (2016). Herbarium genomics: plastome sequence assembly from a range of herbarium specimens using an iterative organelle genome assembly pipeline. Biol. J. Linn. Soc. 117 33–43. 10.1111/bij.12642 [DOI] [Google Scholar]
  8. Bell E. A. (1981). “Non-protein amino acids in the Leguminosae,” in Advances in Legume Systematics, Part 2, eds Polhill R. M., Raven P. H. (Richmond, UK: Royal Botanic Gardens, Kew; ), 489–499. [Google Scholar]
  9. Bisby F. A., Buckingham J., Harborne J. B. (eds) (1994). Phytochemical Dictionary of the Leguminosae. London: Chapman and Hall. [Google Scholar]
  10. Bogdanova V. S., Shatskaya N. V., Mglinets A. V., Kosterin O. E., Vasiliev G. V. (2021). Discordant evolution of organellar genomes in peas (Pisum L.). Mol. Phylogenet. Evol. 160:107136. 10.1016/j.ympev.2021.107136 [DOI] [PubMed] [Google Scholar]
  11. Bond J. E., Opell B. D. (1998). Testing adaptive radiation and key innovation hypotheses in spiders. Evolution 52 403–414. 10.1111/j.1558-5646.1998.tb01641.x [DOI] [PubMed] [Google Scholar]
  12. Bruneau A., Doyle J. J., Palmer J. D. (1990). A chloroplast DNA inversion as a subtribal character in the Phaseoleae (Leguminosae). Syst. Bot. 15 378–386. 10.2307/2419351 [DOI] [Google Scholar]
  13. Bruneau A., Mercure M., Lewis G. P., Herendeen P. S. (2008). Phylogenetic patterns and diversification in the caesalpinioid legumes. Botany 86 697–718. 10.1139/b08-058 [DOI] [Google Scholar]
  14. Cai Z., Guisinger M., Kim H.-G., Ruck E., Blazier J. C., McMurtry V., et al. (2008). Extensive reorganization of the plastid genome of Trifolium subterraneum (Fabaceae) is associated with numerous repeated sequences and novel DNA insertions. J. Mol. Evol. 67 696–704. 10.1007/s00239-008-9180-7 [DOI] [PubMed] [Google Scholar]
  15. Cannon S. B., McKain M. R., Harkess A., Nelson M. N., Dash S., Deyholos M. K., et al. (2015). Multiple polyploidy events in the early radiation of nodulating and nonnodulating legumes. Mol. Biol. Evol. 32 193–210. 10.1093/molbev/msu296 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cantino P. D., de Queiroz K. (2006). PhyloCode: a Phylogenetic Code of Biological Nomenclature. Version 3a. Available online at: http://www.phylocode.org (accessed September 15, 2021). [Google Scholar]
  17. Cantino P. D., Doyle J. A., Graham S. W., Judd W. S., Olmstead R. G., Soltis D. E., et al. (2007). Towards a phylogenetic nomenclature of Tracheophyta. Taxon 56 E1–E44. 10.1002/tax.563001 [DOI] [Google Scholar]
  18. Cardinal S., Danforth B. N. (2013). Bees diversified in the age of eudicots. Proc. Royal Soc. B 280:20122686. 10.1098/rspb.2012.2686 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Cardoso D., Harris D. J., Wieringa J. J., São-Mateus W. M. B., Batalha-Filho H., Torke B. M., et al. (2017). A molecular-dated phylogeny and biogeography of the monotypic legume genus Haplormosia, a missing African branch of the otherwise American-Australian Brongniartieae clade. Mol. Phylogenet. Evol. 107 431–442. 10.1016/j.ympev.2016.12.012 [DOI] [PubMed] [Google Scholar]
  20. Cardoso D., Lima H. C., Rodrigues R. S., Queiroz L. P., Pennington R. T., Lavin M. (2012a). The realignment of Acosmium sensu stricto with the Dalbergioid clade (Leguminosae: papilionoideae) reveals a proneness for independent evolution of radial floral symmetry among early-branching papilionoid legumes. Taxon 61 1057–1073. 10.1002/tax.615011 [DOI] [Google Scholar]
  21. Cardoso D., Queiroz L. P., Pennington R. T., Lima H. C., Fonty E., Wojciechowski M. F., et al. (2012b). Revisiting the phylogeny of papilionoid legumes: new insights from comprehensively sampled early-branching lineages. Am. J. Bot. 99 1991–2013. 10.3732/ajb.1200380 [DOI] [PubMed] [Google Scholar]
  22. Cardoso D., Pennington R. T., Queiroz L. P., Boatwright J. S., Van Wyk B.-E., Wojciechowski M. F., et al. (2013a). Reconstructing the deep-branching relationships of the papilionoid legumes. S. Afr. J. Bot. 89 58–75. 10.1016/j.sajb.2013.05.001 [DOI] [Google Scholar]
  23. Cardoso D., Queiroz L. P., Lima H. C., Suganuma E., Van den Berg C., Lavin M. (2013b). A molecular phylogeny of the vataireoid legumes underscores floral evolvability that is general to many early-branching papilionoid lineages. Am. J. Bot. 100 403–421. 10.3732/ajb.1200276 [DOI] [PubMed] [Google Scholar]
  24. Cardoso D., São-Mateus W. M., Cruz D. T., Zartman C. E., Komura D. L., Kite G., et al. (2015). Filling in the gaps of the papilionoid legume phylogeny: the enigmatic Amazonian genus Petaladenium is a new branch of the early-diverging Amburaneae clade. Mol. Phylogenet. Evol. 84 112–124. 10.1016/j.ympev.2014.12.015 [DOI] [PubMed] [Google Scholar]
  25. Castellanos C., Steeves R., Lewis G. P., Bruneau A. (2017). A settled sub-family for the orphan tree: the phylogenetic position of the endemic Colombian genus Orphanodendron in the Leguminosae. Brittonia 69 62–70. 10.1007/s12228-016-9451-3 [DOI] [Google Scholar]
  26. Castresana J. (2000). Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17 540–552. 10.1093/oxfordjournals.molbev.a026334 [DOI] [PubMed] [Google Scholar]
  27. Charboneau J. L. M., Cronn R. C., Liston A., Wojciechowski M. F., Sanderson M. J. (2021). Plastome structural evolution and homoplastic inversions in Neo-Astragalus (Fabaceae). Genome. Biol. Evol. 13:evab215. 10.1093/gbe/evab215 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Choi I.-S., Choi B.-H. (2017). The distinct plastid genome structure of Maackia fauriei (Fabaceae: papilionoideae) and its systematic implications for genistoids and tribe Sophoreae. PLoS One 12:e0173766. 10.1371/journal.pone.0173766 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Choi I.-S., Jansen R., Ruhlman T. (2019). Lost and found: return of the inverted repeat in the legume clade defined by its absence. Genome Biol. Evol. 11 1321–1333. 10.1093/gbe/evz076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Choi I.-S., Jansen R., Ruhlman T. (2020a). Caught in the act: variation in plastid genome inverted repeat expansion within and between populations of Medicago minima. Ecol. Evol. 10 12129–12137. 10.1002/ece3.6839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Choi I.-S., Ruhlman T. A., Jansen R. K. (2020b). Comparative mitogenome analysis of the genus Trifolium reveals independent gene fission of ccmFn and intracellular gene transfers in Fabaceae. Int. J. Mol. Sci. 21:1959. 10.3390/ijms21061959 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Choi I.-S., Wojciechowski M. F., Ruhlman T. A., Jansen R. K. (2021). In and out: evolution of viral sequences in the mitochondrial genomes of legumes (Fabaceae). Mol. Phylogenet. Evol. 163:107236. 10.1016/j.ympev.2021.107236 [DOI] [PubMed] [Google Scholar]
  33. Chomicki G., Ward P. S., Renner S. S. (2015). Macroevolutionary assembly of ant/plant symbioses: Pseudomyrmex ants and their ant-housing plants in the Neotropics. Proc. Royal Soc. B 282:20152200. 10.1098/rspb.2015.2200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Corriveau J. L., Coleman A. W. (1988). Rapid screening method to detect potential biparental inheritance of plastid DNA and results for over 200 angiosperm species. Am. J. Bot. 75 1443–1458. 10.1002/j.1537-2197.1988.tb11219.x [DOI] [Google Scholar]
  35. Cowan R. S. (1981). “Swartzieae,” in Advances in Legume Systematics, Part 1, eds Polhill R. M., Raven P. H. (Richmond, UK: Royal Botanic Gardens, Kew; ), 209–212. [Google Scholar]
  36. Darling A. E., Mau B., Perna N. T. (2010). progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One 5:e11147. 10.1371/journal.pone.0011147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Davis C. C., Schaefer H., Xi Z., Baum D. A., Donoghue M. J., Harmon L. J. (2014). Long-term morphological stasis maintained by a plant-pollinator mutualism. Proc. Natl. Acad. Sci. U. S. A. 111 5914–5919. 10.1073/pnas.1403157111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Doyle J. J. (2021). Defining coalescent genes: theory meets practice in organelle phylogenomics. Syst. Biol. syab053. 10.1093/sysbio/syab053 [DOI] [PubMed] [Google Scholar]
  39. Doyle J. J., Chappill J. A., Bailey C. D., Kajita T. (2000). “Towards a comprehensive phylogeny of legumes: evidence from rbcL sequences and non-molecular data,” in Advances in Legume Systematics, Part 9, eds Herendeen P. S., Bruneau A. (Richmond, UK: Royal Botanic Gardens, Kew; ), 1–20. [Google Scholar]
  40. Doyle J. J., Davis J. I., Soreng R. J., Garvin D., Anderson M. J. (1992). Chloroplast DNA inversions and the origin of the grass family (Poaceae). Proc. Natl. Acad. Sci. U.S.A. 89 7722–7726. 10.1073/pnas.89.16.7722 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Doyle J. J., Doyle J. L. (1987). A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 19 11–15. [Google Scholar]
  42. Doyle J. J., Doyle J. L., Ballenger J. A., Dickson E. E., Kajita T., Ohashi H. (1997). A phylogeny of the chloroplast gene rbcL in the Leguminosae: taxonomic correlations and insights into the evolution of nodulation. Am. J. Bot. 84 541–554. 10.2307/2446030 [DOI] [PubMed] [Google Scholar]
  43. Doyle J. J., Doyle J. L., Ballenger J. A., Palmer J. D. (1996). The distribution and phylogenetic significance of a 50-kb chloroplast DNA inversion in the flowering plant family Leguminosae. Mol. Phylogenet. Evol. 5 429–438. 10.1006/mpev.1996.0038 [DOI] [PubMed] [Google Scholar]
  44. Doyle J. J., Doyle J. L., Palmer J. D. (1995). Multiple independent losses of two genes and one intron from legume chloroplast genomes. Syst. Bot. 20 272–294. 10.2307/2419496 [DOI] [Google Scholar]
  45. DRYFLOR (2016). Plant diversity patterns in neotropical dry forests and their conservation implications. Science 353 1383–1387. 10.1126/science.aaf5080 [DOI] [PubMed] [Google Scholar]
  46. Duan L., Harris A., Ye W., Deng S.-W., Song Z.-Q., Chen H.-F., et al. (2019). Untangling the taxonomy of the Cladrastis clade (Leguminosae: papilionoideae) by integrating phylogenetics and ecological evidence. Taxon 68 1189–1203. 10.1002/tax.12155 [DOI] [Google Scholar]
  47. Duan L., Li S.-J., Su C., Sirichamorn Y., Han L.-N., Ye W., et al. (2021). Phylogenomic framework of the IRLC legumes (Leguminosae subfamily Papilionoideae) and intercontinental biogeography of tribe Wisterieae. Mol. Phylogenet. Evol. 163:107235. 10.1016/j.ympev.2021.107235 [DOI] [PubMed] [Google Scholar]
  48. Duvall M. R., Burke S. V., Clark D. C. (2020). Plastome phylogenomics of Poaceae: alternate topologies depend on alignment gaps. Bot. J. Linnean Soc. 192 9–20. 10.1093/botlinnean/boz060 [DOI] [Google Scholar]
  49. Egan A. N., Pan B. (2015). Resolution of polyphyly in Pueraria (Leguminosae, Papilionoideae): the creation of two new genera, Haymondia and Toxicopueraria, the resurrection of Neustanthus, and a new combination in Teyleria. Phytotaxa 218 201–226. 10.11646/PHYTOTAXA.218.3.1 [DOI] [Google Scholar]
  50. Egan A. N., Vatanparast M., Cagle W. (2016). Parsing polyphyletic Pueraria: delimiting distinct evolutionary lineages through phylogeny. Mol. Phylogenet. Evol. 104 44–59. 10.1016/j.ympev.2016.08.001 [DOI] [PubMed] [Google Scholar]
  51. Ellison N. W., Liston A., Steiner J. J., Williams W. M., Taylor N. L. (2006). Molecular phylogenetics of the clover genus (Trifolium—Leguminosae). Mol. Phylogenet. Evol. 39 688–705. 10.1016/j.ympev.2006.01.004 [DOI] [PubMed] [Google Scholar]
  52. Endress P. K. (1996). Diversity and Evolutionary Biology of Tropical Flowers. Cambridge, UK: Cambridge University Press. [Google Scholar]
  53. Endress P. K. (1999). Symmetry in flowers: diversity and evolution. Int. J. Plant Sci. 160 S3–S23. 10.1086/314211 [DOI] [PubMed] [Google Scholar]
  54. Gepts P., Beavis W. D., Brummer E. C., Shoemaker R. C., Stalker H. T., Weeden N. F., et al. (2005). Legumes as a model plant family. Genomics for food and feed report of the cross-legume advances through genomics conference. Plant Physiol. 137 1228–1235. 10.1104/pp.105.060871 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Gonçalves D. J. P., Jansen R. K., Ruhlman T. A., Mandel J. R. (2020). Under the rug: abandoning persistent misconceptions that obfuscate organelle evolution. Mol. Phylogenet. Evol. 151:106903. 10.1016/j.ympev.2020.106903 [DOI] [PubMed] [Google Scholar]
  56. Gonçalves D. J. P., Simpson B. B., Ortiz E. M., Shimizu G. H., Jansen R. K. (2019). Incongruence between gene trees and species trees and phylogenetic signal variation in plastid genes. Mol. Phylogenet. Evol. 138 219–232. 10.1016/j.ympev.2019.05.022 [DOI] [PubMed] [Google Scholar]
  57. Goremykin V. V., Nikiforova S. V., Cavalieri D., Pindo M., Lockhart P. (2015). The root of flowering plants and total evidence. Syst. Biol. 64 879–891. 10.1093/sysbio/syv028 [DOI] [PubMed] [Google Scholar]
  58. Hazle T., Bonen L. (2007). Status of genes encoding the mitochondrial S1 ribosomal protein in closely-related legumes. Gene 405 108–116. 10.1016/j.gene.2007.09.019 [DOI] [PubMed] [Google Scholar]
  59. Hu J.-M., Lavin M., Wojciechowski M. F., Sanderson M. J. (2000). Phylogenetic systematics of the tribe Millettieae (Leguminosae) based on chloroplast trnK/matK sequences and its implications for evolutionary patterns in Papilionoideae. Am. J. Bot. 87 418–430. 10.2307/2656638 [DOI] [PubMed] [Google Scholar]
  60. Ireland H., Pennington R. T., Preston J. (2000). “Molecular systematics of the Swartzieae,” in Advances in Legume Systematics, Part 9, eds Herendeen P. S., Bruneau A. (Richmond, UK: Royal Botanic Gardens, Kew; ), 217–231. [Google Scholar]
  61. Jansen R. K., Palmer J. D. (1987). A chloroplast DNA inversion marks an ancient evolutionary split in the sunflower family (Asteraceae). Proc. Natl. Acad. Sci. U.S.A. 84 5818–5822. 10.1073/pnas.84.16.5818 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Jansen R. K., Wojciechowski M. F., Sanniyasi E., Lee S.-B., Daniell H. (2008). Complete plastid genome sequence of the chickpea Cicer arietinum and the phylogenetic distribution of rps12 and clpP intron losses among legumes (Leguminosae). Mol. Phylogenet. Evol. 48 1204–1217. 10.1016/j.ympev.2008.06.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Janzen D. H. (1966). Coevolution of mutualism between ants and acacias in Central America. Evolution 20 249–275. 10.2307/2406628 [DOI] [PubMed] [Google Scholar]
  64. Jin D.-P., Choi I.-S., Choi B.-H. (2019). Plastid genome evolution in tribe Desmodieae (Fabaceae: papilionoideae). PLoS One 14:e0218743. 10.1371/journal.pone.0218743 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Johnson M. G., Pokorny L., Dodsworth S., Botigué L. R., Cowan R. S., Devault A., et al. (2019). A universal probe set for targeted sequencing of 353 nuclear genes from any flowering plant designed using k-medoids clustering. Syst. Biol. 68 594–606. 10.1093/sysbio/syy086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Kajita T., Ohashi H., Tateishi Y., Bailey C. D., Doyle J. J. (2001). rbcL and legume phylogeny, with particular reference to Phaseoleae, Millettieae, and allies. Syst. Bot. 26 515–536. 10.1043/0363-6445-26.3.515 [DOI] [Google Scholar]
  67. Katoh K., Standley D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30 772–780. 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Kite G. C., Pennington R. T. (2003). Quinolizidine alkaloid status of Styphnolobium and Cladrastis (Leguminosae). Biochem. Syst. Ecol. 31 1409–1416. 10.1016/S0305-1978(03)00118-2 [DOI] [Google Scholar]
  69. Kite G. C., Veitch N. C., Soto-Hernández M., Lewis G. P. (2013). Highly glycosylated flavonols at the genistoid boundary and the systematic position of Dermatophyllum. S. Afr. J. Bot. 89 181–187. 10.1016/j.sajb.2013.06.003 [DOI] [Google Scholar]
  70. Klitgård B. B., Forest F., Booth T. J., Saslis-Lagoudakis C. H. (2013). A detailed investigation of the Pterocarpus clade (Leguminosae: dalbergieae): etaballia with radially symmetrical flowers is nested within the papilionoid-flowered Pterocarpus. S. Afr. J. Bot. 89 128–142. 10.1016/j.sajb.2013.07.006 [DOI] [Google Scholar]
  71. Koenen E. J. M., de Vos J. M., Atchison G. W., Simon M. F., Schrire B. D., de Souza E. R., et al. (2013). Exploring the tempo of species diversification in legumes. S. Afr. J. Bot. 89 19–30. 10.1016/j.sajb.2013.07.005 [DOI] [Google Scholar]
  72. Koenen E. J. M., Kidner C., Souza E. R., Simon M. F., Iganci J. R., Nicholls J. A., et al. (2020a). Hybrid capture of 964 nuclear genes resolves evolutionary relationships in the mimosoid legumes and reveals the polytomous origins of a large pantropical radiation. Am. J. Bot. 107 1710–1735. 10.1002/ajb2.1568 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Koenen E. J. M., Ojeda D. I., Steeves R., Migliore J., Bakker F. T., Wieringa J. J., et al. (2020b). Large-scale genomic sequence data resolve the deepest divergences in the legume phylogeny and support a near-simultaneous evolutionary origin of all six subfamilies. New Phytol. 225 1355–1369. 10.1111/nph.16290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Koenen E. J. M., Ojeda D. I., Bakker F. T., Wieringa J. J., Kidner C., Hardy O. J., et al. (2021). The origin of the legumes is a complex paleopolyploid phylogenomic tangle closely associated with the Cretaceous–Paleogene (K–Pg) mass extinction event. Syst. Biol. 70 508–526. 10.1093/sysbio/syaa041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Kursar T. A., Dexter K. G., Lokvam J., Pennington R. T., Richardson J. E., Weber M. G., et al. (2009). The evolution of antiherbivore defenses and their contribution to species coexistence in the tropical tree genus Inga. Proc. Natl. Acad. Sci. U.S.A. 106 18073–18078. 10.1073/pnas.0904786106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Lackey J. A. (1981). “Phaseoleae,” in Advances in Legume Systematics, Part 1, eds Polhill R. M., Raven P. H. (Richmond, UK: Royal Botanic Gardens, Kew; ), 301–327. [Google Scholar]
  77. Lai M., Sceppa J., Ballenger J. A., Doyle J. J., Wunderlin R. P. (1997). Polymorphism for the presence of the rpl2 intron in chloroplast genomes of Bauhinia (Leguminosae). Syst. Bot. 22 519–528. 10.2307/2419825 [DOI] [Google Scholar]
  78. Lavin M., Doyle J. J., Palmer J. D. (1990). Evolutionary significance of the loss of the chloroplast-DNA inverted repeat in the Leguminosae subfamily Papilionoideae. Evolution 44 390–402. 10.1111/j.1558-5646.1990.tb05207.x [DOI] [PubMed] [Google Scholar]
  79. Lavin M., Herendeen P. S., Wojciechowski M. F. (2005). Evolutionary rates analysis of Leguminosae implicates a rapid diversification of lineages during the Tertiary. Syst. Biol. 54 575–594. 10.1080/10635150590947131 [DOI] [PubMed] [Google Scholar]
  80. Lavin M., Pennington R. T., Klitgaard B. B., Sprent J. I., Lima H. C., Gasson P. E. (2001). The dalbergioid legumes (Fabaceae): delimitation of a pantropical monophyletic clade. Am. J. Bot. 88 503–533. 10.2307/2657116 [DOI] [PubMed] [Google Scholar]
  81. Lee C., Choi I.-S., Cardoso D., Lima H. C., Queiroz L. P., Wojciechowski M. F., et al. (2021). The chicken or the egg? Plastome evolution and an independent loss of the inverted repeat in papilionoid legumes. Plant J. 107 861–875. 10.1111/tpj.15351 [DOI] [PubMed] [Google Scholar]
  82. Lee S. T., Cook D., Molyneux R. J., Davis T. Z., Gardner D. R. (2013). Alkaloid profiles of Dermatophyllum arizonicum, Dermatophyllum gypsophilum, Dermatophyllum secundiflorum, Styphnolobium affine, and Styphnolobium japonicum previously classified as Sophora species. Biochem. Syst. Ecol. 49 87–93. 10.1016/j.bse.2013.03.018 [DOI] [Google Scholar]
  83. LPWG [Legume Phylogeny Working Group] (2013). Legume phylogeny and classification in the 21st century: progress, prospects and lessons for other species-rich clades. Taxon 62 217–248. 10.12705/622.8 [DOI] [Google Scholar]
  84. LPWG [Legume Phylogeny Working Group] (2017). A new subfamily classification of the Leguminosae based on a taxonomically comprehensive phylogeny. Taxon 66 44–77. 10.12705/661.3 [DOI] [Google Scholar]
  85. LPWG [Legume Phylogeny Working Group] (2021). The World Checklist of Vascular Plants (WCVP): Fabaceae, vers. June 2021, ed R. Govaerts. Available online at: http://sftp.kew.org/pub/data_collaborations/Fabaceae/DwCA/ [Google Scholar]
  86. Leite V. G., Mansano V. F., Teixeira S. P. (2014). Floral ontogeny in Dipterygeae (Fabaceae) reveals new insights into one of the earliest branching tribes in papilionoid legumes. Bot. J. Linnean Soc. 174 529–550. 10.1111/boj.12158 [DOI] [Google Scholar]
  87. Leite V. G., Teixeira S. P., Mansano V. F., Prenner G. (2015). Floral development of the early-branching papilionoid legume Amburana cearensis (Leguminosae) reveals rare and novel characters. Int. J. Plant Sci. 176 94–106. 10.1086/678468 [DOI] [Google Scholar]
  88. Letunic I., Bork P. (2021). Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49 W293–W296. 10.1093/nar/gkab301 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Lewis G. P., Schrire B., Mackinder B., Lock M. (2005). Legumes of the World. Richmond, UK: Royal Botanic Gardens, Kew. [Google Scholar]
  90. Li H.-T., Luo Y., Gan L., Ma P.-F., Gao L.-M., Yang J.-B., et al. (2021). Plastid phylogenomic insights into relationships of all flowering plant families. BMC Biol. 19:232. 10.1186/s12915-021-01166-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Lima H. C. (1980). Revisão taxonômica do gênero Vataireopsis Ducke (Leguminosae–Faboideae). Rodriguésia 32 21–40. [Google Scholar]
  92. Lima H. C. (1982). Revisão taxonômica do gênero Vatairea Aubl. (Leguminosae–Faboideae). Arch. Jard. Bot. Rio de Janeiro 26 173–214. [Google Scholar]
  93. Lima H. C. (1990). Tribo Dalbergieae (Leguminosae Papilionoideae)–Morfologia dos frutos, sementes e plântulas e sua aplicação na sistemática. Arch. Jard. Bot. Rio de Janeiro 30 1–42. [Google Scholar]
  94. Marazzi B., Ané C., Simon M. F., Delgado-Salinas A., Luckow M., Sanderson M. J. (2012). Locating evolutionary precursors on a phylogenetic tree. Evolution 66 3918–3930. 10.1111/j.1558-5646.2012.01720.x [DOI] [PubMed] [Google Scholar]
  95. Marazzi B., Gonzalez A. M., Delgado-Salinas A., Luckow M. A., Ringelberg J. J., Hughes C. E. (2019). Extrafloral nectaries in Leguminosae: phylogenetic distribution, morphological diversity and evolution. Aust. Syst. Bot. 32 409–458. 10.1071/SB19012 [DOI] [Google Scholar]
  96. Marazzi B., Sanderson M. J. (2010). Large-scale patterns of diversification in the widespread legume genus Senna and the evolutionary role of extrafloral nectaries. Evolution 64 3570–3592. 10.1111/j.1558-5646.2010.01086.x [DOI] [PubMed] [Google Scholar]
  97. Martin G. E., Rousseau-Gueutin M., Cordonnier S., Lima O., Michon-Coudouel S., Naquin D., et al. (2014). The first complete chloroplast genome of the Genistoid legume Lupinus luteus: evidence for a novel major lineage-specific rearrangement and new insights regarding plastome evolution in the legume family. Ann. Bot. 113 1197–1210. 10.1093/aob/mcu050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. McMahon M., Hufford L. (2004). Phylogeny of Amorpheae (Fabaceae: papilionoideae). Am. J. Bot. 91 1219–1230. 10.3732/ajb.91.8.1219 [DOI] [PubMed] [Google Scholar]
  99. McMahon M. M., Sanderson M. J. (2006). Phylogenetic supermatrix analysis of GenBank sequences from 2228 papilionoid legumes. Syst. Biol. 55 818–836. 10.1080/10635150600999150 [DOI] [PubMed] [Google Scholar]
  100. Milligan B. G., Hampton J. N., Palmer J. D. (1989). Dispersed repeats and structural reorganization in subclover chloroplast DNA. Mol. Biol. Evol. 6 355–368. 10.1093/oxfordjournals.molbev.a040558 [DOI] [PubMed] [Google Scholar]
  101. Nguyen L.-T., Schmidt H. A., Von Haeseler A., Minh B. Q. (2015). IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32 268–274. 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Nicholls J. A., Pennington R. T., Koenen E. J. M., Hughes C. E., Hearn J., Bunnefeld L., et al. (2015). Using targeted enrichment of nuclear genes to increase phylogenetic resolution in the neotropical rain forest genus Inga (Leguminosae: mimosoideae). Front. Plant. Sci. 6:710. 10.3389/fpls.2015.00710 [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Ojeda D., I, Koenen E. J. M., Cervantes S., de la Estrella M., Banguera-Hinestroza E., Janssens S. B., et al. (2019). Phylogenomic analyses reveal an exceptionally high number of evolutionary shifts in a florally diverse clade of African legumes. Mol. Phylogenet. Evol. 137 156–167. 10.1016/j.ympev.2019.05.002 [DOI] [PubMed] [Google Scholar]
  104. Orton L. M., Barberá P., Nissenbaum M. P., Peterson P. M., Quintanar A., Soreng R. J., et al. (2021). A 313 plastome phylogenomic analysis of Pooideae: exploring relationships among the largest subfamily of grasses. Mol. Phylogenet. Evol. 159:107110. 10.1016/j.ympev.2021.107110 [DOI] [PubMed] [Google Scholar]
  105. Oyebanji O., Zhang R., Chen S.-Y., Yi T.-S. (2020). New insights into the plastome evolution of the Millettioid/Phaseoloid clade (Papilionoideae, Leguminosae). Front. Plant. Sci. 11:151. 10.3389/fpls.2020.00151 [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Palmer J. D., Osorio B., Aldrich J., Thompson W. F. (1987). Chloroplast DNA evolution among legumes: loss of a large inverted repeat occurred prior to other sequence rearrangements. Curr. Genet. 11 275–286. 10.1007/BF00355401 [DOI] [Google Scholar]
  107. Palmer J. D., Thompson W. F. (1982). Chloroplast DNA rearrangements are more frequent when a large inverted repeat sequence is lost. Cell 29 537–550. 10.1016/0092-8674(82)90170-2 [DOI] [PubMed] [Google Scholar]
  108. Park S., An B., Park S. (2020). Recurrent gene duplication in the angiosperm tribe Delphinieae (Ranunculaceae) inferred from intracellular gene transfer events and heteroplasmic mutations in the plastid matK gene. Sci. Rep. 10:2720. 10.1038/s41598-020-59547-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Pennington R. T., Klitgaard B. B., Ireland H., Lavin M. (2000). “New insights into floral evolution of basal Papilionoideae from molecular phylogenies,” in Advances in Legume Systematics, Part 9, eds Herendeen P. S., Bruneau A. (Richmond, UK: Royal Botanic Gardens, Kew; ), 233–248. [Google Scholar]
  110. Pennington R. T., Lavin M., Ireland H., Klitgaard B., Preston J., Hu J.-M. (2001). Phylogenetic relationships of basal papilionoid legumes based upon sequences of the chloroplast trnL intron. Syst. Bot. 26 537–556. 10.1043/0363-6445-26.3.537 [DOI] [Google Scholar]
  111. Pereira J. B. S., Giulietti A. M., Prado J., Vasconcelos S., Watanabe M. T. C., Pinangé D. S. B., et al. (2021). Plastome-based phylogenomics elucidate relationships in rare Isoëtes species groups from the Neotropics. Mol. Phylogenet. Evol. 161:107177. 10.1016/j.ympev.2021.107177 [DOI] [PubMed] [Google Scholar]
  112. Polhill R. M. (1981a). “Sophoreae,” in Advances in Legume Systematics, Part 1, eds Polhill R. M., Raven P. H. (Richmond, UK: Royal Botanic Gardens, Kew; ), 213–230. [Google Scholar]
  113. Polhill R. M. (1981b). “Papilionoideae,” in Advances in Legume Systematics, Part 1, eds Polhill R. M., Raven P. H. (Richmond, UK: Royal Botanic Gardens, Kew; ), 191–208. [Google Scholar]
  114. Polhill R. M. (1994). “Classification of the Leguminosae,” in Phytochemical Dictionary of the Leguminosae, eds Bisby F. A., Buckingham J., Harborne J. B. (London, UK: Chapman and Hall; ), xxxv–xlviii. [Google Scholar]
  115. Prenner G., Cardoso D., Zartman C. E., Queiroz L. P. (2015). Flowers of the early-branching papilionoid legume Petaladenium urceoliferum display unique morphological and ontogenetic features. Am. J. Bot. 102 1780–1793. 10.3732/ajb.1500348 [DOI] [PubMed] [Google Scholar]
  116. Queiroz L. P., Pastore J. F. B., Cardoso D., Snak C., Lima A. L. D. C., Gagnon E., et al. (2015). A multilocus phylogenetic analysis reveals the monophyly of a recircumscribed papilionoid legume tribe Diocleae with well-supported generic relationships. Mol. Phylogenet. Evol. 90 1–19. 10.1016/j.ympev.2015.04.016 [DOI] [PubMed] [Google Scholar]
  117. Queiroz L. P., São Mateus W., Delgado-Salinas A., Torke B. M., Lewis G. P., Dorado O., et al. (2017). A molecular phylogeny reveals the Cuban enigmatic genus Behaimia as a new piece in the Brongniartieae puzzle of papilionoid legumes. Mol. Phylogenet. Evol. 109 191–202. 10.1016/j.ympev.2017.01.001 [DOI] [PubMed] [Google Scholar]
  118. Ramos G., Lima H. C., Prenner G., Queiroz L. P., Zartman C. E., Cardoso D. (2016). Molecular systematics of the Amazonian genus Aldina, a phylogenetically enigmatic ectomycorrhizal lineage of papilionoid legumes. Mol. Phylogenet. Evol. 97 11–18. 10.1016/j.ympev.2015.12.017 [DOI] [PubMed] [Google Scholar]
  119. Regier J. C., Shultz J. W., Zwick A., Hussey A., Ball B., Wetzer R., et al. (2010). Arthropod relationships revealed by phylogenomic analysis of nuclear protein-coding sequences. Nature 463 1079–1083. 10.1038/nature08742 [DOI] [PubMed] [Google Scholar]
  120. Sabir J., Schwarz E., Ellison N., Zhang J., Baeshen N. A., Mutwakil M., et al. (2014). Evolutionary and biotechnology implications of plastid genome variation in the inverted-repeat-lacking clade of legumes. Plant Biotechnol. J. 12 743–754. 10.1111/pbi.12179 [DOI] [PubMed] [Google Scholar]
  121. Sanderson M. J., Donoghue M. J. (1994). Shifts in diversification rate with the origin of angiosperms. Science 264 1590–1593. 10.1126/science.264.5165.1590 [DOI] [PubMed] [Google Scholar]
  122. Sargent R. D. (2004). Floral symmetry affects speciation rates in angiosperms. Proc. R. Soc. Lond. B 271 603–608. 10.1098/rspb.2003.2644 [DOI] [PMC free article] [PubMed] [Google Scholar]
  123. Schaefer H., Hechenleitner P., Santos-Guerra A., Sequeira M. M., Pennington R. T., Kenicer G., et al. (2012). Systematics, biogeography, and character evolution of the legume tribe Fabeae with special focus on the middle-Atlantic island lineages. BMC Evol. Biol. 12:250. 10.1186/1471-2148-12-250 [DOI] [PMC free article] [PubMed] [Google Scholar]
  124. Schneider J. V., Paule J., Jungcurt T., Cardoso D., Amorim A. M., Berberich T., et al. (2021). Resolving recalcitrant clades in the Pantropical Ochnaceae: insights from comparative phylogenomics of plastome and nuclear genomic data derived from targeted sequencing. Front. Plant Sci. 12:638650. 10.3389/fpls.2021.638650 [DOI] [PMC free article] [PubMed] [Google Scholar]
  125. Schrire B. D., Lavin M., Barker N. P., Forest F. (2009). Phylogeny of the tribe Indigofereae (Leguminosae–Papilionoideae): geographically structured more in succulent-rich and temperate settings than in grass-rich environments. Am. J. Bot. 96 816–852. 10.3732/ajb.0800185 [DOI] [PubMed] [Google Scholar]
  126. Schrire B. D., Lavin M., Lewis G. P. (2005). Global distribution patterns of the Leguminosae: insights from recent phylogenies. Biol. Skr. 55 375–422. [Google Scholar]
  127. Schwarz E. N., Ruhlman T. A., Sabir J. S., Hajrah N. H., Alharbi N. S., Al-Malki A. L., et al. (2015). Plastid genome sequences of legumes reveal parallel inversions and multiple losses of rps16 in papilionoids. J. Syst. Evol. 53 458–468. 10.1111/jse.12179 [DOI] [Google Scholar]
  128. Serna-Sánchez M. A., Pérez-Escobar O. A., Bogarín D., Torres-Jimenez M. F., Alvarez-Yela A. C., Arcila-Galvis J. E., et al. (2021). Plastid phylogenomics resolves ambiguous relationships within the orchid family and provides a solid timeframe for biogeography and macroevolution. Sci. Rep. 11:6858. 10.1038/s41598-021-83664-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Sinjushin A. A. (2018). Floral ontogeny in Cordyla pinnata (A. Rich.) Milne-Redh. (Leguminosae, Papilionoideae): away from stability. Flora 241 8–15. 10.1016/j.flora.2018.02.005 [DOI] [Google Scholar]
  130. Specht C. D., Bartlett M. E. (2009). Flower evolution: the origin and subsequent diversification of the angiosperm flower. Annu. Rev. Ecol. Evol. Syst. 40 217–243. 10.1146/annurev.ecolsys.110308.120203 [DOI] [Google Scholar]
  131. Sprent J. I. (2000). “Nodulation as a taxonomic tool,” in Advances in Legume Systematics, Part 9, eds Herendeen P. S., Bruneau A. (Richmond, UK: Royal Botanic Gardens, Kew; ), 21–43. [Google Scholar]
  132. Sprent J. I., Ardley J., James E. K. (2017). Biogeography of nodulated legumes and their nitrogen-fixing symbionts. New Phytol. 215 40–56. 10.1111/nph.14474 [DOI] [PubMed] [Google Scholar]
  133. Steele K. P., Wojciechowski M. F. (2003). “Phylogenetic analyses of tribes Trifolieae and Vicieae, based on sequences of the plastid gene matK (Papilionoideae: leguminosae),” in Advances in Legume Systematics, Part 10, eds Klitgaard B. B., Bruneau A. (Richmond, UK: Royal Botanic Gardens, Kew; ), 355–370. [Google Scholar]
  134. Stefanović S., Pfeil B. E., Palmer J. D., Doyle J. J. (2009). Relationships among phaseoloid legumes based on sequences from eight chloroplast regions. Syst. Bot. 34 115–128. 10.1600/036364409787602221 [DOI] [Google Scholar]
  135. Sveinsson S., Cronk Q. (2014). Evolutionary origin of highly repetitive plastid genomes within the clover genus (Trifolium). BMC Evol. Biol. 14:228. 10.1186/s12862-014-0228-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  136. Swanepoel W., Le Roux M. M., Wojciechowski M. F., Van Wyk A. E. (2015). Oberholzeria (Fabaceae subfam. Faboideae), a new monotypic legume genus from Namibia. PLoS One 10:e0122080. 10.1371/journal.pone.0122080 [DOI] [PMC free article] [PubMed] [Google Scholar]
  137. Thode V. A., Lohmann L. G., Sanmartín I. (2020). Evaluating character partitioning and molecular models in plastid phylogenomics at low taxonomic levels: a case study using Amphilophium (Bignonieae, Bignoniaceae). J. Syst. Evol. 58 1071–1089. 10.1111/jse.12579 [DOI] [Google Scholar]
  138. Tucker S. C. (1993). Floral ontogeny in Sophoreae (Leguminosae: papilionoideae). I. Myroxylon (Myroxylon group) and Castanospermum (Angylocalyx group). Am. J. Bot. 80 65–75. 10.1002/j.1537-2197.1993.tb13768.x [DOI] [Google Scholar]
  139. Tucker S. C. (2003a). Floral development in legumes. Plant Physiol. 131 911–926. 10.1104/pp.102.017459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  140. Tucker S. C. (2003b). Floral ontogeny in Swartzia (Leguminosae: papilionoideae: swartzieae): distribution and role of the ring meristem. Am. J. Bot. 90 1271–1292. 10.3732/ajb.90.9.1271 [DOI] [PubMed] [Google Scholar]
  141. Tucker S. C., Douglas A. W. (1994). “Ontogenetic evidence and phylogenetic relationships among basal taxa of legumes,” in Advances in Legume Systematics, Part 6, eds Ferguson I. K., Tucker S. C. (Richmond, UK: Royal Botanic Gardens, Kew; ), 11–32. [Google Scholar]
  142. Vatanparast M., Powell A., Doyle J. J., Egan A. N. (2018). Targeting legume loci: a comparison of three methods for target enrichment bait design in Leguminosae phylogenomics. Appl. Plant Sci. 6:e1036. 10.1002/aps3.1036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  143. Villaverde T., Pokorny L., Olsson S., Rincón-Barrado M., Johnson M. G., Gardner E. M., et al. (2018). Bridging the micro- and macroevolutionary levels in phylogenomics: hyb-seq solves relationships from populations to species and above. New Phytol. 220 636–650. 10.1111/nph.15312 [DOI] [PubMed] [Google Scholar]
  144. Walker J. F., Walker-Hale N., Vargas O. M., Larson D. A., Stull G. W. (2019). Characterizing gene tree conflict in plastome-inferred phylogenies. PeerJ 7:e7747. 10.7717/peerj.7747 [DOI] [PMC free article] [PubMed] [Google Scholar]
  145. Welch A. J., Collins K., Ratan A., Drautz-Moses D. I., Schuster S. C., Lindqvist C. (2016). The quest to resolve recent radiations: plastid phylogenomics of extinct and endangered Hawaiian endemic mints (Lamiaceae). Mol. Phylogenet. Evol. 99 16–33. 10.1016/j.ympev.2016.02.024 [DOI] [PubMed] [Google Scholar]
  146. Wink M. (2013). Evolution of secondary metabolites in legumes (Fabaceae). S. Afr. J. Bot. 89 164–175. 10.1016/j.sajb.2013.06.006 [DOI] [Google Scholar]
  147. Wink M., Botschen F., Gosmann C., Schäfer H., Waterman P. G. (2010). “Chemotaxonomy seen from a phylogenetic perspective and evolution of secondary metabolism,” in Annual Plant Reviews Volume 40: Biochemistry of Plant Secondary Metabolism, ed. Wink M. (Oxford, UK: Wiley-Blackwell; ), 364–433. [Google Scholar]
  148. Wojciechowski M. F. (2013a). The origin and phylogenetic relationships of the Californian chaparral ‘paleoendemic’ Pickeringia (Leguminosae). Syst. Bot. 38 132–142. 10.1600/036364413X662024 [DOI] [Google Scholar]
  149. Wojciechowski M. F. (2013b). Towards a new classification of Leguminosae: naming clades using non-Linnaean phylogenetic nomenclature. S. Afr. J. Bot. 89 85–93. 10.1016/j.sajb.2013.06.017 [DOI] [Google Scholar]
  150. Wojciechowski M. F., Lavin M., Sanderson M. J. (2004). A phylogeny of legumes (Leguminosae) based on analysis of the plastid matK gene resolves many well-supported subclades within the family. Am. J. Bot. 91 1846–1862. 10.3732/ajb.91.11.1846 [DOI] [PubMed] [Google Scholar]
  151. Wojciechowski M. F., Sanderson M. J., Steele K. P., Liston A. (2000). “Molecular phylogeny of the “temperate herbaceous tribes” of papilionoid legumes: a supertree approach,” in Advances in Legume Systematics, Part 9, eds Herendeen P. S., Bruneau A. (Richmond, UK: Royal Botanic Gardens, Kew; ), 277–298. [Google Scholar]
  152. Xi Z., Ruhfel B. R., Schaefer H., Amorim A. M., Sugumaran M., Wurdack K. J., et al. (2012). Phylogenomics and a posteriori data partitioning resolve the Cretaceous angiosperm radiation Malpighiales. Proc. Natl. Acad. Sci. U.S.A. 109 17519–17524. 10.1073/pnas.1205818109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  153. Yahara T., Javadi F., Onoda Y., Queiroz L. P., Faith D. P., Prado D. E., et al. (2013). Global legume diversity assessment: concepts, key indicators, and strategies. Taxon 62 249–266. 10.12705/622.12 [DOI] [Google Scholar]
  154. Yang L., Su D., Chang X., Foster C. S. P., Sun L., Huang C.-H., et al. (2020). Phylogenomic insights into deep phylogeny of angiosperms based on broad nuclear gene sampling. Plant Commun. 1:100027. 10.1016/j.xplc.2020.100027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  155. Yu G., Smith D., Zhu H., Guan Y., Lam T. T.-Y. (2017). ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Meth. Ecol. Evol. 8 28–36. 10.1111/2041-210X.12628 [DOI] [Google Scholar]
  156. Zhang C., Rabiee M., Sayyari E., Mirarab S. (2018). ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinform. 19:153. 10.1186/s12859-018-2129-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  157. Zhang Q., Liu Y. Sodmergen (2003). Examination of the cytoplasmic DNA in male reproductive cells to determine the potential for cytoplasmic inheritance in 295 angiosperm species. Plant Cell Physiol. 44 941–951. 10.1093/pcp/pcg121 [DOI] [PubMed] [Google Scholar]
  158. Zhang R., Wang Y.-H., Jin J.-J., Stull G. W., Bruneau A., Cardoso D., et al. (2020). Exploration of plastid phylogenomic conflict yields new insights into the deep relationships of Leguminosae. Syst. Biol. 69 613–622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  159. Zhao Y., Zhang R., Jiang K.-W., Qi J., Hu Y., Guo J., et al. (2021). Nuclear phylotranscriptomics and phylogenomics support numerous polyploidization events and hypotheses for the evolution of rhizobial nitrogen-fixing symbiosis in Fabaceae. Mol. Plant. 14 748–773. 10.1016/j.molp.2021.02.006 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: GenBank with accessions MZ725323, OL672849-OL672886. All sequence alignments and trees that were generated from this study are submitted to Dryad (https://doi.org/10.5061/dryad.sf7m0cg7m).


Articles from Frontiers in Plant Science are provided here courtesy of Frontiers Media SA

RESOURCES