Significance
We demonstrate that instances of high gene-tree conflict (discordance in phylogenetic signal across genes) in mammals, birds, and several major plant clades correspond to rate increases in morphological innovation. We suggest that gene-tree conflict and rapid phenotypic change across major clades of vascular plants, mammals, and birds are often the result of coincident population processes (e.g., changes in population size, rapid speciation, and incomplete lineage sorting). Closer examination of these coincident patterns will contribute to a stronger understanding of the microevolutionary dynamics underlying the emergence of phenotypic patterns typically only observable over deep time. Our findings indicate that patterns of gene-tree conflict, occurring episodically across the Tree of Life, offer clues toward understanding the population-level processes driving major evolutionary transitions.
Keywords: phylogenomics, morphological evolution, gene-tree discordance, tempo and mode, molecular evolution
Abstract
Evolutionary biologists have long been fascinated with the episodes of rapid phenotypic innovation that underlie the emergence of major lineages. Although our understanding of the environmental and ecological contexts of such episodes has steadily increased, it has remained unclear how population processes contribute to emergent macroevolutionary patterns. One insight gleaned from phylogenomics is that gene-tree conflict, frequently caused by population-level processes, is often rampant during the origin of major lineages. With the understanding that phylogenomic conflict is often driven by complex population processes, we hypothesized that there may be a direct correspondence between instances of high conflict and elevated rates of phenotypic innovation if both patterns result from the same processes. We evaluated this hypothesis in six clades spanning vertebrates and plants. We found that the most conflict-rich regions of these six clades also tended to experience the highest rates of phenotypic innovation, suggesting that population processes shaping both phenotypic and genomic evolution may leave signatures at deep timescales. Closer examination of the biological significance of phylogenomic conflict may yield improved connections between micro- and macroevolution and increase our understanding of the processes that shape the origin of major lineages across the Tree of Life.
Phylogenomic conflict, where gene trees disagree about species tree resolution, is common across genomes and throughout the Tree of Life (1–3). Conflict is often driven by population processes such as incomplete lineage sorting (ILS), hybridization, and rapid evolution. Regions of high conflict often coincide with the emergence of major clades, such as mammals (e.g., ref. 4), angiosperms (e.g., ref. 5), and metazoa (e.g., ref. 6). This suggests that heterogeneity in the population processes underlying the origins of biodiversity has been recorded in particular nodes across the Tree of Life (e.g., refs. 3, 7). Most large-scale phylogenetic and phylogenomic studies meant to resolve species relationships have treated gene-tree discordance as an analytical nuisance to be filtered or accommodated (e.g., refs. 4, 5, 7). However, since phylogenomic conflict often represents the imprint of past population genetic processes on the genome, studying its correlation with other macroevolutionary patterns may shed light on the microevolutionary processes underlying major transitions across the Tree of Life.
Researchers have long recognized that rates of morphological evolution vary across the Tree of Life, with pronounced bursts in morphological change interspersed with periods of relative stasis (8, 9). However, the strength of the relationship between rates of morphological change and genomic evolution is not well understood. In particular, it may be worthwhile to examine whether phylogenomic conflict tends to coincide with periods of rapid morphological change. This may help to identify whether genomic conflict and morphological innovation detectable over macroevolutionary timescales stem from the same population processes.
Phylogenomic conflict often appears to coincide with important episodes of morphological differentiation among major lineages. For example, the major differences in life history and body plan that distinguish mammalian orders emerged rapidly among ancestral taxa following the Cretaceous-Paleogene (K-Pg) mass extinction (10). Phylogenetic resolution of many of the deepest mammal nodes has proved difficult, and attempts at resolution have included the use of both sophisticated methodological approaches and the generation of large-scale genomic datasets (11–13). The early avian radiation has proved similarly challenging and is notable for the rapid establishment of phenotypically and ecologically disparate lineages. Several large-scale studies using massive genomic datasets have revealed extensive conflict among phylogenetic branches coinciding with the early radiation of crown Aves (14, 15). Since the origin of land plants, there have been numerous major phases of morphological and ecological innovation, ranging from the initial appearance of vascular plant body plans in the late Silurian (16) to distinct phases of angiosperm radiation from the Cretaceous to the present (17). As with the vertebrate lineages, the origins of many major plant clades show elevated levels of phylogenomic conflict, some of which may stem from biased representations of gene categories with functional significance (18). Many of these high-conflict branches have been the subject of intense analysis and debate (e.g., refs. 3, 7, 19), and recent sequencing efforts have failed to confidently resolve most of these challenges.
Here, we examine whether periods of high genomic conflict coincide with episodes of rapid morphological evolution across several phylogenetically disparate clades. Many clades show qualitative evidence that time periods with particularly rampant conflict are associated with important phenotypic changes. However, if phylogenomic conflict indeed represents the remnants of complex population processes that are adaptively important, we reasoned that there should be a direct correspondence between areas of high conflict and elevated rates of phenotypic evolution. To examine whether this is the case, we reconstructed concurrent patterns in phylogenomic conflict and phenotypic evolution across six major clades of the eukaryotic Tree of Life: crown Aves, Mammalia, and the vascular plant lineages Ericales, Caryophyllales, Polypodiopsida, and Acrogymnospermae.
Results
We assembled and analyzed six genomic datasets for plants, mammals, and birds. Specifically, we examined published phylogenomic datasets including 624 genes of Caryophyllales from Yang et al. (20), 387 genes of Ericales from Larson et al. (21), 1,332 gene regions of Polypodiopsida from Shen et al. (22), 1,308 gene regions representing Acrogymnospermae from Ran et al. (23), 424 genes of Mammalia from Song et al. (4), and 259 gene regions of Aves from Prum et al. (15). We gathered publicly available phenotypic matrices corresponding to each of these datasets (see Materials and Methods). For the Caryophyllales, a suitable matrix was not available, so we generated a matrix comprising 23 phenotypic traits scored across the species included in the phylogenomic sampling. Generally, broad phenotypic matrices that cover the taxa in each of these datasets are rare or nonexistent. Furthermore, overlap between the genomic and phenotypic matrices was such that, other than for Caryophyllales, each needed to be reduced to ensure that identical edges were being compared. Divergence times continue to be controversial across the Tree of Life. To overcome these difficulties, we used one set of divergence times and one topology for both the morphological and molecular analyses (SI Appendix).
We found that mammals experienced elevated levels of both phylogenomic discordance and phenotypic innovation near the K-Pg boundary (Fig. 1 A and B). These episodes of conflict coincided with high phenotypic rates calculated from two datasets: a massive “phenomic” matrix of >4,000 qualitative characters (ref. 10, median P = 0.001; range = 0 to 0.006) and a dataset of mammalian body masses (ref. 24, median P = 0.144; range = 0.117 to 0.175), which we interpreted as a proxy for many strongly correlated aspects of life history and ecology (25). The two phenotypic datasets displayed remarkably similar patterns through time, showing that both life history and anatomical characteristics diversified rapidly during the early divergences of placental lineages. Although skeletal and dental characters comprised the majority of the qualitative dataset, characters representing internal and external organs, musculature, and behavior were also included. Our analysis of body masses is consistent with earlier studies in showing that higher-level variation in mammalian life history was established during their early radiation (26), coincident with rapid rates of anatomical change and high phylogenomic discordance. The association of body mass with conflict was statistically nonsignificant due to a continued elevated pace of body-mass evolution after the K-Pg (Fig. 1B). While many of the phenotypic and ecological innovations of mammals likely originated during the evolutionary burst near the K-Pg boundary, it is possible that adjustments to body mass represent an additional source of more labile adaptive variation that has facilitated the continued expansion of niche space occupied by mammalian lineages throughout the Cenozoic.
Fig. 1.
Co-occurring patterns in gene-tree conflict and morphological rates of evolution across the mammal (A and B) and bird (C and D) datasets. Conflict is measured as the proportion of gene trees that disagree with the species tree. Morphological rate is measured as the number of changes per million years for qualitative datasets and Brownian variance per million years for quantitative datasets. Vertical lines correspond to the onset of the Paleogene (66 Ma) and the Miocene (23.03 Ma).
For birds, our analyses of the genomic (15), qualitative (ref. 27, median P = 0; range = 0 to 0.001), and quantitative (ref. 28, median P = 0; range = 0 to 0.001) data revealed rampant conflict accompanied by elevated phenotypic rates among early-diverging avian lineages immediately following the K-Pg boundary (Fig. 1 C and D). We also observed an additional episode of increased conflict that occurred at the onset of the Miocene. The results of the two phenotypic datasets were similar, with the main difference being that the quantitative dataset composed of principal component axes corresponding to body size, relative beak size, and relative wing size showed slightly more elevated rates throughout the Cenozoic compared to the qualitative dataset following the initial peak present in both datasets at the K-Pg. This difference likely reflects the increased lability of these three axes of quantitative variation versus the discrete transformations captured by the qualitative traits.
The Caryophyllales showed substantial conflict during epochs that also witnessed fast rates of phenotypic change (Fig. 2A; median P = 0, range = 0 to 0.004). Based on available divergence times, these conflicts occurred between ∼59 and 101 Mya with the most extensive conflict at ∼77 Mya. Caryophyllales clearly stand out among major flowering plant clades in terms of their phenotypic and ecological diversity (e.g., refs. 29–35), and we find that many characteristic Caryophyllales features (e.g., betalain pigments, successive cambia, variations in perianth differentiation and merosity, campylotropous ovules, curved embryos, and carnivory) arose during this temporal window of high conflict, with some traits showing complicated yet intriguing patterns of presence/absence across the phylogeny. Overall, the link between conflict and rate of innovation was very strong in the Caryophyllales, showing a close relationship both visually (Fig. 2A) and statistically (SI Appendix, Fig. S2A).
Fig. 2.
Co-occurring patterns in gene-tree conflict and morphological rates of evolution across (A) Caryophyllales, (B) Ericales, (C) Gymnosperms, and (D) Ferns. Conflict is measured as the proportion of gene trees that disagree with the species tree. Morphological rate is measured as the number of changes per million years. Vertical lines correspond to the onset of the Paleogene (66 Ma) and the Miocene (23.03 Ma).
The Ericales exhibited both elevated conflict and rapid phenotypic change during their early radiation (Fig. 2B; median P = 0.006, range = 0.001 to 0.013). This episode witnessed the rapid evolution of reproductive features such as perianth merism, androecial structure (e.g., stamen number and anther orientation and attachment), and carpel number. Similar variation in floral morphology is evident in some of the earliest fossil representatives of this order (e.g., refs. 36–38) and likely relates to early coevolution with insect pollinators, particularly Hymenoptera (39, 40). This observed phylogenomic conflict and innovation in floral morphology precedes the evolution of the major families and also corresponds to several life history shifts and movements out of the tropics (41) occurring by ∼79 to 90 Mya based on available fossil evidence (e.g., ref. 37) and between ∼100 and 110 Mya based on molecular divergence-time estimates (41, 42).
The gymnosperms (Acrogymnospermae) show several spikes in both conflict and rates of morphological evolution loosely corresponding to major stratigraphic boundaries marked by episodes of major ecological upheaval (Fig. 2C; median P = 0.8945, range = 0.873 to 0.92). We found the visible similarity of the patterns in morphological rate and phylogenomic conflict compelling despite the lack of statistical significance and believe that the lack of significance likely stems from limited taxonomic sampling in both the morphological and molecular datasets. The largest spike for both conflict and innovation occurs before the Permian-Triassic (P-T) boundary and corresponds to the divergence of Gnetales. The phylogenetic placement of this lineage has long been contentious, with different molecular datasets (or analytical methods) resulting in different topologies (e.g., ref. 43). Morphologically, Gnetales is also highly modified compared to other gymnosperms, which has led to various hypotheses concerning its placement (44). Our results, which follow recent studies in placing Gnetales sister to Pinaceae (e.g., refs. 3, 23), suggest that the divergence of the order involved a coordinated burst in both genomic and morphological evolution; the possible causes are unclear but deserve further study.
The analysis of ferns (=Polypodiopsida or “monilophytes,” ref. 45) suggests a possible link between morphological rate and phylogenomic conflict that is qualitatively apparent but not statistically detectable (Fig. 2D; median P = 0.534, range = 0.504 to 0.573), perhaps due to the small sample size. This is shown in the raw data, which shows a positive trend between discordance and morphological rate, albeit between relatively few data points (SI Appendix, Fig. S7D). Ferns (in the broad sense) are a heterogeneous assemblage of ancient lineages—including horsetails (Equisetidae), marratioid ferns (Marattiidae), whisk and ophioglossid ferns (Ophioglossidae), and leptosporangiate ferns (Polypodiidae)—that trace back to the Devonian radiation of vascular plants circa 393 to 359 Ma (e.g., refs. 16, 46). Prior to phylogenetic analyses including molecular data (e.g., ref. 45), the lineages comprising ferns sensu lato were not thought to form a clade, but analyses have continued to recover this relationship with strong support (5, 7, 45). However, relationships among the four major constituent lineages remain contentious (e.g., refs. 7, 22). Our results suggest that the conflict underlying earliest divergences in fern phylogeny may also coincide with pronounced phases of rapid morphological innovation, a possibility that deserves further study given that many aspects of phenotypic evolution in this clade (e.g., the origins of megaphyll leaves) remain poorly known (47).
Discussion
We demonstrate that several instances of high phylogenomic conflict correspond to instances of elevated rates in morphological innovation. This has implications for understanding both phenomena. First, biological processes, perhaps in addition to error, have a large impact on the patterns of conflict even at deep time scales and when filtering for strongly supported signals. That biological processes can cause conflicting signals in gene trees is well known. However, typically, researchers attempt to overcome this conflict through modeling or methodological developments intended to construct a species tree. We suggest that population-level processes such as fluctuations in population size, rapid speciation, ILS, and introgression can leave strong macroevolutionary signatures observable in patterns of both gene-tree discordance and phenotypic evolution. The broad view engendered by our examination may be enhanced by targeted studies of the adaptive effects of ILS and other population-level processes during more recent radiations (e.g., ref. 48).
Disentangling the sources of conflict within phylogenomic datasets is fraught with difficulties. In the datasets examined here, while there may be instances of hybridization and, to a lesser extent, horizontal gene transfer that are contributing to the patterns of conflict, there is no published evidence that these phenomena are the source of biological conflict at the nodes that correspond to bursts in morphological innovation. There has been some suggestion of ancient hybridization early in the Caryophyllales (e.g., ref. 49); however, there are challenges of identifiability that have yet to be overcome. Whole genome duplications (perhaps in many cases stemming from hybridization, i.e., allopolyploidy events) may also result in conflicts across the Tree of Life, especially as they may create errors as a result of misidentified paralogy. Other systematic errors and background noise may be sources of conflict, especially when exacerbated by rapid evolution (3, 21). Finally, ILS has been recognized as a major source of gene-tree disagreement at all depths of the Tree of Life (50). Conflicts due to ILS will be amplified by particular population processes such as changes in effective population size from selection, bottlenecks, or other phenomena and/or rapid speciation (51). The vast majority of the nodes we highlight have been difficult to resolve using multiple data types and analytical techniques (5, 7, 14, 15, 21, 52), suggesting that the conflicting signals we examined stem predominantly from evolutionary processes rather than error.
One notable attribute of several of the clades examined (e.g., Ericales and Caryophyllales) is that many phenotypic traits show complex patterns of presence/absence across lineages stemming from regions of high conflict. Such character-state distributions are typically interpreted as independent gains and losses (i.e., homoplasy). However, it is possible that the inheritance of phenotypic traits from a polymorphic ancestral population might in some cases follow an ILS-like process (i.e., hemiplasy; refs. 53, 54). If the genes underlying phenotypic traits are subject to ILS, it follows that phenotypic traits themselves are also subject to this process. This might serve as a better explanation than independent gains/losses for complex distributions of phenotypic states in some clades. Hypothetically, this would imply that patterns of trait evolution during rapid “adaptive” radiations in some cases might be partly or largely stochastic—a byproduct of the sorting of underlying genetic variation. However, combinations of traits inherited through such a process could subsequently prove adaptive. This possibility deserves future attention and, importantly, would complement our results here. Elevated macroevolutionary rates of phenotypic evolution represent the rapid fixation of numerous traits within a short temporal window. This remains the case even if the substitution rate at the population level is artificially inflated due to unaccounted discordance (55, 56). In cases of rapid speciation events from a polymorphic ancestor, hemiplasy might serve as one mechanistic link between genomic conflict and elevated rates of phenotypic evolution in the emergent lineages. Nevertheless, examining this possibility will be challenging given the complexity of the genotypic-to-phenotypic map and the noisiness of discrete morphological datasets.
A link between population processes and phenotypic innovation has long been theorized (57). Although the specific relationship between microevolutionary processes and phenotypic changes observable at a macroevolutionary scale is not well understood, previous work has considered how demographic events such as rapid fluctuations in population size and genic exchange between diverging populations might work synergistically with subsequent environmental adaptation to drive the exploration of diverse areas of morphospace by facilitating the emergence of both new phenotypic features and novel combinations of character states (8, 58–60). Importantly, these processes are also likely to cause phylogenetic conflict if they persist over many generations and coincide with lineage divergences. We therefore suggest that phylogenomic conflict observable at deep timescales may often stem from the same population processes that facilitated the rapid and dramatic episodes of phenotypic change underpinning the origins of major clades. For example, the rapid speciation events that occur during evolutionary radiations may both generate conflicting phylogenomic signals due to population-level processes (e.g., rapid fluctuations in population size and selection) and drive the emergence of novel morphologies as diverging lineages carve out new ecological niches. In particular, a large population size will result in a smaller probability that two lineages will coalesce within a particular time. This should result in high conflict between gene trees. Several episodes of high conflict observed here (for example, in Ericales) are succeeded by periods of low conflict. This would suggest population size reduction if ILS can safely be assumed to be the source of conflict.
In addition to the notable coincidence of high conflict and rapid morphological innovation at particular nodes, many of these nodes also appear to correspond with major geologic events. As has long been observed, major radiations in mammals and crown birds seem closely tied to the K-Pg mass extinction; our results here linking conflict to morphological innovation hint at underlying processes that may have driven these radiations in the wake of K-Pg ecological collapse. Gymnosperms and ferns show coincidence spikes near the P-T boundary, another widely studied mass extinction event (61). But given our limited sampling of these particular clades, this association is much more tenuous. Unfortunately, examining these patterns in greater detail requires a reliance on accurate divergence times, which are challenging to estimate from molecular phylogenies. A better integration of fossil and phylogenomic data may offer a clearer understanding of how major geologic events have influenced coordinated bursts of conflict and phenotypic innovation.
Conclusion
This study provides an important link between genomic and morphological evolution at deep timescales. We suggest that episodic evolutionary and population events leave signatures of conflict within genomes that may offer important insight on the processes responsible not only for conflict but also for massive changes in phenotype across disparate lineages. Moving forward, we suggest that closer examinations using more densely sampled molecular and morphological datasets representing taxa across the Tree of Life will help us to generate a deeper understanding of the micro- and macroevolutionary processes that drive major organismal changes from the genomic to the phenotypic level.
Materials and Methods
Genomic Datasets.
We examined several published phylogenomic datasets including 624 loci of Caryophyllales from Yang et al. (20), 387 loci of Ericales from Larson et al. (21), 1,332 loci of ferns from Shen et al. (22), 1,308 loci of gymnosperms from Ran et al. (23), 424 loci of mammals from Song et al. (4), and 259 loci of Aves from Prum et al. (15). Species trees were all based on those presented in the original publications. Gene trees were reestimated using IQ-TREE (version 1.6.11) with support calculated using the approximate likelihood ratio test (-alrt 1000) and with the GTR and Γ model for nucleotides (-m GTR+G) and the LG and Γ model for amino acids (-m LG+G).
Morphological Datasets.
Large morphological datasets for major clades of flowering plants are surprisingly few, and, additionally, most available datasets are relatively limited in their sampling of either taxa and/or characters. For the Carophyllales, we generated a phenotypic dataset corresponding to the phylogenomic sampling, including 23 traits representing various vegetative and reproductive morphological features as well as several biochemical and physiological traits (e.g., betalain presence and photosynthetic pathway type). This matrix is available in the data supplement. For the gymnosperms, we used the published data from Nixon et al. (62), in which the data are primarily scored at family level. The taxon circumscriptions applied in Nixon et al. (62) for their operational taxonomic units (OTUs) are monophyletic and consistent with our current understanding of gymnosperm phylogeny and thus were applicable to the clades examined here. Thus, multiple exemplars within a clade corresponding to one of their OTUs will be identical in their states. For the Ericales, we used the published matrix developed by Chartier et al. (63) focused on floral characters. For mammals, we used the published dataset of O’Leary et al. (10), which covers mammal morphology broadly, as well as body-size data gathered from the PanTHERIA database (24). For the ferns, we used the morphological dataset from Pryer et al. (44), which was developed for analysis of fern relationships within the context of vascular plant phylogeny. Finally, for the birds, we used the morphological matrix compiled by Livezay and Zusi (27) as well as the principal component scores calculated from the large dataset of linear measurements presented by Pigot et al. (28). Many of the morphological datasets were sampled across more (or different) taxa than are present in the genomic datasets—in these cases, we pruned the morphological datasets to include only the taxa present in the genomic data (and vice versa).
Phylogenomic Analyses.
Conflict analyses were conducted with the program bp (available from https://github.com/FePhyFoFum/gophy) using species and gene trees. We used the options -tv to print the concordance, conflict, and noninformative values back to the species tree and -scut 80 to ignore edges that had less than 80% for the approximate likelihood ratio test (aLRT) statistic as calculated from the IQ-TREE analyses.
Morphological Analyses.
We calculated rates of discrete character evolution by estimating the number of character transitions along each branch using parsimony and dividing this estimate by the amount of time represented by the corresponding branch in the time-scaled tree. Rates calculated from discrete-character matrices were therefore presented in units of inferred character transitions per unit of time. For quantitative traits, we calculated the average character divergences along each branch by estimating ancestral states for each character under a Brownian motion model of evolution and then representing the amount of change along each edge as the absolute value of the difference between the trait value inferred for the ancestor and that inferred for the descendant. Like with the qualitative analyses, we then transformed these raw divergences to rates by calculating their ratio to the corresponding amount of elapsed time for the corresponding edge in the time-scaled tree. We also performed analyses on the discrete datasets using a model-based approach. We are concerned about the adequacy of existing Markov models for discrete phenotypic characters, but we provide these results in the supplement to provide further context for interested readers. Morphological branch lengths were calculated on the species tree topology using RAxML v8.2.12 (64) under the Mk model (-K MK) with ascertainment bias (-m ASC_MULTIGAMMA).
Assessing Statistical Significance.
We designed a permutation test to evaluate the statistical significance of the associations between phylogenomic discordance and morphological rate. We removed all edges with either no conflict or no phenotypic change because they can skew the distribution heavily, causing nodes with only low or moderate conflict/rate to be erroneously identified as significantly high. This step did not substantially modify the vertebrate datasets, which had few zero values, but did induce changes to the Caryophyllales dataset (SI Appendix, Figs. S6 and S7). The zero values stem from the low sampling of the trait dataset. This reduces power but is indicative of the general trend, which appears both through our permutation approach and a scatterplot of the raw data (SI Appendix, Fig. S7). We identified nodes with “significantly” elevated discordance and rate by resampling observed values 1,000 times (with replacement) and generating an empirical null distribution using the means calculated for each replicate. Significant nodes were identified at a one-tailed 0.05 threshold. We then checked whether the number of nodes with both significantly high discordance and rate departed from the expectation if the same number of nodes was sampled randomly. This test examines the strength of the association between discordance and morphological rate but was conservative for datasets with only a small number of nodes from which to sample due to low power. Since permutation tests depend on resampling, they can yield a range of significance values that result from random variations between resampled replicates. We therefore performed each test 1,000 times and present the range of P values observed across all replicates. Some degree of qualitative assessment using the discordance- and rate-through-time plots was therefore still needed.
Selection of Species-Tree Topology.
We used the same topology for both the morphological and molecular analyses. Conflict and morphological rates were mapped onto the species tree resolved in the study from which the genomic data were sourced.
Supplementary Material
Acknowledgments
We thank J. Pease for comments that improved the manuscript. C.P.-F. thanks G. Auteri and M. Foote for helpful conversations. S.A.S. thanks his laboratory group for comments on early results. G.W.S. thanks W. DiMichele for comments on early results. C.P.-F. was supported as T.C. Chamberlin Postdoctoral Fellow in the Department of Geophysical Sciences at the University of Chicago during the conception and execution of this project. G.W.S. acknowledges support from the Chinese Academy of Sciences, the President’s International Fellowship Initiative (No. 2020PB0009), and the China Postdoctoral Science Foundation International Postdoctoral Exchange Program. S.A.S. was supported by University of Michigan MICDE Catalyst Award and NSF DEB-1917146.
Footnotes
The authors declare no competing interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2023058118/-/DCSupplemental.
Data Availability
Gene trees, dated species trees, phenotypic matrices, and code used to perform the analyses are publicly available from Figshare: https://figshare.com/articles/dataset/pf_stull_smith_tgz/13190816/2 (65). All other study data are included in the article and/or SI Appendix.
References
- 1.Smith S. A., Moore M. J., Brown J. W., Yang Y., Analysis of phylogenomic datasets reveals conflict, concordance, and gene duplications with examples from animals and plants. BMC Evol. Biol. 15, 150 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Chen M. Y., Liang D., Zhang P., Phylogenomic resolution of the phylogeny of laurasiatherian mammals: Exploring phylogenetic signals within coding and noncoding sequences. Genome Biol. Evol. 9, 1998–2012 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Smith S. A., Walker-Hale N., Walker J. F., Brown J. W., Phylogenetic conflicts, combinability, and deep phylogenomics in plants. Syst. Biol. 69, 579–592 (2020). [DOI] [PubMed] [Google Scholar]
- 4.Song S., Liu L., Edwards S. V., Wu S., Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model. Proc. Natl. Acad. Sci. U.S.A. 109, 14942–14947 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wickett N. J., et al., Phylotranscriptomic analysis of the origin and early diversification of land plants. Proc. Natl. Acad. Sci. U.S.A. 111, E4859–E4868 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Nosenko T., et al., Deep metazoan phylogeny: When different genes tell different stories. Mol. Phylogenet. Evol. 67, 223–233 (2013). [DOI] [PubMed] [Google Scholar]
- 7.One Thousand Plant Transcriptomes Initiative , One thousand plant transcriptomes and the phylogenomics of green plants. Nature 574, 679–685 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Simpson G. G., Tempo and Mode in Evolution (Columbia University Press, New York, NY, 1944). [Google Scholar]
- 9.Eldredge N., Gould S. J., “Punctuated equilibria: An alternative to phyletic gradualism” in Models in Paleobiology, Schopf T. J. M., Ed. (Freeman, San Francisco, 1972), pp. 82–115. [Google Scholar]
- 10.O’Leary M. A., et al., The placental mammal ancestor and the post-K-Pg radiation of placentals. Science 339, 662–667 (2013). [DOI] [PubMed] [Google Scholar]
- 11.Morgan C. C., et al., Heterogeneous models place the root of the placental mammal phylogeny. Mol. Biol. Evol. 30, 2145–2156 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Romiguier J., Ranwez V., Delsuc F., Galtier N., Douzery E. J., Less is more in mammalian phylogenomics: AT-rich genes minimize tree conflicts and unravel the root of placental mammals. Mol. Biol. Evol. 30, 2134–2144 (2013). [DOI] [PubMed] [Google Scholar]
- 13.Teeling E. C., Hedges S. B., Making the impossible possible: Rooting the tree of placental mammals. Mol. Biol. Evol. 30, 1999–2000 (2013). [DOI] [PubMed] [Google Scholar]
- 14.Jarvis E. D., et al., Whole-genome analyses resolve early branches in the tree of life of modern birds. Science 346, 1320–1331 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Prum R. O., et al., A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing. Nature 526, 569–573 (2015). [DOI] [PubMed] [Google Scholar]
- 16.Kenrick P., Crane P. R., The Origin and Early Diversification of Land Plants: A Cladistic Study (Smithsonian Institution Press, Washington, DC, 1997). [Google Scholar]
- 17.Friis E. M., Crane P. R., Pedersen K. R., Early Flowers and Angiosperm Evolution (Cambridge University Press, NY, 2011). [Google Scholar]
- 18.Lee E. K., et al., A functional phylogenomic view of the seed plants. PLoS Genet. 7, e1002411 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Drew B. T., et al., Another look at the root of the angiosperms reveals a familiar tale. Syst. Biol. 63, 368–382 (2014). [DOI] [PubMed] [Google Scholar]
- 20.Yang Y., et al., Improved transcriptome sampling pinpoints 26 ancient and more recent polyploidy events in Caryophyllales, including two allopolyploidy events. New Phytol. 217, 855–870 (2018). [DOI] [PubMed] [Google Scholar]
- 21.Larson D. A., Walker J. F., Vargas O. M., Smith S. A., A consensus phylogenomic approach highlights paleopolyploid and rapid radiation in the history of Ericales. Am. J. Bot. 107, 773–789 (2020). [DOI] [PubMed] [Google Scholar]
- 22.Shen H., et al., Large-scale phylogenomic analysis resolves a backbone phylogeny in ferns. Gigascience 7, 1–11 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ran J.-H., Shen T.-T., Wang M.-M., Wang X.-Q., Phylogenomics resolves the deep phylogeny of seed plants and indicates partial convergent or homoplastic evolution between Gnetales and angiosperms. Proc. Biol. Sci. 285, 20181012 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Jones K. E., et al., PanTHERIA: A species‐level database of life history, ecology, and geography of extant and recently extinct mammals: Ecological archives E090‐184. Ecology 90, 2648 (2009). [Google Scholar]
- 25.Calder W. A., Ecological scaling: Mammals and birds. Annu. Rev. Ecol. Syst. 14, 213–230 (1983). [Google Scholar]
- 26.Stearns S. C., The influence of size and phylogeny on patterns of covariation among life-history traits in the mammals. Oikos 41, 173–187 (1983). [Google Scholar]
- 27.Livezey B. C., Zusi R. L., Higher-order phylogeny of modern birds (Theropoda, Aves: Neornithes) based on comparative anatomy. II. Analysis and discussion. Zool. J. Linn. Soc. 149, 1–95 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Pigot A. L., et al., Macroevolutionary convergence connects morphological form to ecological function in birds. Nat. Ecol. Evol. 4, 230–239 (2020). [DOI] [PubMed] [Google Scholar]
- 29.Brockington S. F., et al., Phylogeny of Caryophyllales sensu lato: Revisiting hypotheses on pollination biology and perianth differentiation in the core Caryophyllales. Int. J. Plant Sci. 170, 627–643 (2009). [Google Scholar]
- 30.Brockington S. F., Walker R. H., Glover B. J., Soltis P. S., Soltis D. E., Complex pigment evolution in the Caryophyllales. New Phytol. 190, 854–864 (2011). [DOI] [PubMed] [Google Scholar]
- 31.Brockington S., Dos Santos P., Glover B., De Craene L. R., Androecial evolution in Caryophyllales in light of a paraphyletic Molluginaceae. Am. J. Bot. 100, 1757–1778 (2013). [DOI] [PubMed] [Google Scholar]
- 32.Carlquist S., Caryophyllales: A key group for understanding wood anatomy character states and their evolution. Bot. J. Linn. Soc. 164, 342–393 (2010). [Google Scholar]
- 33.Ronse De Craene L. P., Reevaluation of the perianth and androecium in Caryophyllales: Implications for flower evolution. Plant Syst. Evol. 299, 1599–1636 (2013). [Google Scholar]
- 34.Ronse De Craene L. P., Brockington S. F., Origin and evolution of petals in angiosperms. Plant Ecol. Evol. 146, 5–25 (2013). [Google Scholar]
- 35.Timoneda A., et al., The evolution of betalain biosynthesis in Caryophyllales. New Phytol. 224, 71–85 (2019). [DOI] [PubMed] [Google Scholar]
- 36.Schönenberger J., Friis E. M., Fossil flowers of ericalean affinity from the Late Cretaceous of Southern Sweden. Am. J. Bot. 88, 467–480 (2001). [PubMed] [Google Scholar]
- 37.Crepet W. L., Nixon K. C., Daghlian C. P., Fossil Ericales from the Upper Cretaceous of New Jersey. Int. J. Plant Sci. 174, 572–584 (2013). [Google Scholar]
- 38.Martínez C., et al., Rariglanda jerseyensis, a new ericalean fossil flower from the Late Cretaceous of New Jersey. Botany 94, 747–758 (2016). [Google Scholar]
- 39.Crepet W. L., Timing in the evolution of derived floral characters: Upper Cretaceous (Turonian) taxa with tricolpate and tricolpate-derived pollen. Rev. Palaeobot. Palynol. 90, 339–359 (1996). [Google Scholar]
- 40.Nixon K. C., Crepet W. L., Late Cretaceous fossil flowers of ericalean affinity. Am. J. Bot. 80, 616–623 (1993). [Google Scholar]
- 41.Rose J. P., et al., Phylogeny, historical biogeography, and diversification of angiosperm order Ericales suggest ancient Neotropical and East Asian connections. Mol. Phylogenet. Evol. 122, 59–79 (2018). [DOI] [PubMed] [Google Scholar]
- 42.Magallón S., Gómez-Acevedo S., Sánchez-Reyes L. L., Hernández-Hernández T., A metacalibrated time-tree documents the early rise of flowering plant phylogenetic diversity. New Phytol. 207, 437–453 (2015). [DOI] [PubMed] [Google Scholar]
- 43.Zhong B., Yonezawa T., Zhong Y., Hasegawa M., The position of gnetales among seed plants: Overcoming pitfalls of chloroplast phylogenomics. Mol. Biol. Evol. 27, 2855–2863 (2010). [DOI] [PubMed] [Google Scholar]
- 44.Crane P. R., Phylogenetic analysis of seed plants and the origin of angiosperms. Ann. Mo. Bot. Gard. 72, 716–793 (1985). [Google Scholar]
- 45.Pryer K. M., et al., Horsetails and ferns are a monophyletic group and the closest living relatives to seed plants. Nature 409, 618–622 (2001). [DOI] [PubMed] [Google Scholar]
- 46.Gensel P. G., Glasspool I., Gastaldo R. A., Libertin M., Kvacek J., “Back to the beginnings: The Silurian-Devonian as a time of major innovations in plants and their communities” in Nature through Time, Martinetto E., Tschopp E., Gastaldo R., Eds. (Springer Textbooks in Earth Sciences, Geography and Environment, Springer, Cham, Switzerland, 2020), pp. 367–398. [Google Scholar]
- 47.Harrison C. J., Morris J. L., The origin and early evolution of vascular plant shoots and leaves. Philos. Trans. R. Soc. Lond. B Biol. Sci. 373, 20160496 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Pease J. B., Haak D. C., Hahn M. W., Moyle L. C., Phylogenomics reveals three sources of adaptive variation during a rapid radiation. PLoS Biol. 14, e1002379 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Morales-Briones D. F., et al., Disentangling sources of gene tree discordance in phylotranscriptomic datasets: A case study from Aamaranthaceae sl. Syst. Biol. 70, 219–235 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Maddison W. P., Knowles L. L., Inferring phylogeny despite incomplete lineage sorting. Syst. Biol. 55, 21–30 (2006). [DOI] [PubMed] [Google Scholar]
- 51.Pease J. B., Hahn M. W., More accurate phylogenies inferred from low-recombination regions in the presence of incomplete lineage sorting. Evolution 67, 2376–2384 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Walker J. F., et al., Widespread paleopolyploidy, gene tree conflict, and recalcitrant relationships among the carnivorous Caryophyllales. Am. J. Bot. 104, 858–867 (2017). [DOI] [PubMed] [Google Scholar]
- 53.Avise J. C., Robinson T. J., Hemiplasy: A new term in the lexicon of phylogenetics. Syst. Biol. 57, 503–507 (2008). [DOI] [PubMed] [Google Scholar]
- 54.Guerrero R. F., Hahn M. W., Quantifying the risk of hemiplasy in phylogenetic inference. Proc. Natl. Acad. Sci. U.S.A. 115, 12787–12792 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Mendes F. K., Hahn M. W., Gene tree discordance causes apparent substitution rate variation. Syst. Biol. 65, 711–721 (2016). [DOI] [PubMed] [Google Scholar]
- 56.Mendes F. K., Fuentes-González J. A., Schraiber J. G., Hahn M. W., A multispecies coalescent model for quantitative traits. eLife 7, e36482 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Wright S., “The roles of mutation, inbreeding, crossbreeding and selection in evolution” in Proceedings of the 6th International Congress of Genetics (1932), pp. 356–366.
- 58.Marques D. A., Meier J. I., Seehausen O., A combinatorial view on speciation and adaptive radiation. Trends Ecol. Evol. 34, 531–544 (2019). [DOI] [PubMed] [Google Scholar]
- 59.Parins-Fukuchi C., Mosaic evolution, preadaptation, and the evolution of evolvability in apes. Evolution 74, 297–310 (2020). [DOI] [PubMed] [Google Scholar]
- 60.McGee M. D., et al., The ecological and genomic basis of explosive adaptive radiation. Nature 586, 75–79 (2020). [DOI] [PubMed] [Google Scholar]
- 61.Benton M. J., Twitchett R. J., How to kill (almost) all life: The end-Permian extinction event. Trends Ecol. Evol. 18, 358–365 (2003). [Google Scholar]
- 62.Nixon K. C., Crepet W. L., Stevenson D., Friis E. M., A reevaluation of seed plant phylogeny. Ann. Mo. Bot. Gard. 81, 484–533 (1994). [Google Scholar]
- 63.Chartier M., et al., How (much) do flowers vary? Unbalanced disparity among flower functional modules and a mosaic pattern of morphospace occupation in the order Ericales. Proc. Biol. Sci. 284, 20170066 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Stamatakis A., RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Parins-Fukuchi C., Smith S., Stull G., Gregory.pf_stull_smith.tgz. Figshare. 10.6084/m9.figshare.13190816.v2. Deposited 4 November 2020. [DOI]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Gene trees, dated species trees, phenotypic matrices, and code used to perform the analyses are publicly available from Figshare: https://figshare.com/articles/dataset/pf_stull_smith_tgz/13190816/2 (65). All other study data are included in the article and/or SI Appendix.