Significance
Why rates of phenotypic evolution vary across the tree of life is still not well known. A widespread view holds that fast rates result from the colonization of empty adaptive zones, or sets of similar niches, followed by rate deceleration with the crowding of available niche space. By developing a flexible phylogenetic model that examines how rates vary across lineages and applying it to thousands of extinct and extant species, I find that morphological rates of evolution for body size have not slowed down, regardless of the increase in phenotypic varieties. Rather, macroevolutionary trends result from persistent evolution of lineages’ body size followed by their selective extinction, suggesting unattenuated evolutionary dynamics.
Keywords: evolutionary rates, adaptive landscape, macroevolution, phylogenetic models, species sorting
Abstract
Rates of evolution are fundamental to understand the processes that shaped the history of life. The predominant view holds that high rates of phenotypic evolution result from lineage transitions across peaks in an adaptive landscape, with subsequent slow-downs, but evidence remains debated. I developed a phylogenetic “diffused Brownian motion” model that characterizes nuanced variations in evolutionary rates and use it to comprehensively assess body size evolution and its underlying rates for 2,950 extinct and 792 extant species that span over 450 Mys of evolution. I find that evolutionary rates do not conform to expectations from adaptive landscape theory, but rather have been stable, unaffected by the accumulation of phenotypic disparity. Long-term evolutionary trends, such as several net increases in clade-average body size, result both from sustained evolution at the lineage level and the sorting of species phenotypes and their underlying evolutionary rates at the clade level, sometimes acting in opposite directions. These findings substantiate an active role of species in shaping their environment that generate continuous novelty of life forms.
A fundamental theme in evolutionary biology revolves around the tempo and mode of species’ niche diversification during evolutionary radiations, which stems from the observed variation in rates of phenotypic novelty (1, 2). Most notable is the common pattern of geologically “abrupt” appearances of many new taxa in the fossil record, particularly at higher levels, followed by comparatively slower modifications (1, 3, 4). Condoning some redundancy that ensues from defining taxa based on similarity, a rapid and early phenotypic diversification appears to characterize several evolutionary radiations together with a paucity of “transitional” fossils between higher taxa (1, 3, 5). Simpson canonized Wright’s adaptive genetic landscape metaphor at the organism level (6) into phenotypic terms at the species level where a continuous function with peaks and valleys represent species fitness (or “adaptive value”) in phenotypic space (1). This phenotypic adaptive landscape (henceforth, simply “adaptive landscape”) depicts an adaptive topography where external factors generate multiple local fitness maxima along phenotypic space toward which species are positively selected, and which he called “adaptive zones” or “peaks.” These adaptive peaks are separated by valleys of low fitness, with gradients representing the intensity of selection (1). Simpson explained these infrequent (un)observed instances of rapid evolution as species movements from one “adaptive zone” or “peak” to another (1, 7, 8). Colonization of adaptive zones can happen progressively, or, Simpson continued, during “adaptive radiations,” which he defined as almost “simultaneous” transitions into divergent adaptive zones from one ancestral “adaptive type,” as in the origin of most higher taxa (2, 9, 10).
I refer to this idea of species evolving within and across shared adaptive peaks as “adaptive landscape theory.” Here, the opening of “ecological opportunities,” including acquisition of “key innovations” that enable access to new resources, dispersal to new environments, or removal of previous occupants, facilitate the colonization of new adaptive zones (2, 11). In turn, entry into a zone often triggers a steep increase in rates of lineage and phenotypic diversification, that is, an “adaptive radiation” (2, 11, 12). Because these adaptive zones represent discontinuous sets across phenotypic space, after this initial phase of explosive divergence, further accumulation of disparity within zones decelerates as lineages subdivide available (“intrazonal”) adaptive space (2). While a combination of diversification of taxa and ecologically important traits is expected (2, 13), the importance of the former resides mostly in that adaptation to different zones must entail separate lineages. That is, there can be taxonomic diversification without disparity, but not the other way around. It seems pertinent then to study the evolution of species phenotypes and their underlying rates to test the general predictions behind adaptive landscape theory. Namely, that at one or few points during the evolutionary history of the clade there are sharp increases in evolutionary rates as a consequence of the colonization of new adaptive zones, and that subsequent evolutionary rates slow down as a result of partitioning a limited niche space (13, 14) (Fig. 1).
Fig. 1.
Expectations under classic adaptive landscape theory. Pictogram illustrating expected patterns for (A) the evolving trait and (B) the underlying evolutionary rate under the classic paradigm of adaptive landscape theory (2). Some tip names (i to viii) are included to ease correspondence between trait and rate evolution pattern plots. Here, there are two further transitions among “adaptive peaks,” one leading to the tips that include ii, iii, and vii, and a subsequent one leading to the tips that include vi.
Despite the pervasive influence and adoption of Simpson’s phenotypic adaptive landscape iconography that underlie adaptive landscape theory, its pertinence, importance, and prevalence in macroevolutionary dynamics remain debated (14–21). Arguably, this partly derives from the difficulty in laying out and identifying its predictions when applied to empirical observations. Early on, adaptive radiations have been broadly singled out based on a relatively rapid and, for fossil groups, an early, accumulation of phenotypic disparity, that is, trait variance (2, 3, 5, 13, 22). However, focusing on the ancestor-descendant (i.e., phylogenetic) distribution of phenotypic evolutionary rates allows to directly measure lineage evolution through time; that is, using tempo to infer mode (1, 14, 23, 24). In this regard, the pioneering “early burst” phylogenetic models, where Brownian motion (BM) evolutionary rates are allowed to decrease through time, introduced a formal examination of adaptive landscape theory (14, 25). Implicit in these previous tests, however, are several potentially misleading assumptions. First, that the taxonomic delimitation of the empirical clade matches the (only) adaptive radiation (i.e., that the adaptive radiation initiated at the ancestor of the designated group of organisms). Second, that there were no additional adaptive transitions involved. And, third, that all lineages share the same exact evolutionary rate, which is only allowed to decrease, disregarding any between-lineage rate heterogeneity. A clade with rate increases scattered along some lineages, expected from nested adaptive radiations, would not lead these methods to detect an overall decreasing rate, as has indeed been found (24–27). Thus, even if life arose from a series of adaptive radiations, as previously advocated (2, 12), wherein evolutionary rates predominantly decelerate after punctuated increases at adaptive zones transitions (Fig. 1), one would be unlikely to recover the recurrent early bursts pattern.
Unraveling mode from tempo demands a flexible model capable of characterizing a range of evolutionary rate dynamics along separate lineages, including those expected under adaptive landscape theory (Fig. 1).
A Diffused Model of Evolutionary Rates
Here, I developed the “diffused Brownian motion” (“DBM”) model, which allows evolutionary rates to diffuse continuously along a phylogenetic tree, enabling a fine-grained characterization of rate dynamics (SI Appendix, Fig. S1; Materials and Methods). Specifically, I expand the “strict” Brownian motion (BM) process, wherein the instantaneous stochastic diffusion (evolutionary) rate is constant, to a process where instantaneous stochastic rates are themselves also continuously and separately diffusing according to a geometric Brownian motion process (“GBM;” Materials and Methods). The choice of a GBM is natural since it reflects the nonnegative nature of rates and assumes a multiplicative, rather than additive, diffusion (i.e., based in percentage rather than absolute value), as expected from multiplicative interactions among determining factors (21, 28–31). Unlike previous major methodological developments that account for rate heterogeneity during trait evolution (Materials and Methods), the DBM model allows for separate lineage-specific rates that change continuously through time.
At the start of the DBM process there is one lineage with continuous trait x0 and underlying evolutionary rate (i.e., the instantaneous diffusion rate), which represents the expected accumulation of trait variance after a unit time (i.e., units ; SI Appendix, Fig. S1). With time, lineages evolve their specific trait under a concomitantly evolving evolutionary rate , and, at speciation, lineages inherit both their trait and their rate to their descendants (SI Appendix, Fig. S1). Overall trends in traits are reflected in the drift αx and, for evolutionary rates, in the drift ασ, with an expectation to increase or accelerate when positive or to decrease or decelerate when negative. Finally, the diffusion of the logarithm of , takes place with rate , interpreted as the percentage expected change in evolutionary rates (units ), and can be understood as the variation in rates across time and lineages, or the evolution of evolutionary rates (SI Appendix, Fig. S1).
Estimating the likelihood under a process where BM diffusion rates follow a GBM process remains intractable (31), but I was able to perform full probabilistic estimation under the DBM using Bayesian data augmentation (“DA;” Materials and Methods). The solution is to “augment” the data such that the evaluation of the likelihood given the augmented data becomes possible and viable, leading to an estimation of the parameters’ posterior distribution using Monte Carlo sampling (32), and has been successfully employed previously in phylogenetic models (21, 33–35) (SI Appendix, Fig. S1; Materials and Methods). For the DBM, this entails the imputation of trait, , and evolutionary rate, , histories (i.e., the full diffusion paths) along the phylogenetic tree in such a way that they remain consistent with the observed data (SI Appendix, Fig. S1). Missing data or variance around species traits can be organically integrated into the estimation of these probabilities. Finally, this procedure returns not only the posterior distribution of the major process parameters (i.e., x0, , αx, ασ and ) but also of the augmented processes, which facilitate post hoc analyses and visualizations (SI Appendix, Fig. S1; Materials and Methods) (21, 34). I developed, implemented, and validated a DA algorithm to perform inference under the DBM model (SI Appendix, Fig. S2; Materials and Methods). The DBM model takes a time-calibrated phylogenetic tree with or without fossils together with trait information with or without uncertainty for some or all extinct and extant organisms in the tree.
Importantly, the DBM tests the predictions from adaptive landscape theory. Foremost, a slow-down in evolutionary rates is demonstrated by a negative evolutionary rate drift parameter ασ. Second, the few, if any, lineages that underwent zone transitions are readily accounted for by the model. Unlike the “early burst” models in which the assumption is that all lineages share the same (decaying) rate at any moment in time (14, 25), a negative ασ would be congruent with the core expectations of adaptive landscape theory by indicating a tendency for evolutionary rates for each lineage to wane with time, regardless of their magnitude, while allowing for deviations from it.
Rate Dynamics of Phenotypic Evolution
I applied the DBM model to examine phenotypic rate variation across evolutionary radiations. Jointly incorporating available paleontological and neontological information not only substantially improves the reconstruction of past evolutionary dynamics but might support evolutionary models that conflict when applied to extant taxa alone (36–41). Thus, I considered groups for which available phylogenetic information included a representative sample of extinct and extant species and was estimated under Fossilized Birth–Death (FBD) model (42, 43) (Materials and Methods). Together, these groups comprise over extinct and extant species across taxonomically diverse groups, with species representing archosaurs (including nonavian dinosaurs, birds, pseudosuchians), sphenodonts, mammals, turtles, therocephalians, ray-finned fishes, and arthropod groups, that together span the last Mys (SI Appendix, Table S1, Materials and Methods). Moreover, they embody dramatic ecological transitions, including shifts between aerial, terrestrial, and aquatic environments, as well as displaying a great variety of ways of life.
To assess their phenotypic evolutionary dynamics I focused on species average body size. Size is a fundamental feature of every organism, profoundly interrelated with most of their ecological, physiological, and behavioral features (44–48) and should be indicative of patterns expected from adaptive landscape theory (49). As such, its evolutionary dynamics across a wide range of taxa has been much studied in elucidating general patterns and underlying processes of species evolution (14, 18, 25, 41, 50–52). Besides its biological importance, body size can be estimated for a large number of species, including extinct specimens with fragmentary evidence, enabling a comparable and comprehensive analysis. I compiled body size information from the literature for the majority of species within the taxa, using weight, body length, and, in some cases, body measurement proxies (SI Appendix, Table S1 and Dataset S1; Materials and Methods). Missing information and, when available, intraspecific variance, measurement error, or uncertainty around the estimates was integrated (SI Appendix, Table S1). To examine body size evolutionary dynamics, I performed inference under the DBM model on each group, across a sample of empirical trees to account for phylogenetic uncertainty.
Body size has been extraordinarily dynamic, with several groups exhibiting differences exceeding an order of magnitude (Fig. 2 and SI Appendix, Figs. S3–S19). Some of the largest and long-lived radiations, including sea scorpions, pseudosuchians, and dinosaurs (Fig. 2A, D, and G and SI Appendix, Figs. S3, S6, S9, and S20 A, D, and G), saw a relatively early and rapid accumulation in body size disparity, in ostensible accordance with adaptive landscape theory. Most others, however, display either a gradual accumulation of body size differences, as in turtles, tetraodontiform fish, sloths, and new world monkeys (Fig. 2E, H, M, and O and SI Appendix, Figs. S7, S10, S15, S17, and S20 E, H, M, and O), or a delayed diversification as in penguins, short-faced kangaroos, and canins (Fig. 2I, N, and P and SI Appendix, Figs. S11, S16, S18, and S20 I, M, and P).
Fig. 2.

Diffused evolution of body size across distinct evolutionary radiations conflict with expectations from classic adaptive radiation. Inference on trait and rate evolutionary patterns under the diffused Brownian motion process, where traits and its underlying rates undergo separate yet concomitant diffusion processes, as described by the stochastic differential equations on the Top Left. For each group (A–Q), the first column shows the average posterior reconstruction for body size, colored by the underlying rates (i.e., color gradient from black to yellow). The second column represents the stacked posterior distributions of the drift in traits, αx, over sample posterior of trees [except for (G and M), where only one empirical tree was used; see Materials and Methods]. Distributions with a strong positive or negative trend (i.e., those where does not overlap the 95% Credible Interval) are shown in red or blue, respectively. The third column depicts the evolution of the underlying evolutionary rates, colored by the evolving trait (i.e., color gradient from blue to red). For visual clarity only one phylogenetic tree out of from the posterior tree distribution is shown. The fourth column follows the description of the second, but for the evolutionary rate drift, ασ. The evolutionary radiations represented, and their measurements for body size, are (A) sea scorpions (Eurypterida; body length [mm]), (B) therocephalians (skull length [mm]), (C) Sphenodontidae (lower jaw length [mm]), (D) pseudosuchians (compound proxy for body size), (E) turtles (Testudinata; carapace length [mm]), (F) pterosaurs (wingspan in M), (G) nonavian dinosaurs (body mass [kg]), (H) tetraodontiform fish (body length [cm]), (I) penguins (Spheniscidae; body mass [kg]), (J) proboscideans (body mass [kg]), (K) brontotheres (body mass [kg]), (L) cetaceans (body length [mm]), (M) sloths (Folivora; body mass [kg]), (N) short-faced kangaroos (Sthenurinae; body mass [kg]), (O) new world monkeys (Platyrrhini; body mass [kg]), (P) Caninae (body mass [kg]), and (Q) horses (Equinae; body mass [kg]). Silhouettes from phylopic.org.
The Unfit Macroevolutionary Adaptive Landscape
While evolutionary rates also varied substantially within radiations, usually by orders of magnitude (Fig. 2 and SI Appendix, Figs. S3–S19), they did not exhibit a tendency to decrease, as predicted by adaptive landscape theory. Remarkably, rather than slowing down, most groups sustain their evolutionary rates throughout their evolutionary history (i.e., ; Fig. 2), with turtles even revealing accelerating rates during their ca. Mys of evolution (95% Highest Posterior Density for across phylogenetic uncertainty; Fig. 2E). The addition of variation around trait values for clades without this information does not affect results (SI Appendix, Fig. S22). Only one out of all groups, the new world monkeys, fulfilled the expectation of a general decrease in evolutionary rates (i.e., 95% Highest Posterior Density for across phylogenetic uncertainty, Fig. 2O), with the exception of two lineages, as would be expected from subsequent zone transitions (Fig. 2O and SI Appendix, Fig. S17). For groups that initiated after or endured abrupt extinction events, there is no evidence of an increase in rates in their aftermath to then be followed by a renewed decrease (Fig. 2 and SI Appendix, Fig. S23), as expected from ecological opportunity after the opening of previously occupied adaptive zones. Finally, an increasing accumulation of phenotypic disparity, usually posited as evidence for adaptive landscape dynamics (3, 5), was not associated with decreasing trends in evolutionary rates (Fig. 2 and SI Appendix, Fig. S20).
Together, these results contradict the basic tenets of adaptive landscape theory, yet remain contingent on the pertinence of using body size and the DBM model to examine its predictions. By maximizing taxonomic coverage in the search for general patterns, I focused only on species body size, which, while determining several facets of organismic biology, ignores the multivariate complexity of phenotypes (20, 53). The uncommonness of rate decreases in body size, however, would imply that, even in the extreme case where rates for other functionally relevant phenotypic axes slowed down, body size would seem to continuously supply an axis of evolutionary versatility. Given that one given body size can accommodate various phenotypes, considering the intricacies of forms on top of their body size can only increase their total amount of change. Since organisms are “integrated entities” (54), sustained evolution in at least one relevant biological variable, here body size, seems to dispute the basic expectations from adaptive landscape theory.
Another concern is the ability of the DBM model to accurately capture trait dynamics under an adaptive landscape. For instance, if evolutionary rates do not diffuse but rather undergo instantaneous shifts. While the Brownian diffusion assumption of rates may lead to smoothing of trait and rate variation through time, the empirical reconstructions evince some ability to recover sudden changes (see, e.g., >10× increase in rates in less than Mys in Equidae, SI Appendix, Fig. S19). Regardless of the mode of trait and rate evolution, if the total amount of trait change decreases, as expected from periodic early burst patterns, the DBM should still be able to capture a tendency of rates to slow down.
An argument could be made, however, for a more intricate adaptive landscape beyond that from its original formulation, wherein fluctuating ecological opportunities or frequent transitions among adaptive zones account for the absence of a general decrease in rates of evolution. That is, since adaptive zones (or peaks) across phenotypic space can take all shapes and forms, in terms of their number, narrowness, distribution, and displacement (1), it could be argued to still have generated the observed evolutionary patterns across groups. Paradoxically, given that any evolutionary pattern can be described by retrospectively reaccommodating the dynamics of the adaptive zones themselves, or, put another way, no empirical pattern can refute the existence of a phenotypic adaptive landscape, this widespread view faces the logical flaw of not being empirically falsifiable (55). In practice, this is compounded by the common procedure of defining these zones post hoc by the very existence of species (e.g., “there is a penguin adaptive zone because there are penguins”), quickly becoming tautological (15). Although imaginary constructs, or models, such as the phenotypic adaptive landscape, help us make sense of the world, the problem is often that, once created, we tend to consider them as objective realities (56). Those for which no evidence can refute their validity (or whose ontological status is no longer even questioned), can give the false appearance of explanation.
Prominence of Species Sorting in Determining Macroevolutionary Trends
Rather than mapping the peaks of an adaptive landscape, with species assumed to be in an optimal state, the distribution of species phenotypes (and their underlying rates) can instead reflect the differential proliferation and removal of species within clades (27, 57, 58). Under this view, evolutionary trends are not predetermined by some adaptive zones available from the outset, but by the differential survival of species that constructed those spaces with time (50, 57, 59). Germane to this study is the much discussed trend of an increase in species size over evolutionary time, dubbed “Cope’s rule” (60, 61). Most of the groups studied here do seem to follow this pattern, with a net increase in average body size since the start of the radiation ( out of , with therocephalians, turtles, and penguins as exceptions; SI Appendix, Figs. S3–S19). Most of these enlargements, however, seem to be driven by the differential diversification of species rather than by increases at the lineage-level, as evinced by the drift in body size, αx (Fig. 2). From those groups that increased in average size, only nonavian dinosaurs, cetaceans, tetraodontiform fish, and sloths presented a lineage tendency to increase in size (i.e., ). Illustrative examples include clades notorious for attaining gigantic sizes, such as proboscideans and brontotheres, which did not reflect size increase at the lineage level (Fig. 2 and SI Appendix, Figs. S12 and S13), and, turtles that, despite a lineage trend to become larger, did not do so at the clade level (Fig. 2 and SI Appendix, Fig. S7).
I then examined the extent to which species sorting accounts for changes in average body size relative to lineage-level evolution (Fig. 3). Without explicitly incorporating unobserved speciation events and their relation to trait dynamics into past reconstructions, one cannot disentangle phenotypic evolution due to speciation from anagenetic change. However, one can have a minimum estimate of how the disappearance of observed species generated the empirical evolutionary trends (50, 62). I estimated group-average body size trajectories (i.e., “trends”) and estimated the amount of change in the trend explained by the removal of species, which I call the “prominence” of species sorting. Species sorting should not be confused with species-level selection (a cause of sorting) (63), and I emphasize that the prominence of species sorting used here only quantifies the relative importance of observed species disappearance compared to both observed speciation and anagenetic change (rather than of speciation and extinction compared to anagenetic change). These analyses were made possible by using the posterior data augmented trait histories rather than focusing on the trait and rate posterior averages, since the latter are not themselves DBM processes. Prominence ranges from , with sign denoting whether sorting has a decrease or increase effect on trends, and magnitude reflecting the proportion of change explained (Fig. 3; Materials and Methods). Intuitively, the extinction of species with similar body size with respect to the current clade average results in a prominence close to , whereas if larger or smaller species are selectively pruned out, prominence will be close to and , respectively (Fig. 3A–C).
Fig. 3.
Prominence of species sorting in driving evolutionary trends. Two illustrative examples on how the selective removal of species can lead to evolutionary trends in clades. (A–C) At about Mya, coinciding with the Triassic-Jurassic extinction event, average turtles (Testudinata) body size was reduced drastically. (A) Shows the posterior average carapace length (m) with 50% and 95% Credible Intervals (CI) in lighter shades. (B) Five posterior lineage-specific evolutionary trajectories, showcasing the prominence of morphology removal over their transformation in generating evolutionary trends. (C) Average prominence of species sorting (i.e., the amount of average change explained by species removal over species change) through time, with 95% CI shown (red) when not overlapping , as at Mya. (D–F) During the Lower Devonian, a loss of diversity of sea scorpions (Eurypterida) caused a slow-down in body size evolutionary rates. (D) Average evolutionary rate of body length (m) with 50% and 95% CI shown in lighter shades. (E) Lineage-specific evolutionary rates for five posterior samples. (F) Prominence of species sorting in determining rate trends, with 95% CI shown (red) when not overlapping , as is the case at the ca. Mya and ca. Mya decreases.
Among hundreds of strong trend changes (those consistent across posterior trait histories) across groups, over 70% were explained by the disappearance of species (when measured at My intervals, ca. 60% at the Mys interval; SI Appendix, Figs. S23 and S24). When summed across all change in mean body size, species sorting explained, on average, and ranged from in sphenodontids to in nonavian dinosaurs (mean of with range to at the Mys interval; SI Appendix, Fig. S25). Nonetheless, simulations show that the total amount of average trait change explained by species sorting is almost entirely determined by the reconstructed phylogenetic tree (SI Appendix, Fig. S25). Much more relevant is then the directionality of prominence (which simulations confirm it does not depend on the empirical trees, as expected; SI Appendix, Fig. S26). The sorting of species sizes was not always random but its directionality was clade-specific (SI Appendix, Fig. S26): On average, smaller species were selectively pruned out in pterosaurs, proboscideans, brontotheres, short-faced kangaroos, and horses, while larger species were selectively removed in pseudosuchians, turtles, tetraodontiforms, cetaceans, and sloths. Thus, for some groups, species sorting was largely responsible for their average increase in body size, while, for others, most notably turtles, tetraodontiform fish, and cetaceans, species sorting acted against their lineage-level tendency for enlargement.
Species sorting could happen for evolutionary rates as well if, say, fast-evolving lineages are more likely to survive, so I carried on the same analysis using body size evolutionary rates (Fig. 3D–F). A notable, yet lower proportion of strong trend changes in evolutionary rates were explained by the sorting of species (about 44% and 39% at My and Mys interval, respectively). I found no consistent signal of selectively pruning fast (or slowly) evolving species: While on average, fast-evolving lineages were pruned out in pseudosuchians, turtles, and canins, slowly evolving species were more often removed in proboscideans, short-faced kangaroos, and new world monkeys (SI Appendix, Fig. S26). Unexpectedly, these results for rates again show that tendencies below the species level can oppose in directionality those above the species level, as with turtles, showing accelerated rates of lineage evolution but a selective pruning of fast-evolving species, or with new world monkeys, with decreasing rates of body size evolution yet the pruning of slowly evolving species.
Together, these findings suggest that a large extent of long-term evolutionary trends are shaped by the differential survival of species, and that the causes of the birth and death of organisms can differ from those of species (16, 27, 50).
Conclusions
It has been long recognized that the tempo and mode in evolution is reflected in the variation and distribution of evolutionary rates across the tree of life (1, 16). Incorporating the hyperdimensionality in phenotypes beyond body size, together with assessing how speciation determines phenotypic rate variation, as expected, for instance, by speciational evolution (62), are exciting avenues for future research made possible through the combination of the algorithms presented here and before (21, 34). While the findings in this study highlight the strong heterogeneity in evolutionary rates within and among taxa, their dynamics challenge the longstanding view of macroevolution wherein species are predetermined to fill a limited set of “adaptive zones” (15, 64). Adaptive landscape theory presumes the preexistence of a fitness gradient common to all species, autonomously determined by environmental factors [as epitomized in Ornstein–Uhlenbeck models (65)], and where species are reduced to passive entities (64, 66). This view predicts that evolutionary dynamics leads to determinate and relatively stable phenotypes that have been prestated before the emergence of species (66, 67). Although overarching physical constraints evidently exist (e.g., pressures of life on land), this schematic view ignores how organisms modulate and shape their environment, actively constructing their individual selection pressures, in a self-referential feedback that undermines the predictability embodied by adaptive landscape theory (66, 68). The findings here support instead an ever-expanding diversification of forms, curtailed by selective extinctions to assemble what, on aggregate, generate long-term macroevolutionary trends. Since one function can have a limited number of forms, but any one form has uncountable functions, species’ phenotypes should not be considered as composed of optimal adaptations to preexisting adaptive zones, but rather as enablements that continuously supply new possibilities (58, 69, 70). This is consistent with a more contingent perspective of evolution, indeterminate and far from equilibrium, where organisms actively explore, manipulate, and influence their ever changing environment (68, 71), in a continuous profusion of life forms.
Materials and Methods
Rate Heterogeneity in Trait Evolution Models.
One of the simplest models used to describe the evolution of continuous traits along a phylogenetic tree is BM. Here, the evolution through time of the average trait value of lineage i, denoted , is assumed to follow a Brownian diffusion process described by the stochastic differential equation (SDE): , where denotes the Wiener process (i.e., the standard Brownian motion) and represents the rate of evolution, which is assumed to be time-constant and homogeneous across lineages. This last assumption of lineage-homogeneous rates has been shown to be biologically unrealistic and unfit for testing various hypotheses about the tempo and mode of evolution, encouraging the emergence of more flexible models of trait evolution that allow for heterogeneity in evolutionary rates across lineages and time.
Some phylogenetic models of trait evolution directly test whether their rate has not been constant through time. Here, all lineages alive at some time share the same rate, but the rate is allowed to fluctuate as a function of time or, for instance, following an environmental covariate (72). Pertinent to this study is the “early burst” model (EB) (14), later expanded by the nested EB (25). These models identify only one increase in rates, which for the EB is defined as having happened just before the crown of the tree while the nested EB identifies its location among the branches in the tree, and estimates a parameter that controls how rates slow down afterward. It is important to note that rates are only allowed to be constant or decay, and do not test if rates could have actually increased. Again, the decaying rate is assumed to be the same across all lineages within the clade delineated by the initial shift.
Others rather attempt to extract rate patterns from the data. To this end, they assign distinct evolutionary rates, which might be correlated or not, by either dividing the tree into subtrees, or to each branch in the tree. The Relaxed Random Walk (73) assigns an independent rate to each branch, which becomes identifiable when rates are assumed to share the same prior. FossilBM (74) and BayesTraits (18, 75) identify the number of distinct and uncorrelated rates (and drifts) across the tree. Within each branch, or subclade, rates and drifts are shared and are assumed constant. MuSSCRat (76) examines whether some of the estimated rate heterogeneity in a continuous trait is explained by another discrete character. Finally, models in which rates of evolution themselves follow a GBM have been recently developed in a Maximum Likelihood framework (30), and in a Bayesian framework “evorates” (31). I build on these important developments, which still rely on approximations of the likelihood of a “true” GBM-type rate model by resorting to branch or node specific average rates (30, 31), and add several new features as described below.
Diffused Brownian Motion Model.
Here, I assume that both the trait value and its rate of evolution for a given lineage follow a process of continuous diffusion through time (SI Appendix, Fig. S1). That is, the trait evolves under a stochastic process such that its rate of evolution is itself determined by another stochastic process, a particular instance of “stochastic volatility.” Specifically, the average trait for lineage i, , and the associated evolutionary rate, , evolve following the coupled system of SDEs:
| [1] |
| [2] |
where αx and ασ represents the drift (or average tendency) for traits and evolutionary rates, respectively, and γ the diffusion speed for evolutionary rates (SI Appendix, Fig. S1). That is, if , traits tend to increase with time, while if , traits are expected to decrease over evolutionary time. Similarly, if evolutionary rates have a tendency to accelerate, then , or, contrarily, if the tendency is to decelerate, then . Note that these drifts do not assume that rates are the same for all lineages alive at any time t (as in the EB or nested EB models), but rather describe a tendency for separate lineages to increase or decrease, regardless of their current trait or rate value. The degree of heterogeneity in evolutionary rates across lineages is represented by γ. The use of a GBM for the evolution of rates, , is congruent with their nonnegative nature and with multiplicative drivers, such as the environment–organism interactions (66). At cladogenesis, the trait and associated rate are inherited identically among both daughter lineages. That is, for parent lineage a that gives rise to daughter lineages d1 and d2 at time ta, we have that and that . The assumption that rates are inherited follows from long-standing observations (2), but also from understanding their proximal mechanisms, such as the build up of genetic variation, which are themselves expected to be inherited (mutation rates, generation times, among others). I call this process the “diffused Brownian motion” (DBM) model of trait evolution. Note that when , and as γ becomes infinitely small, the DBM converges to a simple BM process.
Likelihood.
Let a DBM process start at time with initial value and initial rate , then the probability after some time t, for the trait value and rate is
where is the stochastic integral representation.
Closed expression for this probability remains untractable (31), so I resort to numerically solving it using the Euler–Maruyama method. I partition time into a discrete grid of length m, where time steps are spaced from one another by a sufficiently small period δ, such that and where , and generate sample paths for both and at these times. The probability for the DBM process can then be straightforwardly computed as
| [3] |
where
with being the geometric mean, and
Let Ψ be a rooted and time-calibrated phylogenetic tree with or without sampled fossils with E edges. For clarity, if a fossil is a sampled ancestor (i.e., a fossil with at least one sampled descendant), I treat the edge before and after as two different edges. Then, based on a vector of trait values observed from extant or fossil species X, then the total likelihood for the process along the tree is the product of the likelihood (Eq. 3) across all edges
| [4] |
while taking into account the tree structure in the inheritance of traits and rates.
Inference.
I developed Bayesian DA techniques to perform full probabilistic inference on the DBM model given a time-calibrated phylogenetic tree, with extinct and extant lineages, and trait values with their respective measures of uncertainty (if available) for all or some of the sampled lineages (SI Appendix, Fig. S1). In DA, one is able to estimate the posterior density (i.e., the probability of θ, the set of model parameters, given the observed data o) by including augmented data a, such that (32). In DBM, given mean trait values X with associated uncertainty measures S, and phylogenetic tree Ψ, I can estimate the posterior probability for αx, ασ, γ, x0 (the ancestral trait), and (the ancestral evolutionary rate) as well as the whole diffusion paths describing the evolutionary history of the trait and rates along the tree, that is, arrive at . Uncertainty around missing trait information is naturally and properly incorporated in the inference procedure while simultaneously generating posterior predictions for unobserved traits (as well as all rate values).
I start by estimating the evolutionary rate under a simple Brownian motion under phylogenetic independent contrasts (77). I then generate sample paths of the DBM process along the tree using this rate so that they match the observed X, sampling every small time step δ. In practice, edge lengths are not a multiple of δ, so the last time step is always smaller, but I remove this intricacy for clarity and simplicity in the following description. I then use Markov chain Monte Carlo (MCMC) to iteratively sample from the joint posterior distribution, integrating over all likely DBM histories given the data, with the following specific updates. This procedure yields posterior samples of the full trait and respective rate evolution which can be used for post hoc analyses and visualizations (SI Appendix, Fig. S1).
Rate diffusion path updates.
For a diffusion process along a tree with drift a, I make use of the general conjugate relationship where the distribution for a bifurcating node with ancestral lineage q and descendant lineages d1 and d2, with respective edge lengths of tq, , and and absolute rates of sq, , and ,
| [5] |
where . Similarly, for the distribution at a sampled ancestor node, with only one ancestor and one descendant edge, I use the following general conjugate distribution
| [6] |
Initial attempts at sampling the rate diffusion paths using the full conditional distribution were computationally inefficient, so I use simple Metropolis–Hastings (MH) updates for . I first randomly and uniformly select an internal node in the tree. If the node is a bifurcating node, I propose a new rate value (note that ) conditional on the rates at the start of the ancestor and at the end of the daughters using Eq. 5, where , , and . If the node is a sampled ancestor, I propose a new rate value conditional on the rates at the endpoints of the ancestor and daughter using Eq. 6. I then make diffusion path proposals using Brownian bridges conditional on the endpoints and the new proposed value at the node for each lineage. To sample a Brownian bridge for a diffusion process along lineage i, , starting at and ending at ti, conditional on and , I use the conjugate relationship
Note that the drift, ασ, cancels out in this distribution. Finally, if the node leads to a terminal edge, I make a forward BM simulation for starting at the node’s value.
Since the proposal probability cancels out with the diffusion of in the likelihood, the MH acceptance ratio, a, for lineage i becomes
| [7] |
I also added a scaling update for the rates, that is, I multiply all rates by a positive scalar s, itself drawn from a log Normal distribution, such that , and accepted following the MH ratio.
Trait diffusion path updates.
To update the trait diffusion paths I use the full conditional distribution ( is conditionally independent of ασ and γ), that is, I use Gibbs sampling. One can sample directly from this distribution given that the full path of is available. As with the rate updates, I use the conjugate relationships in Eqs. 5 and 6 to update internal nodes, where , , and . If the internal node leads to a sampled ancestor with trait information (i.e., available in X), I simply sample Brownian bridges conditioned on the observed trait value and the rate diffusion path. When there is uncertainty around the observed trait value, I assume it follows a Gaussian distribution with the average observed trait as mean and the respective uncertainty as variance, and then sample using conjugate relationships. Other probability distributions could readily be used as priors for observed trait uncertainty, but for simplicity I assume a Normal variance. When there is no information in trait values, I perform forward DBM simulations. This properly and naturally incorporates extant and fossil missing trait information as well as uncertainty around observed traits. Moreover, following the Bayesian paradigm where posterior inference and prediction are done simultaneously (78), the data augmentation algorithm generates posterior predictions of trait values given the other observed data and the model.
αx, ασ, and γ updates.
I take advantage of the conjugate relationship of the Normal and Inverse Gamma distribution for the drift in traits, αx, and rates ασ, as well as for the diffusion of rates, γ, to perform Gibbs sampling. Specifically, I specify a Gaussian distribution as prior on αx, such that , leading to the following full conditional posterior distribution
where and . Here, is the geometric mean of the evolutionary rates between tj and . A derivation of this distribution can be found in SI Appendix.
For ασ, I also specify a Gaussian distribution, , leading to the full conditional posterior distribution
where L is the tree length (sum of all edge lengths). For γ, I specify an Inverse Gamma distribution, such that , which leads to the full conditional posterior distribution
where . A derivation of these distributions for ασ and γ, can be found in ref. 21 and corresponds to the results in ref. 79.
Validation.
I assessed the behavior of the DBM model inference using a simulation study. I first simulated trees of at least sampled extant or extinct species at the tips using the constant FBD model, with speciation rate and extinction rate of , and fossilization rate of during time units (21). This resulted in trees with an average of and a range of about to tips. I then simulated on each of these trees a DBM process where I uniformly sampled the trait root state following , the root rate state , the traits drift , and the rates drift , and the rates diffusion . For each simulation I performed inference assuming the priors and , and running an MCMC chain for iterations, discarding the first as burn-in and sampling every .
SI Appendix, Fig. S2 shows the comparison between the simulated and posterior values, with good overall model and inference behavior. For the trait and rate root values, x0 and , respectively, as well as the drifts for traits αx and rates ασ, there is proper accuracy and proper coverage, with about 95% of the true values falling within the 95% Highest Posterior Interval (HPD) for these parameters (SI Appendix, Fig. S2). Similarly, when sampling the simulated and the posterior distribution of both and diffusion processes along the tree at regular intervals, there is no bias, good accuracy, and proper average coverage at over 95% (SI Appendix, Fig. S2). The estimates for γ, the diffusion of evolutionary rates, increased concomitant with the simulated values but display lower accuracy and coverage than all the other parameters (SI Appendix, Fig. S2). While there seems to be some information in the data, the specific parameterization of the prior, , exerts considerable influence on the shape of the posterior, as found also in birth–death models with diffusion of rates (21). This proportionality suggests that at least one can compare relative, if not absolute, values of γ. Importantly, mediocre accuracy and coverage does not seem to have an impact on our ability to recover the simulated trait and rate diffusion values.
Implementation.
All simulation and inference algorithms were implemented and are available as part of the “Tapestree” package for Julia software (80), available at https://github.com/ignacioq/Tapestree.jl (81).
Body Size Evolution.
I compiled phylogenetic and body size information for groups with a wide taxonomic coverage, detailed in SI Appendix, Table S1. I selected phylogenetic trees that were time-calibrated using the fossilized birth–death model (43), and where more than half of the (known) species were represented in the tree. I compiled and supplemented body size information from the primary literature to have the widest coverage for the species within the phylogenetic trees as possible (Dataset S1). When available, I added trait uncertainty (either due to known error, intraspecies variance, or other; SI Appendix, Table S1). In total, this dataset contains extinct and extant species represented across the phylogenetic trees, with body size data for species. I randomly sampled trees from the posterior for each taxa, except for nonavian dinosaurs and sloths (Folivora) for which I could not obtain the posterior distribution of trees.
I parameterized ambiguous priors: For αx and ασ, I used a and for γ a . I ran the DBM model for each of the trees of each of the groups using the DA procedure for iterations with the first iterations discarded as burn-in and sampling every . Note however that one iteration actually comprises multiple parameter updates. Together, this comprises separate MCMC chains and each returned posterior samples containing full trait and rate histories and the governing parameters (i.e., α, γ) for each posterior tree for each taxonomic group. I manually checked for chain convergence and Effective Sample Sizes. All analyses, manipulations, post hoc analyses, including posterior average and variance estimation, and plots were conducted in the “Tapestree” package for Julia (80, 81). All the data used together with the code are available as SI Appendix. Group specific results are displayed in SI Appendix, Figs. S3–S19.
Influence of trait variance.
This implementation of the DBM model can account for trait variance due to measurement error, uncertainty, or within-species variation. However, in the empirical analyses, I was only able to include information for trait variance for three (i.e., Proboscidea, Cetacea, and Caninae) out of all the analyzed taxonomic groups. Trait variance can overestimate rate information, particularly between closely related species (31, 82, 83), and thus could, contingent on the data configuration, bias against, or toward finding decaying or increasing evolutionary rates. Consequently, I carried further analyses to explore whether and how the main results could be biased by not accounting for trait uncertainty.
First, I carried a simple simulation study to test how ασ can be influenced when adding tip variance, where I simulated DBM models on fossilized birth–death trees, with equal parameters: , , , γ = 0.2, and, importantly, with . I then made inference assuming no trait variance (the “truth” in this scenario), and the same trait variance of and for all species, respectively representing about 10% and 20% of between species variance. I found that including trait variance has a deflating effect on ασ, with higher trait uncertainty yielding a more negative rate drift (SI Appendix, Fig. S21). This could be explained by the model allocating part of the evolutionary rate into trait variance (thus underestimating rates between closely related lineages, increasing the slowing down effect).
Second, and given the above, I ran the DBM model again for all groups for which I did not include trait variation. Namely, I estimated the average amount of trait variance from Cetaceans per their logarithmic unit of size, which was about SDs, and used this to add variances around trait information particular to each group’s size measurement. Adding this trait variance had negligible effects on the estimates of ασ, as shown in SI Appendix, Fig. S22, suggesting that the absence of rate slow downs in body size evolution seem not to be an artifact of discounting measurement errors, uncertainty, or within species variation.
Prominence of Species Sorting.
I estimated the effect of species sorting on the evolutionary trend of each clade, which I call the “prominence” of species sorting. Because I am not explicitly modeling the birth–death process, I cannot differentiate if trait change is due to unobserved speciation events, or from anagenetic evolution. However, I can measure the minimal effect that the disappearance of species has had in fashioning trends. While, model-wise, there should not be a bias in the traits of unobserved extinct species, certain phenotypes are more liable to fossilization than others, including body size. Indeed, ceteris paribus, larger body sizes are more prone to being represented in the fossil record than smaller organisms (84). Thus, a caveat of this analysis is not accounting for unobserved species, whose probability of being represented is likely size-dependent.
Prominence ranges from , estimating both the directionality and magnitude of the effect of species sorting on the average trait trend of the taxonomic group. A negative value of prominence indicates that the removal of species contributes to a decrease in the trend, and vice versa (Fig. 3). Its absolute magnitude indicates the proportion of change explained by species sorting, with denoting that all the change in average trait change was explained by species sorting. Prominence is measured for the change over a given time interval. Let be the start and end time where we sample the vector of traits and , respectively. The total change in trend is then the difference of the two averages, . This total change can be totally or partly explained by the anagenetic change of each lineage i alive at t1. If there are n lineages alive at time t1, then this amounts to the average of their change, . Note that some of the n lineages can be species that arose in the interval , so I take into account the effect of new species on top of the anagenetic evolution. Prominence, the role of species sampled at time t0, but not at t1, is then what remains, standardized, , and later constrained so as to remain between . Unlike the average trait change or the anagenetic change, the standardization and limits imposed on Prominence cause it to have a skewed distribution. In particular, note that very small changes in average traits can still generate large values (close to or ) of Prominence.
Taking advantage of the posterior DA samples of the full trait and rate histories, I estimated the prominence of species sorting every My and every Mys for each one. I then combined all samples per taxa and highlighted those intervals with strong changes in traits and rates, that is, those where the 50% Credible Interval of changes across DA samples did not overlap . Similarly, I highlighted those intervals in which the 50% Credible Interval of prominence across DA samples did not overlap . The results were similar regardless of measuring every or Mys, so I highlight only the former ones in Fig. 3 and SI Appendix, Figs. S21 and S22.
To estimate the total amount of change explained species sorting for each DA tree, for each taxonomic group, I estimated the proportion of the total amount of absolute change in traits and rates that is explained by species sorting. Specifically, this is the sum of absolute changes multiplied by the prominence of sorting across every or Mys in traits or rates, divided by the sum of all absolute changes, that is, for every j interval of or Mys out of m, , where is the prominence of species sorting for interval j. The results for each clade for intervals of or Mys are shown in SI Appendix, Fig. S23. Similarly, to estimate if species were selectively removed based on their size or their rates, I estimated the average prominence for each DA tree. If large species were selectively pruned out, the average prominence of traits should be less than , while if smaller species were selectively removed, average prominence should be larger than . Likewise with evolutionary rates, where a prominence in rates below reflects the removal of fast-evolving species, while a prominence larger than presents evidence for the selective pruning of slow rate species. The results for each clade for intervals of or Mys are shown in SI Appendix, Fig. S24.
Finally, to estimate the extent to which the particular empirical trees could explain the differences observed in the total amount of change and the directionality explained by species sorting, I conducted a simulation study. Specifically, I simulated Brownian motion processes for each empirical tree for each clade, and then carried out the same analyses as above. These null simulations show that the total amount of change explained by prominence is almost completely determined by the empirical trees (plotted as gray shadows in the background for each taxonomic group in SI Appendix, Fig. S25). However, as expected, the number of consistent trait changes and prominence was , and the directionality of prominence was not biased (also plotted as gray shadows in the background for each taxonomic group in SI Appendix, Fig. S26).
Supplementary Material
Appendix 01 (PDF)
Dataset S01 (CSV)
Acknowledgments
I thank Hélène Morlon, Jérémy Andreoletti, the Morlon lab in general, and Bruce S. Martin for feedback.
Author contributions
I.Q. designed research; performed research; contributed new reagents/analytic tools; analyzed data; and wrote the paper.
Competing interests
The author declares no competing interest.
Footnotes
This article is a PNAS Direct Submission.
Data, Materials, and Software Availability
Simulation and inference algorithm data have been deposited in GitHub (https://github.com/ignacioq/Tapestree.jl) (81). All other data are included in the manuscript and/or supporting information.
Supporting Information
References
- 1.Simpson G. G., Tempo and Mode in Evolution (Columbia University Press, 1944). [Google Scholar]
- 2.Simpson G. G., The Major Features of Evolution (Columbia University Press, 1953). [Google Scholar]
- 3.M. Foote, The evolution of morphological diversity. Annu. Rev. Ecol. Evol. Syst. 28, 129–152 (1997).
- 4.Hunt G., The relative importance of directional change, random walks, and stasis in the evolution of fossil lineages. Proc. Natl. Acad. Sci. U.S.A. 104, 18404–18408 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hughes M., Gerber S., Wills M. A., Clades reach highest morphological disparity early in their evolution. Proc. Natl. Acad. Sci. U.S.A. 110, 13875–13879 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wright S., et al. , The roles of mutation, inbreeding, crossbreeding, and selection in evolution. Sixth Int. Congr. Genet. 1, 356–366 (1932). [Google Scholar]
- 7.Lande R., Natural selection and random genetic drift in phenotypic evolution. Evolution 30, 314–334 (1976). [DOI] [PubMed] [Google Scholar]
- 8.Eldredge N., A question of individuality: Charles darwin, george gaylord simpson and transitional fossils. Evol. Educ. Outreach 2, 150–155 (2009). [Google Scholar]
- 9.G. G. Simpson, “The nature and origin of supraspecific taxa” in Cold Spring Harbor Symposia on Quantitative Biology (Cold Spring Harbor Laboratory Press, 1959), vol. 24, pp. 255–271. [DOI] [PubMed]
- 10.N. Eldredge, “The nature and origin of supraspecific taxa revisited-with special reference to trilobita in Fossils” in Phylogeny, and Form: An Analytical Approach, J. M. Adrain, G. D. Edgecombe, B. S. Lieberman, Eds. (Springer, 2001), pp. 341–375.
- 11.Yoder J. B., et al. , Ecological opportunity and the origin of adaptive radiations. J. Evol. Biol. 23, 1581–1596 (2010). [DOI] [PubMed] [Google Scholar]
- 12.Schluter D., The Ecology of Adaptive Radiation (OUP Oxford, 2000). [Google Scholar]
- 13.Gavrilets S., Losos J. B., Adaptive radiation: Contrasting theory with data. Science 323, 732–737 (2009). [DOI] [PubMed] [Google Scholar]
- 14.Harmon L. J., et al. , Early bursts of body size and shape evolution are rare in comparative data. Evolution 64, 2385–2396 (2010). [DOI] [PubMed] [Google Scholar]
- 15.Cracraft J., A nonequilibrium theory for the rate-control of speciation and extinction and the origin of macroevolutionary patterns. Syst. Biol. 31, 348–365 (1982). [Google Scholar]
- 16.Gould S. J., The Structure of Evolutionary Theory (Harvard University Press, 2002). [Google Scholar]
- 17.Kaplan J., The end of the adaptive landscape metaphor? Biol. Philos. 23, 625–638 (2008). [Google Scholar]
- 18.Venditti C., Meade A., Pagel M., Multiple routes to mammalian diversity. Nature 479, 393–396 (2011). [DOI] [PubMed] [Google Scholar]
- 19.Erwin D. H., Novelty and innovation in the history of life. Curr. Biol. 25, R930–R940 (2015). [DOI] [PubMed] [Google Scholar]
- 20.Slater G. J., Topographically distinct adaptive landscapes for teeth, skeletons, and size explain the adaptive radiation of carnivora (mammalia). Evolution 76, 2049–2066 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Quintero I., Lartillot N., Morlon H., Imbalanced speciation pulses sustain the radiation of mammals. Science 384, 1007–1012 (2024). [DOI] [PubMed] [Google Scholar]
- 22.Erwin D. H., Disparity: Morphological pattern and developmental context. Palaeontology 50, 57–73 (2007). [Google Scholar]
- 23.Gould S. J., Tempo and mode in the macroevolutionary reconstruction of darwinism. Proc. Natl. Acad. Sci. U.S.A. 91, 6764–6771 (1994). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hopkins M. J., Smith A. B., Dynamic evolutionary change in post-paleozoic echinoids and the importance of scale when interpreting changes in rates of evolution. Proc. Natl. Acad. Sci. U.S.A. 112, 3758–3763 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Puttick M. N., Mixed evidence for early bursts of morphological evolution in extant clades. J. Evol. Biol. 31, 502–515 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Slater G. J., Pennell M. W., Robust regression and posterior predictive simulation increase power to detect early bursts of trait evolution. Syst. Biol. 63, 293–308 (2013). [DOI] [PubMed] [Google Scholar]
- 27.Jablonski D., Approaches to macroevolution: 2. Sorting of variation, some overarching issues, and general conclusions. Evol. Biol. 44, 451–475 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Haldane J. B. S., Suggestions as to quantitative measurement of rates of evolution. Evolution 3, 51–56 (1949). [DOI] [PubMed] [Google Scholar]
- 29.Gingerich P. D., Rates of evolution. Annu. Rev. Ecol. Evol. Syst. 40, 657–675 (2009). [Google Scholar]
- 30.Revell L. J., A variable-rate quantitative trait evolution model using penalized-likelihood. PeerJ 9, e11997 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Martin B. S., Bradburd G. S., Harmon L. J., Weber M. G., Modeling the evolution of rates of continuous trait evolution. Syst. Biol. 72, 590–605 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Tanner M. A., Wong W. H., The calculation of posterior distributions by data augmentation. J. Am. Stat. Assoc. 82, 528–540 (1987). [Google Scholar]
- 33.Horvilleur B., Lartillot N., Monte carlo algorithms for brownian phylogenetic models. Bioinformatics 30, 3020–3028 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Quintero I., Landis M. J., Interdependent phenotypic and biogeographic evolution driven by biotic interactions. Syst. Biol. 69, 739–755 (2020). [DOI] [PubMed] [Google Scholar]
- 35.Maliet O., Morlon H., Fast and accurate estimation of species-specific diversification rates using data augmentation. Syst. Biol. 71, 353–366 (2022). [DOI] [PubMed] [Google Scholar]
- 36.Slater G. J., Harmon L. J., Alfaro M. E., Integrating fossils with molecular phylogenies improves inference of trait evolution. Evolution 66, 3931–3944 (2012). [DOI] [PubMed] [Google Scholar]
- 37.Betancur-R R., Ortí G., Pyron R. A., Fossil-based comparative analyses reveal ancient marine ancestry erased by extinction in ray-finned fishes. Ecol. Lett. 18, 441–450 (2015). [DOI] [PubMed] [Google Scholar]
- 38.Slater G. J., Goldbogen J. A., Pyenson N. D., Independent evolution of baleen whale gigantism linked to plio-pleistocene ocean dynamics. Proc. R. Soc. B Biol. Sci. 284, 20170546 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Tavares Vd. C., Warsi O. M., Balseiro F., Mancina C. A., Dávalos L. M., Out of the antilles: Fossil phylogenies support reverse colonization of bats to south America. J. Biogeogr. 45, 859–873 (2018). [Google Scholar]
- 40.McCullough J. M., Moyle R. G., Smith B. T., Andersen M. J., A laurasian origin for a pantropical bird radiation is supported by genomic and fossil data (aves: Coraciiformes). Proc. R. Soc. B 286, 20190122 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Burin G., Park T., James T. D., Slater G. J., Cooper N., The dynamic adaptive landscape of cetacean body size. Curr. Biol. 33, 1787–1794 (2023). [DOI] [PubMed] [Google Scholar]
- 42.Stadler T., Sampling-through-time in birth-death trees. J. Theor. Biol. 267, 396–404 (2010). [DOI] [PubMed] [Google Scholar]
- 43.Heath T. A., Huelsenbeck J. P., Stadler T., The fossilized birth-death process for coherent calibration of divergence-time estimates. Proc. Natl. Acad. Sci. U.S.A. 111, E2957–E2966 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.B. K. McNab, “Basal rate of metabolism, body size, and food habits in the order carnivora” in Carnivore Behavior, Ecology, and Evolution, J. L. Gittleman, Ed. (Springer, 1989), pp. 335–354.
- 45.M. LaBarbera, Analyzing body size as a factor in ecology and evolution. Annu. Rev. Ecol. Syst. 20, 97–117 (1989).
- 46.Brown J. H., Marquet P. A., Taper M. L., Evolution of body size: Consequences of an energetic definition of fitness. Am. Nat. 142, 573–584 (1993). [DOI] [PubMed] [Google Scholar]
- 47.Gittleman J. L., Purvis A., Body size and species-richness in carnivores and primates. Proc. R. Soc. Lond. Ser. B Biol. Sci. 265, 113–119 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Smith F. A., Lyons S. K., Animal Body Size: Linking Pattern and Process Across Space, Time, and Taxonomic Group (University of Chicago Press, 2013). [Google Scholar]
- 49.Ingram T., Harmon L. J., Shurin J. B., When should we expect early bursts of trait evolution in comparative data? Predictions from an evolutionary food web model J. Evol. Biol. 25, 1902–1910 (2012). [DOI] [PubMed] [Google Scholar]
- 50.Stanley S. M., An explanation for cope’s rule. Evolution 27, 1–26 (1973). [DOI] [PubMed] [Google Scholar]
- 51.F. A. Smith et al. Body size evolution across the geozoic. Annu. Rev. Earth Planet. Sci. 44, 523–553 (2016).
- 52.Landis M. J., Schraiber J. G., Pulsed evolution shaped modern vertebrate body sizes. Proc. Natl. Acad. Sci. U.S.A. 114, 13224–13229 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Goswami A., et al. , Attenuated evolution of mammals through the cenozoic. Science 378, 377–383 (2022). [DOI] [PubMed] [Google Scholar]
- 54.S. J. Gould, R. C. Lewontin, The spandrels of san marco and the panglossian paradigm: A critique of the adaptationist programme. Proc. R. Soc. Lond. B Biol. Sci. 205, 581–598 (1979). [DOI] [PubMed]
- 55.Popper K., Conjectures and Refutations: The Growth of Scientific Knowledge (Routledge, ed. 3, 1968). [Google Scholar]
- 56.Graeber D., Debt: The First 5,000 Years, Updated and Expanded (Melville House, 2014). [Google Scholar]
- 57.Gould S. J., Trends as changes in variance: A new slant on progress and directionality in evolution. J. Paleontol. 62, 319–329 (1988). [Google Scholar]
- 58.Vermeij G. J., The evolutionary interaction among species: Selection, escalation, and coevolution. Annu. Rev. Ecol. Syst. 25, 219–236 (1994). [Google Scholar]
- 59.Erwin D. H., The topology of evolutionary novelty and innovation in macroevolution. Philos. Trans. R. Soc. Lond. B Biol. Sci. 372, 20160422 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Cope E. D., The Primary Factors of Organic Evolution (Open Court Publishing Company, 1904). [Google Scholar]
- 61.Heim N. A., Knope M. L., Schaal E. K., Wang S. C., Payne J. L., Cope’s rule in the evolution of marine animals. Science 347, 867–870 (2015). [DOI] [PubMed] [Google Scholar]
- 62.Lloyd E. A., Gould S. J., Species selection on variability. Proc. Natl. Acad. Sci. U.S.A. 90, 595–599 (1993). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Vrba E. S., Gould S. J., The hierarchical expansion of sorting and selection: Sorting and selection cannot be equated. Paleobiology 12, 217228 (1986). [Google Scholar]
- 64.Lewontin R. C., The organism as the subject and object of evolution. Scientia 77, 63–82 (1983). [Google Scholar]
- 65.Hansen T. F., Stabilizing selection and the comparative analysis of adaptation. Evolution 51, 1341–1351 (1997). [DOI] [PubMed] [Google Scholar]
- 66.Levins R., Lewontin R., The Dialectical Biologist (Harvard University Press, 1985). [Google Scholar]
- 67.G. Longo, M. Montévil, S. Kauffman, “No entailing laws, but enablement in the evolution of the biosphere” in Proceedings of the 14th Annual Conference Companion on Genetic and Evolutionary Computation, GECCO ’12, C. Igel, Ed. (Association for Computing Machinery, New York, NY, USA, 2012), p. 13791392.
- 68.J. Odling-Smee, Niche Construction: How Life Contributes to Its Own Evolution (MIT Press, 2024).
- 69.Hordijk W., Evolution: Limited and predictable or unbounded and lawless? Biol. Theory 11, 187–191 (2016). [Google Scholar]
- 70.Newman S. A., Inherency and agency in the origin and evolution of biological functions. Biol. J. Linn. Soc. 139, 487–502 (2022). [Google Scholar]
- 71.Lewontin R., The Triple Helix: Gene, Organism, and Environment (Harvard University Press, 2000). [Google Scholar]
- 72.Clavel J., Morlon H., Accelerated body size evolution during cold climatic periods in the cenozoic. Proc. Natl. Acad. Sci. U.S.A. 114, 4183–4188 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Lemey P., Rambaut A., Welch J. J., Suchard M. A., Phylogeography takes a relaxed random walk in continuous space and time. Mol. Biol. Evol. 27, 1877–1885 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Silvestro D., et al. , Early arrival and climatically-linked geographic expansion of new world monkeys from tiny African ancestors. Syst. Biol. 68, 78–92 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Pagel M., O’Donovan C., Meade A., General statistical model shows that macroevolutionary patterns and processes are consistent with darwinian gradualism. Nat. Commun. 13, 1113 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.May M. R., Moore B. R., A Bayesian approach for inferring the impact of a discrete character on rates of continuous-character evolution in the presence of background-rate variation. Syst. Biol. 69, 530–544 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Felsenstein J., Maximum-likelihood estimation of evolutionary trees from continuous characters. Am. J. Hum. Genet. 25, 471 (1973). [PMC free article] [PubMed] [Google Scholar]
- 78.Gelman A., et al. , Bayesian Data Analysis (Chapman and Hall/CRC, ed. 3, 2013). [Google Scholar]
- 79.Polson N. G., Roberts G. O., Bayes factors for discrete observations from diffusion processes. Biometrika 81, 11–26 (1994). [Google Scholar]
- 80.Bezanson J., Edelman A., Karpinski S., Shah V. B., Julia: A fresh approach to numerical computing. SIAM Rev. 59, 65–98 (2017). [Google Scholar]
- 81.I. Quintero, Tapestree. Github. https://github.com/ignacioq/Tapestree.jl. Deposited 12 November 2024.
- 82.Ives A. R., Midford P. E., Garland T. Jr., Within-species variation and measurement error in phylogenetic comparative methods. Syst. Biol. 56, 252–270 (2007). [DOI] [PubMed] [Google Scholar]
- 83.Felsenstein J., Comparative methods with sampling error and within-species variation: Contrasts revisited and revised. Am. Nat. 171, 713–725 (2008). [DOI] [PubMed] [Google Scholar]
- 84.A. K. Behrensmeyer, D. E. Dechant Boaz, A. Hill, “The recent bones of amboseli park, kenya, in relation to east African paleoecology” in Fossils in the Making: Vertebrate Taphonomy and Paleoecology, A. K. Behrensmeyer, A. P. Hill, Eds. (University of Chicago Press Chicago, 1980), pp. 72–92.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Appendix 01 (PDF)
Dataset S01 (CSV)
Data Availability Statement
Simulation and inference algorithm data have been deposited in GitHub (https://github.com/ignacioq/Tapestree.jl) (81). All other data are included in the manuscript and/or supporting information.


