How Well Can We Detect Lineage-Specific Diversification-Rate Shifts? A Simulation Study of Sequential AIC Methods

Michael R May; Brian R Moore

doi:10.1093/sysbio/syw026

. 2016 Apr 1;65(6):1076–1084. doi: 10.1093/sysbio/syw026

How Well Can We Detect Lineage-Specific Diversification-Rate Shifts? A Simulation Study of Sequential AIC Methods

Michael R May ^1,^*, Brian R Moore ¹

PMCID: PMC5066061 PMID: 27037081

Abstract

Evolutionary biologists have long been fascinated by the extreme differences in species numbers across branches of the Tree of Life. This has motivated the development of statistical methods for detecting shifts in the rate of lineage diversification across the branches of phylogenic trees. One of the most frequently used methods, MEDUSA, explores a set of diversification-rate models, where each model assigns branches of the phylogeny to a set of diversification-rate categories. Each model is first fit to the data, and the Akaike information criterion (AIC) is then used to identify the optimal diversification model. Surprisingly, the statistical behavior of this popular method is uncharacterized, which is a concern in light of: (1) the poor performance of the AIC as a means of choosing among models in other phylogenetic contexts; (2) the ad hoc algorithm used to visit diversification models, and; (3) errors that we reveal in the likelihood function used to fit diversification models to the phylogenetic data. Here, we perform an extensive simulation study demonstrating that MEDUSA (1) has a high false-discovery rate (on average, spurious diversification-rate shifts are identified $\approx 30 %$ of the time), and (2) provides biased estimates of diversification-rate parameters. Understanding the statistical behavior of MEDUSA is critical both to empirical researchers—in order to clarify whether these methods can make reliable inferences from empirical datasets—and to theoretical biologists—in order to clarify the specific problems that need to be solved in order to develop more reliable approaches for detecting shifts in the rate of lineage diversification. [Akaike information criterion; extinction; lineage-specific diversification rates; phylogenetic model selection; speciation.]

Many evolutionary phenomena entail differential rates of diversification (speciation rate – extinction rate); adaptive radiation, diversity-dependent diversification, key innovations, and mass extinction. Phylogeny-based statistical methods have been developed to detect shifts in diversification rate through time, such as tree-wide shifts in diversification rate associated with episodes of mass extinction or adaptive radiation (Morlon et al. 2011; Stadler 2010; 2011; Höhna et al. 2015; May et al. 2016), or diversity-dependent decreases in diversification rate associated with ecological limits on speciation (Rabosky and Lovette 2008; Morlon et al. 2010; Etienne and Haegeman 2012; Etienne et al. 2012; Höhna 2014; Leventhal et al. 2014). Other methods seek to identify correlations between rates of diversification and some other variable, such as the evolution of discrete or continuous traits (Maddison et al. 2007; FitzJohn 2010) or episodes of biogeographic or climatic change (Moore and Donoghue 2009; Goldberg et al. 2011). Here, we focus on a third class of methods that seek to detect shifts in diversification rate along lineages of a phylogenetic tree (Moore et al. 2004; Chan and Moore 2005; Rabosky et al. 2007; Alfaro et al. 2009; Rabosky 2014).

The detection of lineage-specific diversification rates is often pursued using the approach proposed by Alfaro et al. (2009): Modeling Evolutionary Diversification Using Stepwise AIC (MEDUSA), implemented in the R (R Development Core Team 2009) package GEIGER (Pennell et al. 2014). The popularity of MEDUSA stems from several advantages it offers relative to alternative approaches: (1) rather than requiring complete species-level phylogenies, MEDUSA allows unsampled species to be included within unresolved terminal subclades; (2) rather than requiring the location of diversification-rate shifts to be specified a priori, MEDUSA agnostically evaluates diversification-rate shifts along all branches of the tree; (3) rather than assuming a pure-birth (Yule 1924) model (c.f., Chan and Moore 2005), MEDUSA is based on a more realistic birth-death model that accommodates extinction, and; (4) in addition to inferring the location(s) of diversification-rate shifts, MEDUSA also provides estimates of the diversification-rate parameters for each branch of the tree.

The statistical behavior of this popular method, however, is unknown. This is particularly disconcerting, as the use of the AIC for model selection has been shown to be problematic in other phylogenetic contexts (Alfaro and Huelsenbeck 2006; Boettiger et al. 2012), where it is biased toward overly complex models. Additionally, we demonstrate that the likelihood function used by MEDUSA to fit diversification models to the phylogenetic data is incorrect, which has the potential to impact the accuracy of parameter estimates and the reliability of the model-selection procedure. With these considerations in mind, we performed an extensive simulation study to characterize the statistical behavior of MEDUSA.

The MEDUSA algorithm.—MEDUSA begins by fitting a constant-rate birth-death process to an ultrametric phylogeny (i.e., where branch lengths and node heights are rendered proportional to time, and all species are contemporaneous) by means of maximum likelihood [using equation (3), detailed in the next section]. Each possible two-rate model—where a two-rate model includes a separate set of speciation- and extinction-rate parameters for a particular subclade of the tree—is then evaluated. If the difference in the AIC scores between the one-rate model and the best two-rate model exceeds a specified threshold ( $Δ {AIC}_{crit}$ ), the two-rate model is accepted. If the two-rate model is accepted, the process is repeated for each possible three-rate model. This procedure iteratively adds diversification-rate shifts to the phylogeny until additional diversification-rate shifts do not confer a sufficient improvement in the AIC score (Fig. 1). We provide a more detailed description of the MEDUSA algorithm in Section S1 of the Supplementary Material available on Dryad at http://dx.doi.org/10.5061/dryad.61q64.

Figure 1. — The MEDUSA algorithm. 1) The one-rate model, $M_{0}$ , is first fit to the data using maximum likelihood. 2) Every possible two-rate model is then fit to the data, and the models are ranked by their likelihood score (with the best model, $M_{1}$ , at the bottom). 3) Every possible three-rate model that is consistent with the best two-rate model is then fit to the data, and the models are ranked as in the previous step. 4) Step 3 is repeated for increasingly complex models until the most complex model has been evaluated. 5) The AIC score is computed for the best model in each index (i.e., with $0, 1, \dots$ rate shifts). 6) The $Δ$ AIC value is calculated for each pair of adjacent models (as the difference in their AIC scores). Each of the $Δ {AIC}_{i}$ comparisons are evaluated in succession, where the more complex model is selected if the computed $Δ$ AIC value exceeds some prespecified threshold, $Δ {AIC}_{crit}$ . This process continues until either the most complex model is accepted or the improvement in the AIC score is too small to exceed the $Δ {AIC}_{crit}$ threshold.

There have been several versions of the MEDUSA algorithm that differ primarily in how they specify $Δ {AIC}_{crit}$ : the first version adopted a fixed (and conventional value) $Δ {AIC}_{crit}$ of 4, whereas more recent versions of MEDUSA compute $Δ {AIC}_{crit}$ based on the number of terminal lineages. In this article, we present results using the version of MEDUSA—available in the R package GEIGER (Pennell et al. 2014)—that we found to exhibit the best overall statistical behavior, which is also the version that has been used most frequently in empirical studies (we provide results for the other versions of the program in the Supplementary Material available on Dryad).

The Likelihood Function in MEDUSA is Incorrect

The likelihood function computes the probability of the data—typically a molecular phylogeny with estimated divergence times—given the parameters of the model. We assume the phylogeny, $Ψ$ , with $k$ terminal lineages was generated by a birth-death process with a set of branch-specific speciation rates, $λ = {λ_{1}, λ_{2}, \dots, λ_{2 k - 2}}$ , and a vector of branch-specific extinction rates, $μ = {μ_{1}, μ_{2}, \dots, μ_{2 k - 2}}$ , where $λ_{i}$ and $μ_{i}$ are the speciation and extinction rates for branch $i$ , respectively.

The likelihood of observing the data is calculated piecewise (c.f., Rabosky et al. 2007). First, the function computes the likelihood of the internal branches, then computes the likelihood of the terminal lineages, and finally combines these two likelihoods. The probability density of observing an internal branch of length $t_{i}$ that arose at time $t_{i}^{b}$ used by MEDUSA is:

\begin{array}{l} P (t_{i} ∣ λ_{i}, μ_{i}, t_{i}^{b}) = \frac{(λ_{i} - μ_{i}) \exp (- (λ_{i} - μ_{i}) t_{i})}{1 - \frac{μ_{i}}{λ_{i}} \exp (- (λ_{i} - μ_{i}) t_{i}^{b})} . \end{array}

(1)

The probability of observing terminal lineage $i$ with $n_{i}$ species, conditional on it having descendants at the present is

\begin{array}{l} P (n_{i} ∣ t_{i}^{b}, λ_{i}, μ_{i}) & = & (1 - \frac{\exp ((λ_{i} - μ_{i}) t_{i}^{b}) - 1}{\exp ((λ_{i} - μ_{i}) t_{i}^{b}) - \frac{μ_{i}}{λ_{i}}}) \\ \times {(\frac{\exp ((λ_{i} - μ_{i}) t_{i}^{b}) - 1}{\exp ((λ_{i} - μ_{i}) t_{i}^{b}) - \frac{μ_{i}}{λ_{i}}})}^{n_{i} - 1} . \end{array}

(2)

Parsing the tree into $T$ terminal lineages and $ℐ$ internal branches, the likelihood for the phylogeny and species richness data is calculated as

\begin{array}{l} P (Ψ ∣ λ, μ) = (\prod_{i \in T} P (t_{i} ∣ t_{i}^{b}, λ_{i}, μ_{i})) \times (\prod_{i \in ℐ} P (n_{i} ∣ t_{i}^{b}, λ_{i}, μ_{i})), \end{array}

(3)

as proposed by Rabosky et al. (2007).

This likelihood function, however, is incorrect in a number of ways. First, equation (1) is the probability density of the next birth event [equation (1)], rather than the probability density of an internal branch of length $t_{i}$ . Second, it conditions on the survival of each of the terminal lineages. Typically, likelihoods under birth-death process models are conditioned on the survival of the entire process to the present (i.e., the total number of species present, $N$ , must be greater than 0). The rationale for the condition is simple: if the process had not survived to the present, we would not have observed the tree. In the context of the present likelihood function, conditioning on the survival of every terminal lineage is different from conditioning on the survival of the entire process: in order for the process to survive, it is simply necessary that at least one terminal lineage survives, not that all terminal lineages survive.

This derivation of the likelihood function also suffers from a more fundamental—and more difficult—issue. Specifically, it requires that we are able to assign a set of speciation- and extinction-rate parameters to each lineage that arose during the evolution of the tree (extant or otherwise). Under a birth-death process, some lineages may go extinct before the present, and so are unobserved. Accordingly, it is not possible to coherently assign diversification parameters to extinct lineages. Consequently, this likelihood function implicitly assumes that diversification-rate shifts cannot occur along extinct lineages. When diversification-rate shifts occur along lineages that ultimately went extinct, the likelihood function will be incorrect. We discuss the provenance and derivation of these probability functions in more detail in Appendix.

Use of the AIC to Select Diversification Models may be Problematic

Use of the AIC to choose among models in other phylogenetic settings is known to be problematic (e.g.,Alfaro and Huelsenbeck 2006; Boettiger et al. 2012). Specifically, there is little theory to guide the specification of an appropriate critical threshold ( $Δ {AIC}_{crit}$ ) for preferring one model to another, and arbitrarily specified thresholds may strongly bias the model-selection procedure. Moreover, the AIC assumes that the sample size (in this case, the number of sampled lineages) is large. In practice, however, MEDUSA is generally applied to trees with a small number of incompletely sampled terminal lineages. Accordingly, the large sample size assumption of the AIC is apt to be violated, which may compromise the ability of MEDUSA to reliably choose the correct model.

Methods

Simulation Study

These considerations—regarding errors in the likelihood function and the reliability of the AIC as a means of selecting diversification models—motivated a simulation study to determine how well MEDUSA recovers the correct diversification-rate model under a variety of realistic conditions. All simulations were performed in R (R Core Team 2013) using the packages ape and TreeSim (Paradis et al. 2004; Stadler 2013); trees were subsequently manipulated with custom R scripts.

To ensure that our simulation study targeted an empirically relevant region of parameter space, we conducted a survey of all published studies that applied MEDUSA to empirical trees (Table S1 available on Dryad). From these empirical studies, we summarized relevant metadata, including: (1) the total number of species encompassed by the phylogeny (regardless of whether they were actually sampled), $N$ ; (2) the number of terminal lineages (i.e., unresolved subclades) included in the phylogeny, $k$ , and; (3) speciation- and extinction-rate estimates provided by MEDUSA. We then designed our simulation study to reflect the characteristics of empirical MEDUSA studies (Fig.S2 available on Dryad).

We simulated constant-rate trees with a fixed speciation rate, $λ = 0.01$ , under various extinction rates, $μ = λ \times {0, 0.1, \dots, 0.9}$ , and of various sizes, $N = {100, 1000, 10, 000}$ . We simulated 1000 trees for each combination of these settings, and then collapsed each tree to various numbers of terminal lineages, $k = {10, 15, 20, 25, 30, 40, 100}$ , by retaining the oldest $k - 1$ nodes and assigning the pruned species to their corresponding terminal lineages; this protocol is equivalent to the “diversified-sampling” strategy described by Höhna et al. (2011). We then analyzed each tree with MEDUSA. For each analysis, we recorded(1) whether a rate shift was inferred, and; (2) the parameter estimates under the preferred diversification model. We computed the false-discovery rate as the fraction of analyses that inferred at least one diversification-rate shift. For analyses that correctly selected the constant-rate model, we computed the bias for each diversification-rate parameter as the estimated value divided by the true value. When MEDUSA incorrectly rejected the constant-rate model, we computed the magnitude of the inferred diversification-rate shift as the “derived rate” (i.e., the net-diversification-rate estimated for the subclade that experienced the diversification-rate shift) divided by the “initial” net-diversification rate (i.e., estimated for the remainder of the tree); $δ = r_{shift} / r_{initial}$ . We performed a pilot study to determine the number of replicate simulations required to ensure precise estimates of the false-discovery rate, which indicated that 500 replicate simulations were sufficient to ensure negligible Monte Carlo error (results not shown). To be conservative, we based our inferences regarding the statistical behavior of MEDUSA on 1000 replicate simulations for each unique parameter combination that we explored in our study.

We also performed additional simulations to comprehensively explore the parameter space defined by our empirical survey, and used multiple versions of the MEDUSA algorithm to accommodate implementation-specific effects. Overall, the results of these simulations are qualitatively and quantitatively similar to the simulation study described above; we refer the reader to the Supplementary Material available on Dryad for detailed descriptions (and results) of these additional simulation experiments.

Results

We focus on two primary aspects of the statistical behavior of MEDUSA: (1) the propensity to identify spurious diversification-rate shifts (i.e., the false-discovery rate), and; (2) the accuracy of diversification-rate parameter estimates (i.e., the estimator bias).

MEDUSA has a High False-Discovery Rate

The overall false-discovery rate (FDR)—calculated as the unweighted average overall of simulated trees—is $29 %$ (Fig. 2), which is $\approx 6$ times higher than the nominal significance level, $α = 0.05$ . Additionally, the FDR increases as the proportion of the species in unresolved terminal clades increases; in fact, when there are no unresolved clades (i.e., $k = N$ ; Fig. 2, left panel, right column), the false-discovery rate is approximately $5 %$ . The FDR appears relatively insensitive to the true extinction rate, $μ$ , although higher extinction rates tend to have higher FDR, on average. These patterns are consistent with violation of the large-sample assumptions of the AIC driving the inflated false-discovery rate: as $k$ decreases, the sample size decreases, exacerbating the bias toward overly complex models.

Figure 2. — False-discovery rate. We simulated constant-rate trees with a fixed speciation rate, $λ = 0.01$ , under various extinction rates, $μ = λ \times {0, 0.1, \dots, 0.9}$ ( $y$ -axis), and of various sizes, $N = {100, 1000, 10, 000}$ (left to right panels). We simulated 1000 trees for each combination of these settings, and then collapsed each tree to various numbers of terminal lineages, $k = {10, 15, 20, 25, 30, 40, 100}$ ( $x$ -axis of each panel), by retaining the oldest $k - 1$ nodes and assigning the pruned species to their corresponding terminal lineages. We then analyzed each tree with MEDUSA, and recorded the fraction of trees in which at least one diversification-rate shift was inferred (the *false-discovery rate*; FDR).

MEDUSA Provides Biased Parameter Estimates

The ability of MEDUSA to estimate parameters of the birth-death process represents an important advantage over competing methods. This allows users to not only identify the number and phylogenetic distribution of diversification-rate shifts, but also to estimate the magnitude of corresponding changes in the speciation and extinction rates. Accordingly, even if—as we have demonstrated above—MEDUSA does not provide reliable estimates of the number and location of diversification-rate shifts, it may nevertheless prove useful for estimating parameters of the birth-death process. Moreover, if most of the spurious diversification-rate shifts are inferred to involve relatively minor shifts in diversification rate, then it might be possible to use the diversification-rate parameter estimates to distinguish between spurious and bonafide diversification-rate shifts. We focus on estimates of the net-diversification rate, $r = λ - μ$ , as this compound parameter is expected to be considerably easier to estimate compared to the speciation and extinction rates (Kubo and Iwasa 1995; Rabosky et al. 2007). Our results agree with this expectation; estimates of the relative-extinction rate, $μ / λ$ , are more severely biased than are estimates of the the net-diversification rate (see Supplementary Material available on Dryad).

Our simulation study reveals that—even when MEDUSA selects the correct (constant-rate) model—the net-diversification rate estimate is $1.13$ times the true value, on average (Fig. 3, top row). As with the false-discovery rate, this bias is exacerbated as $k$ decreases. The severity of the bias also increases as the true extinction rate, $μ$ , increases. When MEDUSA incorrectly selects a two-rate model, it infers a 2.7-fold change in the net-diversification rate, on average (Fig. 3, bottom row), with the upper 95% quantile of diversification-rate shifts entailing 7.7-fold increases in the net-diversification rate. Thus, when MEDUSA incorrectly identifies a diversification-rate shift, the magnitude of the spurious diversification-rate shifts is apt to be large.

Figure 3. — Parameter estimates. We simulated constant-rate trees and analyzed them as described in Figure 2. We binned the trees into categories: (1) where the (correct) constant-rate model was selected (top row), and; (2) where the constant-rate model was rejected (bottom row). Top row: We computed the bias in the estimated net-diversification rate, $r = λ - μ$ , as $r_{estimated} / r_{true}$ . Bottom row: We computed the magnitude of inferred diversification-rate shifts, $δ$ , as the estimated net-diversification rate for the subclade with the shift divided by the estimated net-diversification rate for the remainder of the tree, $δ = r_{shift} / r_{initial}$ .

Dicussion

The Statistical Behavior of MEDUSA

Biologists are clearly interested in identifying shifts in diversification rate across lineages, and have enthusiastically embraced MEDUSA as an approach for addressing this problem (Table S1 available on Dryad). Nevertheless, our simulation study reveals that MEDUSA is unreliable. For almost 30% of the trees that we simulated in our study—where rates were strictly constant across lineages of the tree—MEDUSA identified strong support for one or more diversification-rate shifts. This result applies to all versions of the algorithm (see Supplementary Material available on Dryad), and holds over a wide range of absolute parameter values that broadly encompass the conditions encountered in the analyses of actual empirical data sets (Table S1, Fig. S2 available on Dryad).

MEDUSA also provides biased estimates of diversification-rate parameters. Even when MEDUSA correctly identified the (true) constant-rate model, the net-diversification estimates were on average $1.13$ times their true values (Fig. 3, top row; see also Table S3 available on Dryad), and the relative-extinction rate estimates were on average $0.20$ times their true values (Table S4 available on Dryad). Similarly, when MEDUSA identified the (incorrect) multi-rate model, the spurious diversification-rate shifts were inferred to involve large rate shifts. On average, spurious diversification-rate shifts were estimated to involve a $2.7$ -fold shift in net-diversification rate (Fig. 3, bottom row; see also Table S5 available on Dryad), with a large spread of values (the upper $95 %$ quantile had an average 7.7-fold shift in net-diversification rate, Table S6 available on Dryad). Similarly, these spurious diversification-rate shifts were estimated to entail an average $3.1$ -fold shift in relative-extinction rate (Table S7 available on Dryad), with a correspondingly large spread of values (the upper $95 %$ quantile had an average 6.7-fold shift in relative-extinction rate, Table S8 available on Dryad).

Prospects for Addressing the $Δ$ AIC Threshold in MEDUSA

The high false-discovery rate exhibited by MEDUSA stems from: (1) the use of an arbitrary $Δ {AIC}_{crit}$ threshold (the improvement in model fit necessary to justify selecting a more complex model), and; (2) the comparison of a large number of alternative diversification models. The earliest applications of MEDUSA iterated over several fixed values $Δ {AIC}_{crit}$ to assess the sensitivity of biological inferences to this arbitrary threshold (c.f., Alfaro et al. 2009), but generally interpreted results for a fixed $Δ {AIC}_{crit}$ of 4, a conventional threshold value advocated by Burnham and Anderson (2003). Recent versions of MEDUSA address multiple-testing concerns by computing the $Δ {AIC}_{crit}$ threshold based on the number of terminal lineages, $k$ , in the observed phylogeny, which is directly related to the number of models being compared. The function relating the number of lineages to $Δ {AIC}_{crit}$ was derived from a simulation study performed with completely sampled (i.e., $k = N$ ), constant-rate birth-death trees (Pennell et al. 2014). As a result, the false-discovery rate is acceptable when $k = N$ (Fig. 1).

Unfortunately, the complex relationship between the false-discovery rate and aspects of the data ( $N$ , $k, λ$ , and $μ$ ) displayed in Figure 1 makes it clear that multiple testing is not the sole pathology, and that there is no generic solution for specifying a value for $Δ {AIC}_{crit}$ that will ensure appropriate statistical behavior. Fortunately, Monte Carlo simulation has proven to be an effective solution to similar problems (c.f., Moore et al. 2004; Boettiger et al. 2012), and we are optimistic that such a solution could be incorporated in the MEDUSA framework. However, developing an efficient and reliable Monte Carlo solution to this problem is nontrivial, and is an area for future work.

Prospects for Resolving Issues with the Likelihood Function of MEDUSA

We identified three issues with the MEDUSA likelihood function derived by Rabosky et al. (2007). First, the likelihood function uses an incorrect probability density for internal branches [equation (1)]; we provide the correct equation in the Appendix, equation (A.6). Second, the likelihood function incorrectly conditions on the survival of the process. In lieu of a stochastic model that explicitly describes diversification-rate shifts (see below), it may be more reasonable not to condition on survival. However, the consequences of removing this incorrect condition—e.g., on parameter estimates—are unknown.

The third—and more challenging—issue is the inability to account for diversification-rate shifts along lineages that ultimately go extinct. In order to accommodate these unobserved diversification-rate shifts, we must be able to compute the probability that a diversification-rate shift occurred along an extinct lineage, which in turn requires a model that describes how diversification-rate shifts occur through time and across (extant or extinct) lineages of the tree. MEDUSA, however, does not explicitly model the occurrence of diversification-rate shifts. Instead, MEDUSA compares a very explicit set of diversification-rate models while remaining agnostic about how the diversification-rate shifts arise (i.e., the diversification-rate shifts themselves are not described as events that occur with some probability). In the absence of a formal stochastic model of the diversification process, it is unclear how to accommodate diversification-rate shifts along extinct lineages within the MEDUSA framework. It is possible that a Monte Carlo approach using a rate-shift model (e.g., where rate shifts occur as a continuous-time Markov process along lineages, similar to the compound Poisson process model implied by BAMM;Rabosky 2014) could be devised to account for diversification-rate shifts along extinct lineages, but the feasibility and general applicability of such an approach requires further study.

Conclusions

In summary, many of the advances that make MEDUSA an attractive approach for detecting rate shifts appear to be problematic. For example, the ability to accommodate incompletely sampled trees with MEDUSA is a useful feature of the method. In fact, most applications ( $82.5 %$ ) of this method involve incomplete trees. Nevertheless, the pathological behavior of MEDUSA is most extreme under these conditions (Fig. 2). Likewise, the adoption of a birth-death model is presented as an important advantage of MEDUSA over competing methods that do not accommodate extinction. However, (relative) extinction-rate estimates obtained using MEDUSA are extremely unreliable (Table S4 available on Dryad). This finding is relevant to the general debate regarding whether (relative) extinction rates can be reliably estimated from phylogenies (Rabosky 2010; Pyron and Burbrink 2013).

The primary conclusion of our simulation study—that the MEDUSA approach is unreliable—might be viewed as a “negative” result. After all, our findings cast considerable doubt on the conclusions of the many empirical studies that have used MEDUSA, and argue strongly against the application of this approach in future empirical studies. Nevertheless, we believe that it is important to be aware of the limitations of a method if it is likely to provide spurious results. Moreover, beyond demonstrating that MEDUSA is unreliable, the results of our simulation study provide insights into the reasons why the method is unreliable. Understanding the nature of these problems can focus theoretical efforts to develop more reliable methods. Accordingly, far from discouraging, we view the findings of this study as cause for optimism. We are hopeful that future efforts will resolve issues afflicting the MEDUSA framework for identifying diversification-rate shifts that—coupled with rigorous evaluation of these new methods—will continue to enhance our ability to explore a broad range of fundamental evolutionary processes.

Supplementary Material

Data available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.61q64.

Funding

This research was supported by National Science Foundation (NSF) [grants DEB-0842181, DEB-0919529, DBI-1356737, and DEB-1457835 awarded to BRM]. Computational resources for this work were provided by NSF XSEDE [grant TG-DEB140025].

Acknowledgments

We are grateful to Michael Turelli and Peter Wainwright for encouraging us to pursue this project, to Yaniv Brandvain, Tracy Heath, Sebastian Höhna, John Huelsenbeck, Michael Landis, Bruce Rannala, and Tanja Stadler for thoughtful discussion, and to Andrew Magee for assistance with the survey of empirical studies. We are particularly grateful to Mike Alfaro, Joseph Brown and Luke Harmon for sharing their insights on this work.

Appendix 1

The likelihood functions used by MEDUSA are based on equations derived in Rabosky et al. (2007). For completeness, we present those equations and the equations from which they were derived. We note that these probabilities are not formally derived for a birth-death process model that allows among-lineage rate variation. Instead, these probabilities are derived for constant-rate birth-death processes, and then (incorrectly) applied to a variable-rate birth-death process by simply substituting different speciation and extinction rates for different branches of the tree.

Rabosky et al. (2007) derived equation (1) from Nee et al. (1994), equation (17), which is (substituting our notation):

\begin{array}{l} P (t_{i} ∣ n, λ, μ, t_{i}^{b}) & = & n (λ - μ) \exp (- n (λ - μ) t_{i}) \\ \times \frac{{(1 - \frac{μ}{λ} \exp (- (λ - μ) t_{i}))}^{n - 1}}{{(1 - \frac{μ}{λ} \exp (- (λ - μ) t_{i}^{b}))}^{n}}, \end{array}

(A.1)

where $n$ is the number of species alive at time $t_{i}^{b}$ . For the special case in which $n = 1$ (i.e., we consider a single branch that starts with a single species), the above equation simplifies to

\begin{array}{l} P (t_{i} ∣ λ, μ, t_{i}^{b}) = \frac{(λ - μ) \exp (- (λ - μ) t_{i})}{1 - \frac{μ}{λ} \exp (- (λ - μ) t_{i}^{b})} . \end{array}

(A.2)

Allowing the rates of speciation and extinction on branch $i$ to be $λ_{i}$ and $μ_{i}$ produces equation (1). We note that, in its original context, this equation describes the probability density of times between birth events, rather than of internal branches of particular lengths. We describe the correct equation in the following section.

Kendall (1949) derived the probability of realizing $n$ species from a single species that starts at time $t_{i}^{b}$ , which is [substituting our notation into his equation (21)]:

\begin{array}{l} P (n ∣ t_{i}^{b}, λ, μ) & = & (1 - \frac{μ}{λ} \frac{\exp ((λ - μ) t_{i}^{b}) - 1}{\exp ((λ - μ) t_{i}^{b}) - \frac{μ}{λ}}) \\ \times (1 - \frac{\exp ((λ - μ) t_{i}^{b}) - 1}{\exp ((λ - μ) t_{i}^{b}) - \frac{μ}{λ}}) \\ \times {(\frac{\exp ((λ - μ) t_{i}^{b}) - 1}{\exp ((λ - μ) t_{i}^{b}) - \frac{μ}{λ}})}^{n - 1}, \end{array}

(A.3)

which is equivalent to equation (A17) in Raup (1985)and equation (8.46) from Bailey (1964). Conditioning probability (A.3) on the survival of the clade, which happens with probability

\begin{array}{l} P (n > 0 ∣ t_{i}^{b}, λ, μ) = & (1 - \frac{μ}{λ} \frac{\exp ((λ - μ) t_{i}^{b}) - 1}{\exp ((λ - μ) t_{i}^{b}) - \frac{μ}{λ}}) \end{array}

(A.4)

gives

\begin{array}{l} P (n ∣ t_{i}^{b}, λ, μ) & = & (1 - \frac{\exp ((λ - μ) t_{i}^{b}) - 1}{\exp ((λ - μ) t_{i}^{b}) - \frac{μ}{λ}}) \\ \times {(\frac{\exp ((λ - μ) t_{i}^{b}) - 1}{\exp ((λ - μ) t_{i}^{b}) - \frac{μ}{λ}})}^{n - 1} . \end{array}

(A.5)

Again, allowing rates of speciation and extinction for clade $i$ to be $λ_{i}$ and $μ_{i}$ produces equation (2).

The Probability Density of an Internal Branch

Following Stadler (2010), the probability density of a branch of length $t_{i}$ is:

\begin{array}{l} P (t_{i} ∣ λ, μ, t_{i}^{b}) = P (1 ∣ t_{i}^{b}, λ, μ) / P (1 ∣ t_{i}^{e}, λ, μ), \end{array}

(A.6)

where $t_{i}^{e}$ is the time at the end of the branch, and $P (1 ∣ t, λ, μ)$ comes from equation (A.5).

References

Alfaro M.E., Huelsenbeck J.P. (2006). Comparative performance of Bayesian and AIC-based measures of phylogenetic model uncertainty. Syst. Biol. 55:89–96. [DOI] [PubMed] [Google Scholar]
Alfaro M.E., Santini F., Brock C., Alamillo H., Dornburg A., Rabosky D.L., Carnevale G., Harmon L.J. (2009). Nine exceptional radiations plus high turnover explain species diversity in jawed vertebrates. Proc. Natl Acad. Sci. 106:13410–13414. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bailey N.T. (1964). The elements of stochastic processes with applications to the natural sciences. New York: John Wiley & Sons. [Google Scholar]
Boettiger C., Coop G., Ralph P. (2012). Is your phylogeny informative? Measuring the power of comparative methods. Evolution 66:2240–2251. [DOI] [PMC free article] [PubMed] [Google Scholar]
Burnham K.P., Anderson D.R. (2003). Model selection and multimodel inference: a practical information-theoretic approach. Springer Science & Business Media. [Google Scholar]
Chan K.M.A., Moore B.R. (2005). SymmeTREE: whole-tree analysis of differential diversification rates. Bioinformatics 21:1709–1710. [DOI] [PubMed] [Google Scholar]
Etienne R., Haegeman B. (2012). A conceptual and statistical framework for adaptive radiations with a key role for diversity dependence. Am. Nat. 180:75–89. [DOI] [PubMed] [Google Scholar]
Etienne R., Haegeman B., Stadler T., Aze T., Pearson P., Purvis A., Phillimore A. (2012). Diversity-dependence brings molecular phylogenies closer to agreement with the fossil record. Proc. Roy. Soc. B 279:1300–1309. [DOI] [PMC free article] [PubMed] [Google Scholar]
FitzJohn R.G. (2010). Quantitative traits and diversification. Syst. Biol. 59:619–633. [DOI] [PubMed] [Google Scholar]
Goldberg E.E., Lancaster L.T., Ree R.H. (2011). Phylogenetic inference of reciprocal effects between geographic range evolution and diversification. Syst. Biol. 60:451–465. [DOI] [PubMed] [Google Scholar]
Höhna S. (2014). Likelihood inference of non-constant diversification rates with incomplete taxon sampling. PLoS One 9:e84184. [DOI] [PMC free article] [PubMed] [Google Scholar]
Höhna S., May M.R., Moore B.R. (2015). TESS: Bayesian inference of lineage diversification rates from (incompletely sampled) phylogenies in R. Bioinformatics 32:789–791. [DOI] [PubMed] [Google Scholar]
Höhna S., Stadler T., Ronquist F., Britton T. (2011). Inferring speciation and extinction rates under different sampling schemes. Mol. Biol. Evol. 28:2577–2589. [DOI] [PubMed] [Google Scholar]
Kendall D.G. (1949). Stochastic processes and population growth. J. Roy. Stat. Soc. B 11:230–282. [Google Scholar]
Kubo T., Iwasa Y. (1995). Inferring the rates of branching and extinction from molecular phylogenies. Evolution 49:694–704. [DOI] [PubMed] [Google Scholar]
Leventhal G.E., Günthard H.F., Bonhoeffer S., Stadler T. (2014). Using an epidemiological model for phylogenetic inference reveals density dependence in HIV transmission. Mol. Biol. Evol. 31:6–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
Maddison W., Midford P., Otto S. (2007). Estimating a binary character’s effect on speciation and extinction. Syst. Biol. 56:701–710. [DOI] [PubMed] [Google Scholar]
May M.R., Höhna S., Moore B.R. (2016). A Bayesian approach for detecting the impact of mass-extinction events when rates of lineage diversification vary. Methods Ecol. Evol. 7 (in press). [Google Scholar]
Moore B.R., Chan K.M.A., Donoghue M.J. (2004). Detecting diversification-rate variation in supertrees. In Phylogenetic Supertrees. Springer; Netherlands: pp. 487–533. [Google Scholar]
Moore B.R., Donoghue M.J. (2009). A Bayesian approach for evaluating the impact of historical events on rates of diversification. Proc. Natl Acad. Sci. 106:4307–4312. [DOI] [PMC free article] [PubMed] [Google Scholar]
Morlon H., Parsons T., Plotkin J. (2011). Reconciling molecular phylogenies with the fossil record. Proc. Natl Acad. Sci. 108:16327–16332. [DOI] [PMC free article] [PubMed] [Google Scholar]
Morlon H., Potts M., Plotkin J. 2010. Inferring the dynamics of diversification: a coalescent approach. PLoS Biol. 8:e1000493. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nee S., May R.M., Harvey P.H. (1994). The reconstructed evolutionary process. Phil. Trans. Roy. Soc. B 344:305–311. [DOI] [PubMed] [Google Scholar]
Paradis E., Claude J., Strimmer K. (2004). APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20:289–290. [DOI] [PubMed] [Google Scholar]
Pennell M.W., Eastman J.M., Slater G.J., Brown J.W., Uyeda J.C., FitzJohn R.G., Alfaro M.E., Harmon L.J. (2014). geiger v2.0: an expanded suite of methods for fitting macroevolutionary models to phylogenetic trees. Bioinformatics 30:2216–2218. [DOI] [PubMed] [Google Scholar]
Pyron R.A., Burbrink F.T. (2013). Phylogenetic estimates of speciation and extinction rates for testing ecological and evolutionary hypotheses. Trends Ecol. Evol. 28:729–736. [DOI] [PubMed] [Google Scholar]
R Core Team. 2013. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]
R Development Core Team. 2009. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]
Rabosky D., Lovette I. (2008). Explosive evolutionary radiations: decreasing speciation or increasing extinction through time? Evolution 62:1866–1875. [DOI] [PubMed] [Google Scholar]
Rabosky D.L. (2010). Extinction rates should not be estimated from molecular phylogenies. Evolution 64:1816–1824. [DOI] [PubMed] [Google Scholar]
Rabosky D.L. (2014). Automatic detection of key innovations, rate shifts, and diversity-dependence on phylogenetic trees. PLoS One 9:e89543. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rabosky D.L., Donnellan S.C., Talaba A.L., Lovette I.J. (2007). Exceptional among-lineage variation in diversification rates during the radiation of Australia’s most diverse vertebrate clade. Proc. Roy. Soc. B 274:2915–2923. [DOI] [PMC free article] [PubMed] [Google Scholar]
Raup D.M. (1985). Mathematical models of cladogenesis. Paleobiology 11:42–52. [Google Scholar]
Stadler T. (2010). Sampling-through-time in birth-death trees. J. Theor. Biol. 267:396–404. [DOI] [PubMed] [Google Scholar]
Stadler T. (2011). Mammalian phylogeny reveals recent diversification rate shifts. Proc. Natl Acad. Sci. USA 108:6187–6192. [DOI] [PMC free article] [PubMed] [Google Scholar]
Stadler T. (2013). TreeSim: Simulating trees under the birth-death model. R package version 1.8. [Google Scholar]
Yule G.U. (1924). A mathematical theory of evolution, based on the conclusions of Dr. J. C. Wills, F. R. S. Phil. Trans. Roy. Soc. Lond. Biol. 213:21–87. [Google Scholar]

[B1] Alfaro M.E., Huelsenbeck J.P. (2006). Comparative performance of Bayesian and AIC-based measures of phylogenetic model uncertainty. Syst. Biol. 55:89–96. [DOI] [PubMed] [Google Scholar]

[B2] Alfaro M.E., Santini F., Brock C., Alamillo H., Dornburg A., Rabosky D.L., Carnevale G., Harmon L.J. (2009). Nine exceptional radiations plus high turnover explain species diversity in jawed vertebrates. Proc. Natl Acad. Sci. 106:13410–13414. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] Bailey N.T. (1964). The elements of stochastic processes with applications to the natural sciences. New York: John Wiley & Sons. [Google Scholar]

[B4] Boettiger C., Coop G., Ralph P. (2012). Is your phylogeny informative? Measuring the power of comparative methods. Evolution 66:2240–2251. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] Burnham K.P., Anderson D.R. (2003). Model selection and multimodel inference: a practical information-theoretic approach. Springer Science & Business Media. [Google Scholar]

[B6] Chan K.M.A., Moore B.R. (2005). SymmeTREE: whole-tree analysis of differential diversification rates. Bioinformatics 21:1709–1710. [DOI] [PubMed] [Google Scholar]

[B7] Etienne R., Haegeman B. (2012). A conceptual and statistical framework for adaptive radiations with a key role for diversity dependence. Am. Nat. 180:75–89. [DOI] [PubMed] [Google Scholar]

[B8] Etienne R., Haegeman B., Stadler T., Aze T., Pearson P., Purvis A., Phillimore A. (2012). Diversity-dependence brings molecular phylogenies closer to agreement with the fossil record. Proc. Roy. Soc. B 279:1300–1309. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] FitzJohn R.G. (2010). Quantitative traits and diversification. Syst. Biol. 59:619–633. [DOI] [PubMed] [Google Scholar]

[B10] Goldberg E.E., Lancaster L.T., Ree R.H. (2011). Phylogenetic inference of reciprocal effects between geographic range evolution and diversification. Syst. Biol. 60:451–465. [DOI] [PubMed] [Google Scholar]

[B11] Höhna S. (2014). Likelihood inference of non-constant diversification rates with incomplete taxon sampling. PLoS One 9:e84184. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B12] Höhna S., May M.R., Moore B.R. (2015). TESS: Bayesian inference of lineage diversification rates from (incompletely sampled) phylogenies in R. Bioinformatics 32:789–791. [DOI] [PubMed] [Google Scholar]

[B13] Höhna S., Stadler T., Ronquist F., Britton T. (2011). Inferring speciation and extinction rates under different sampling schemes. Mol. Biol. Evol. 28:2577–2589. [DOI] [PubMed] [Google Scholar]

[B14] Kendall D.G. (1949). Stochastic processes and population growth. J. Roy. Stat. Soc. B 11:230–282. [Google Scholar]

[B15] Kubo T., Iwasa Y. (1995). Inferring the rates of branching and extinction from molecular phylogenies. Evolution 49:694–704. [DOI] [PubMed] [Google Scholar]

[B16] Leventhal G.E., Günthard H.F., Bonhoeffer S., Stadler T. (2014). Using an epidemiological model for phylogenetic inference reveals density dependence in HIV transmission. Mol. Biol. Evol. 31:6–17. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B17] Maddison W., Midford P., Otto S. (2007). Estimating a binary character’s effect on speciation and extinction. Syst. Biol. 56:701–710. [DOI] [PubMed] [Google Scholar]

[B18] May M.R., Höhna S., Moore B.R. (2016). A Bayesian approach for detecting the impact of mass-extinction events when rates of lineage diversification vary. Methods Ecol. Evol. 7 (in press). [Google Scholar]

[B19] Moore B.R., Chan K.M.A., Donoghue M.J. (2004). Detecting diversification-rate variation in supertrees. In Phylogenetic Supertrees. Springer; Netherlands: pp. 487–533. [Google Scholar]

[B20] Moore B.R., Donoghue M.J. (2009). A Bayesian approach for evaluating the impact of historical events on rates of diversification. Proc. Natl Acad. Sci. 106:4307–4312. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B21] Morlon H., Parsons T., Plotkin J. (2011). Reconciling molecular phylogenies with the fossil record. Proc. Natl Acad. Sci. 108:16327–16332. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B22] Morlon H., Potts M., Plotkin J. 2010. Inferring the dynamics of diversification: a coalescent approach. PLoS Biol. 8:e1000493. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B23] Nee S., May R.M., Harvey P.H. (1994). The reconstructed evolutionary process. Phil. Trans. Roy. Soc. B 344:305–311. [DOI] [PubMed] [Google Scholar]

[B24] Paradis E., Claude J., Strimmer K. (2004). APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20:289–290. [DOI] [PubMed] [Google Scholar]

[B25] Pennell M.W., Eastman J.M., Slater G.J., Brown J.W., Uyeda J.C., FitzJohn R.G., Alfaro M.E., Harmon L.J. (2014). geiger v2.0: an expanded suite of methods for fitting macroevolutionary models to phylogenetic trees. Bioinformatics 30:2216–2218. [DOI] [PubMed] [Google Scholar]

[B26] Pyron R.A., Burbrink F.T. (2013). Phylogenetic estimates of speciation and extinction rates for testing ecological and evolutionary hypotheses. Trends Ecol. Evol. 28:729–736. [DOI] [PubMed] [Google Scholar]

[B27] R Core Team. 2013. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]

[B28] R Development Core Team. 2009. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]

[B29] Rabosky D., Lovette I. (2008). Explosive evolutionary radiations: decreasing speciation or increasing extinction through time? Evolution 62:1866–1875. [DOI] [PubMed] [Google Scholar]

[B30] Rabosky D.L. (2010). Extinction rates should not be estimated from molecular phylogenies. Evolution 64:1816–1824. [DOI] [PubMed] [Google Scholar]

[B31] Rabosky D.L. (2014). Automatic detection of key innovations, rate shifts, and diversity-dependence on phylogenetic trees. PLoS One 9:e89543. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B32] Rabosky D.L., Donnellan S.C., Talaba A.L., Lovette I.J. (2007). Exceptional among-lineage variation in diversification rates during the radiation of Australia’s most diverse vertebrate clade. Proc. Roy. Soc. B 274:2915–2923. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B33] Raup D.M. (1985). Mathematical models of cladogenesis. Paleobiology 11:42–52. [Google Scholar]

[B34] Stadler T. (2010). Sampling-through-time in birth-death trees. J. Theor. Biol. 267:396–404. [DOI] [PubMed] [Google Scholar]

[B35] Stadler T. (2011). Mammalian phylogeny reveals recent diversification rate shifts. Proc. Natl Acad. Sci. USA 108:6187–6192. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B36] Stadler T. (2013). TreeSim: Simulating trees under the birth-death model. R package version 1.8. [Google Scholar]

[B37] Yule G.U. (1924). A mathematical theory of evolution, based on the conclusions of Dr. J. C. Wills, F. R. S. Phil. Trans. Roy. Soc. Lond. Biol. 213:21–87. [Google Scholar]

PERMALINK

How Well Can We Detect Lineage-Specific Diversification-Rate Shifts? A Simulation Study of Sequential AIC Methods

Michael R May

Brian R Moore

Abstract

Figure 1.

The Likelihood Function in MEDUSA is Incorrect

Use of the AIC to Select Diversification Models may be Problematic

Methods

Simulation Study

Results

MEDUSA has a High False-Discovery Rate

Figure 2.

MEDUSA Provides Biased Parameter Estimates

Figure 3.

Dicussion

The Statistical Behavior of MEDUSA

Prospects for Addressing the $Δ$ AIC Threshold in MEDUSA

Prospects for Resolving Issues with the Likelihood Function of MEDUSA

Conclusions

Supplementary Material

Funding

Acknowledgments

Appendix 1

The Probability Density of an Internal Branch

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

How Well Can We Detect Lineage-Specific Diversification-Rate Shifts? A Simulation Study of Sequential AIC Methods

Michael R May

Brian R Moore

Abstract

Figure 1.

The Likelihood Function in MEDUSA is Incorrect

Use of the AIC to Select Diversification Models may be Problematic

Methods

Simulation Study

Results

MEDUSA has a High False-Discovery Rate

Figure 2.

MEDUSA Provides Biased Parameter Estimates

Figure 3.

Dicussion

The Statistical Behavior of MEDUSA

Prospects for Addressing the ΔAIC Threshold in MEDUSA

Prospects for Resolving Issues with the Likelihood Function of MEDUSA

Conclusions

Supplementary Material

Funding

Acknowledgments

Appendix 1

The Probability Density of an Internal Branch

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Prospects for Addressing the $Δ$ AIC Threshold in MEDUSA