Skip to main content
Systematic Biology logoLink to Systematic Biology
. 2021 Jun 24;71(1):172–189. doi: 10.1093/sysbio/syab049

Unifying Phylogenetic Birth–Death Models in Epidemiology and Macroevolution

Ailene MacPherson 1,2,3,, Stilianos Louca 4,5, Angela McLaughlin 6,7, Jeffrey B Joy 8,9,10, Matthew W Pennell 11
Editor: James Albert
PMCID: PMC8972974  PMID: 34165577

Abstract

Birth–death stochastic processes are the foundations of many phylogenetic models and are widely used to make inferences about epidemiological and macroevolutionary dynamics. There are a large number of birth–death model variants that have been developed; these impose different assumptions about the temporal dynamics of the parameters and about the sampling process. As each of these variants was individually derived, it has been difficult to understand the relationships between them as well as their precise biological and mathematical assumptions. Without a common mathematical foundation, deriving new models is nontrivial. Here, we unify these models into a single framework, prove that many previously developed epidemiological and macroevolutionary models are all special cases of a more general model, and illustrate the connections between these variants. This unification includes both models where the process is the same for all lineages and those in which it varies across types. We also outline a straightforward procedure for deriving likelihood functions for arbitrarily complex birth–death(-sampling) models that will hopefully allow researchers to explore a wider array of scenarios than was previously possible. By rederiving existing single-type birth–death sampling models, we clarify and synthesize the range of explicit and implicit assumptions made by these models. [Birth–death processes; epidemiology; macroevolution; phylogenetics; statistical inference.]


Evolutionary, demographic, and epidemiological pro- cesses leave a footprint in the branch length distribution and topology of reconstructed phylogenetic trees. This insight has inspired a huge effort to extract information about these processes by fitting stochastic models. For example, in molecular epidemiology, researchers have leveraged the fact that for many viral pathogens, such as HIV and SARS-CoV-2, accumulate genetic diversity on the timescale of transmission (Drummond et al. 2003; Duffy et al. 2008). This genetic diversity can be used to reconstruct the evolutionary relationships between viral variants sampled from different hosts, which in turn can help elucidate the epidemiological dynamics of a pathogen over time (Grenfell et al. 2004; Volz 2012). Similarly, phylogenetic trees can provide unique insights into the temporal variation in speciation and extinction rates (Morlon 2014).

Phylogenetic branching models can be broadly grouped into two classes. The first, based on Kingman’s coalescent process (Kingman 1982), has been widely used to examine changes in the historical population size of pathogens (Pybus et al. 2000; Strimmer and Pybus 2001; Drummond et al. 2005; Volz et al. 2009). These coalescent methods have also been applied to reconstruct macroevolutionary dynamics (Morlon et al. 2010). Coalescent models are well suited for estimating deterministic population dynamics; however, fitting highly stochastic processes, such as the dynamics of an emerging pathogen, is computationally intensive and in some cases the assumptions of the coalescent may not be appropriate (Stadler et al. 2015; Boskova et al. 2014; Volz and Frost 2014). The second class of models, which are collectively referred to as birth–death-sampling (BDS) models (Kendall 1948; Maddison et al. 2007; Stadler 2009; Stadler 2010), is well suited for stochastic scenarios, and are thus becoming an increasingly favorable and popular alternative to coalescent models in epidemiology (Stadler et al. 2012) and have long been the foundation of most macroevolutionary studies—both for inferring speciation and extinction dynamics (Raup 1985; Nee et al. 1994) and for estimating divergence times (Gernhard 2008; Heath et al. 2014). As the name implies, the BDS process includes three types of events: birth (pathogen transmission between hosts, or speciation in a macroevolutionary context), death (host death or recovery, or extinction in macroevolution), and sampling (including fossil collection in macroevolution).

In the context of epidemiology, BDS models have the additional property that the model parameters, which can be estimated from viral sequence data, explicitly correspond to parameters in classic structured epidemiological models that are often fit to case surveillance data. If we reparameterize these models, we can describe the dynamics of the basic and effective reproductive ratios (Inline graphic and Inline graphic, respectively) over time (Stadler et al. 2012; Stadler et al. 2013) (see Box 1). A common research aim is to describe how the frequency of birth, death, and sampling events, and other derived variables such as Inline graphic, change throughout the course of an epidemic. There has been less work in macroevolution linking the parameters of a BDS model to those of an underlying more mechanistic model (but see Ezard et al. 2016) but this seems like a promising avenue for future development. As we detail below and in the Supplementary material available at Dryad at https://doi.org/10.5281/zenodo.5028470, there has been an astounding rise in the variety and complexity of BDS model variants. A key assumption in the specification of BDS submodels is whether all lineages alive at some time point are exchangeable (Stadler 2013) (such models are hereafter “single-type” models), meaning they diversify according to the same process, or if rather the diversification process is variable (“multitype” models; e.g., Maddison et al. 2007; FitzJohn 2012; Stadler and Bonhoeffer 2013; Rasmussen and Stadler 2019; Barido-Sottani et al. 2018), with lineages belonging to one of multiple possible states each characterized by a unique process. Each of these diversification processes can then be characterized by different dynamical assumptions. In the epidemiological case, these assumptions specify, for example, the nature of viral transmission and the sampling procedure (Stadler et al. 2013; Kühnert et al. 2014; Gavryushkina et al. 2014). While typically not explicitly tied to mechanistic evolutionary processes, there are a similar abundance of dynamical assumptions employed in the macroevolutionary context specifying the nature of biodiversity change through time (Nee 2006; Gernhard 2008; Morlon et al. 2011; Stadler 2011; Morlon 2014; Heath et al. 2014; Louca 2020).

Box 1: The Connection between BDS and SIR Models

The single-type BDS model is intimately related to the SIR compartmental model used in classic theoretical epidemiology. This connection illustrates the explicit and implicit assumptions of the general BDS model and its sub models. Here, we define the SIR epidemiological model, discuss how it can inform and be informed by these diversification models, and examine the shared assumptions of the two frameworks.

The SIR Model

The SIR model partitions the host population via infection status into susceptible (S), infected (I), and recovered (R) hosts. Infection of susceptible hosts occurs at a per-capita rate Inline graphic. Infected hosts may recover (at rate Inline graphic), die of virulent cases (at rate Inline graphic), or be sampled (at rate Inline graphic). The cumulative number of sampled hosts is represented in the SIR model (Figure B.1 top) by Inline graphic. Upon sampling, infected hosts may be treated and hence effectively recover with probability Inline graphic. Hosts that have recovered from infection exhibit temporary immunity to future infection which wanes at rate Inline graphic. The special case of the SIR model with no immunity (the SIS model) is obtained in the limit as Inline graphic. In addition to these epidemiological processes, the SIR model includes demographic processes, such as host birth (rate Inline graphic) and death from natural causes (rate Inline graphic). While not shown explicitly in the figure, these epidemiological and demographic rates may change over time as a result of host behavioral change, pharmaceutical and nonpharmaceutical interventions, or host/pathogen evolution.

The BDS Model

The BDS model follows the number of sampled and unsampled viral lineages over time, analogous to the Inline graphic and Inline graphic classes of the SIR model. A key element of general BDS model is that birth and death rates may vary over time. This time dependence may be either continuous (Morlon et al. 2011; Rabosky and Lovette 2008b) or discrete (Stadler 2011; Stadler and Bonhoeffer 2013; Gavryushkina et al. 2014; Kühnert et al. 2014) Although arbitrarily time-dependent, the birth, death, and sampling rates in the general BDS model are assumed to be diversity-independent, analogous to the assumption of density-dependent transmission (pseudo mass action) in the SIR model (Keeling and Rohani 2008). Incorporating such diversity dependence into macroevolutionary models has been shown to increase the accuracy of extinction rate estimates and are necessary to accurately capture the saturation of diversity (Etienne et al. 2012). While some forms of diversity-dependence in diversification rates may be incorporated implicitly capturing deterministic diversity dependence as time dependence (Rabosky and Lovette 2008a), stochastic diversity-dependence (Etienne and Rosindell 2012) goes beyond the scope of the BDS models considered here.

The single-type BDS model assumes all viral lineages are exchangeable—this has several implications. First, all viral lineages are epidemiologically identical hence all mutations between them are neutral. Incorporating nonneutral genetic variation requires a multitype approach as in Equation (16). Second, transmission is independent of lineage age. In the macroevolutionary case, such age-dependence has been suggested to reflect niche differentiation in novel species (Hagen et al. 2015) and in the epidemiological case may reflect adaptation towards increased transmissibility following a host species-jumping event. Third, lineage exchangeability is reflected in the absence of an exposed (E) class in the SIR model in which hosts can, for example, transmit infections but not be sampled or vice versa. Finally, the single-type BDS model assumes all lineages are sampled at random and does not include submodels with nonrandom representation of lineages (Stadler et al. 2012).

Model Connections

Given their shared model assumptions, the single-type BDS model can be constrained explicitly to reflect an underlying SIR epidemic by setting the viral birth rate equal to the per-capita transmission rate of the infectious class Inline graphic and the viral death rate to the infectious recovery or removal rate Inline graphic, whereas the sampling rate Inline graphic is identical across models (Figure B1a). While constraining the birth, death, and sampling rates in this manner can be used to parameterize compartmental models (Kühnert et al. 2014) doing so is an approximation assuming independence between the exact timing of transmission, recovery or removal from the population, and sampling events in the SIR model and birth, death, and sampling events in the diversification model. The resulting tree likelihood in terms of the compartmental model is given by:

graphic file with name Equation1.gif (B1)

While they are not submodels of the general BDS process given by equation 13, likelihood models have been developed that capture the full nonindependence of viral diversification and epidemiological dynamics for the SIR model specifically (Leventhal et al. 2012) and in compartmental models in general (Vaughan et al. 2019). The connection between the BDS process and SIR epidemiological models can also be used after the diversification rates are inferred to estimate the basic and effective reproductive rates (Stadler et al. 2012; Stadler and Bonhoeffer 2013). Specifically, the effective reproductive rate at time Inline graphic before the present day is given by Inline graphic. Although the SIR model is a useful epidemiological model for is simplicity, realistically modeling epidemic dynamics requires far more complex compartmental models. As reflected by their shared structure, the application of the single-type BDS model is restricted, however, to the assumptions of the SIR model alone and further methodological advances in multitype modeling are necessary for direct inference for the larger class of epidemiological models.

This flourishing of methods and models has facilitated critical insights into epidemics (du Plessis and Stadler 2015; Joy et al. 2016) and the origins of contemporary biodiversity (Morlon 2014; Schluter and Pennell 2017). However, this diversity of models has made it difficult to trace the connections between variants and to understand the precise epidemiological, evolutionary, and sampling processes that differ between them. Furthermore, despite their apparent similarities, these models have been derived on a case-by-case basis using different notation and techniques; this creates a substantial barrier for researchers working to develop novel models for new situations. And critically, it is imperative that we understand the general properties of BDS phylogenetic models and the limits of inferences from them (Louca and Pennell 2020a; Louca et al. 2021), and this is difficult to do without considering the full breadth of possible scenarios. Here, we address all of these challenges by unifying the whole class of phylogenetic BDS models. We do so by first deriving a likelihood for general single- and multitype BDS models; in the general case, we do not assume anything about the functional forms (i.e., temporal dynamics) of the various parameters including the sampling rate through time, the possibility of sampling ancestors (or not), or how the process was conditioned. While such general models may be useful for studying the mathematical properties of BDS models as a whole (Lambert and Stadler 2013; Louca and Pennell 2020a; Louca et al. 2021), statistical inference from these models requires researchers to make further constraints on the process. We prove that existing BDS model variants are indeed submodels of the more general case—and thereby clarify the specific assumptions made by different models—and provide a standardized notation and technique for deriving these and other submodels that have not previously been considered in the literature.

The Single-type Birth–Death-Sampling Model

Model specification

The BDS stochastic process begins with a single lineage at time Inline graphic before the present day. We note that this may be considerably older than the age of the most recent common ancestor of an observed sample which is given by Inline graphic. While we focus primarily on applications to epidemiology, our approach is agnostic to whether the rates are interpreted as describing pathogen transmission or macroevolutionary diversification.

In the model, transmission/speciation results in the birth of a lineage and occurs at rate Inline graphic, where Inline graphic (Inline graphic) is measured in units of time before the present day (Inline graphic), such that Inline graphic can be time-dependent. We make the common assumption that lineages in the viral phylogeny coalesce exactly at transmission events, thus ignoring the within-host coalescent processes in the donor (Romero-Severson et al. 2016). Throughout, we will use Inline graphic as a a general time variable and Inline graphic to denote the time at which a specific event Inline graphic occurs as measured in units of time before the present day (see Supplementary Table S1). Lineage extinction, resulting from host recovery or death in the epidemiological case or the death of all individuals in a population in the macroevolutionary case, occurs at time-dependent rate Inline graphic. We allow for two distinct types of sampling: lineages are either sampled according to a Poisson process through time Inline graphic or binomially at very short intervals, which we term “concerted sampling attempts” (CSAs), where lineages at some specified time Inline graphic are sampled with probability Inline graphic (Inline graphic denotes a vector of concerted sampling events at different time points). In macroevolutionary studies based only on extant lineages, there is no Poissonian sampling, but a CSA at the present (Inline graphic). In epidemiology, CSAs correspond to large-scale testing efforts (relative to the background rate of testing) in a short amount of time (relative to the rates of viral sequence divergence); for full explanation, see Appendix. We call these attempts rather than events because if Inline graphic is small or the infection is rare in the population, few or no samples may be obtained. CSAs can also be incorporated into the model by including infinitesimally short spikes in the sampling rate Inline graphic (more precisely, appropriately scaled Dirac distributions). Hence, for simplicity, in the main text we focus on the seemingly simpler case of pure Poissonian sampling through time except at the present-day, where we allow for a CSA to facilitate comparisons with macroevolutionary models; the resulting formulas can then be used to derive a likelihood formula for the case where past CSAs are included (see Appendix).

In the epidemiological case, sampling may be concurrent (or not) with host treatment or behavioral changes resulting in the effective extinction of the viral lineage. Hence, we assume that sampling results in the immediate extinction of the lineage with probability Inline graphic. As with the CSAs, this arbitrary time dependence allows for the incorporation of Dirac spikes in any of these variables, for example, with mass extinctions (Inline graphic) and lagerstätten in the fossil record (Inline graphic) (Magee and Höhna 2021). Similarly, in the case of past CSAs, we must include the probability, Inline graphic, that sampled hosts are removed from the infectious pool during the CSA at time Inline graphic. Poissonian sampling without the removal of lineages (Inline graphic) can be employed in the macroevolutionary case to explicitly model the collection of samples from the fossil record (such as the fossilized-birth–death process; Heath et al. 2014).

For our derivation, we make no assumption about the temporal dynamics of Inline graphic, Inline graphic, Inline graphic, or Inline graphic; each may be constant, or vary according to any arbitrary function of time given that it is biologically valid (nonnegative and between 0 and 1 in the case of Inline graphic). Specifically, the time-varying functions may be any piecewise-continuous functions of time with at most finite number of discontinuities (see Rate Assumption section in the Appendix). Note that these functions need not be differentiable. We make the standard assumption that at any given time any given lineage experiences a birth, death or sampling event independently of (and with the same probabilities as) all other lineages. We revisit this assumption in Box 1 where we discuss how the implicit assumptions of the single-type BDS process are well summarized by the diversification model’s relationship to the SIR epidemiological model. Our resulting general time-variable BDS process can be fully defined by the parameter set Inline graphic.

In order to make inference about the model parameters, we need to calculate the likelihood, Inline graphic, that an observed phylogeny, Inline graphic, is the result of a given BDS process. With respect to the BDS process there are two ways to represent the information contained in the phylogeny Inline graphic, both of which have been used in the literature, which we call the “edge” and “critical time” representations, respectively. We begin by deriving the likelihood in terms of the edge representation and later demonstrate how to reformulate the likelihood in terms of critical times. In the edge representation, the phylogeny is summarized as a set of edges in the mathematical graph that makes up the phylogeny, numbered 1–11 in Figure B.1c, and the types of events that occurred at each node. We define Inline graphic as the probability that the edge Inline graphic which begins at time Inline graphic and ends at time Inline graphic gives rise to the subsequently observed phylogeny between time Inline graphic and the present day. The likelihood of the model for the observed tree is then, is by definition Inline graphic: the probability density that the stem lineage (Inline graphic in Figure B.1c) gives rise to the observed phylogeny from the origin, Inline graphic, to the present day. We find that it is more intuitive to derive the likelihood in terms of the edge representation, as we show below; from this it is straightforward to derive the critical times formulation which results in mathematical simplification. Below, we present our five-step technique for the derivation of the tree likelihood.

Figure B.1.

Figure B.1.

BDS-SIR model connection. Top: The SIR epidemiological model. Black (gray) lines and classes represent rates and variables followed (in)directly by the BDS model. The SIR model can be used to constrain the rates of the BDS model (a). Simulated forward in time, the result of the BDS stochastic processes is a full tree (b) giving the complete genealogy of the viral population. Pruning away extinct and unsampled lineages produces the sampled tree (c). Arising from a BDS process, this sampled tree can be summarized in two ways. First by the set of edges (labeled 1–11) or as a set of critical times (horizontal lines) including: 1) the time of birth events (solid, Inline graphic) 2) terminal sampling times (dashed, Inline graphic), and 3) ancestral sampling times (dotted, Inline graphic). Given the inferred rates from a reconstructed sampled tree, these rates can be used to estimate characteristic parameters of the SIR model, for example, the basic or effective reproductive number.

Step 1. Deriving the Initial Value Problem (IVP) for Inline graphic

We derive the IVP for the likelihood density Inline graphic using an approach first developed by (Maddison et al., 2007). We begin by deriving the recursion equation for Inline graphic by considering all the possible events that could occur along edge Inline graphic between time Inline graphic and Inline graphic assuming that that Inline graphic is small enough such that at most one event is likely to occur.

graphic file with name Equation2.gif (1)

Here, Inline graphic is the probability that a lineage alive at time Inline graphic leaves no sampled descendants at the present day. We will examine this probability in more detail below. Assuming Inline graphic is small, we can approximate the above recursion equation as the following difference equation.

graphic file with name Equation3.gif (2)

By the definition of the derivative, we have:

graphic file with name Equation4.gif (3)

Equation (3) is known as the Kolmogorov backward equation of the BDS process (Feller 1949; Louca and Pennell 2020b). Beginning at time Inline graphic, the initial condition of Inline graphic depends on which event occurred at the beginning of edge Inline graphic.

graphic file with name Equation5.gif (4)

Together Equations (3) and (4) define the initial value problem for Inline graphic as a function of the probability Inline graphic.

Because the likelihood density Inline graphic is the solution to a linear differential equation with initial condition at time Inline graphic, we can express its solution as follows:

graphic file with name Equation6.gif (5)

where the auxiliary function, Inline graphic, is given by:

graphic file with name Equation7.gif (6)

This function, Inline graphic, maps the value of Inline graphic at time Inline graphic to its value at Inline graphic, and hence is known as the probability “flow” of the Kolmogorov backward equation (Louca and Pennell 2020b).

Step 2. Deriving the IVP for Inline graphic

We derive the IVP for Inline graphic in a similar manner as above, beginning with a difference equation.

graphic file with name Equation8.gif (7)

By the definition of a derivative, we have:

graphic file with name Equation9.gif (8)

where Inline graphic is the probability a lineage is sampled at the present day. The initial condition at time Inline graphic is therefore the probability that a lineage alive at the present day is not sampled. Given an analytical or numerical general solution to Inline graphic, we can find the likelihood by evaluating Inline graphic, as follows.

Step 3. Deriving the expression for Inline graphic.

Given the linear nature of the differential equation for Inline graphic and hence the representation in Equation (5)), the likelihood Inline graphic is given by the product over all the initial conditions times the product over the probability flow for each edge.

graphic file with name Equation10.gif (9)

where Inline graphic, Inline graphic, and Inline graphic are the times at which individual birth, terminal sampling and ancestral sampling events occur as we elaborated below.

Step 4. Representing Inline graphic in terms of critical times.

Equation (9) can be further simplified by removing the need to enumerate over all the edges of the phylogeny (the last term of Equation (9)) and writing Inline graphic in terms of the tree’s critical times (horizontal lines in Fig. B.1). The critical times of the tree are made up of three vectors, Inline graphic, Inline graphic, and Inline graphic, as well as the time of origin Inline graphic. The vector Inline graphic gives the time of each birth event in the phylogeny and has length Inline graphic where Inline graphic is the number of lineages sampled at the present day and Inline graphic is the number of terminal samples. Unless noted otherwise the elements of vector Inline graphic are listed in decreasing order, such that Inline graphic and hence Inline graphic is the time of the most recent common ancestor Inline graphic. The vector Inline graphic gives the timing of each terminal sample and hence has length Inline graphic whereas vector Inline graphic gives the timing of each ancestral sample and has length Inline graphic. With respect to the BDS likelihood then the sampled tree is summarized by Inline graphic. We note that the critical times only contain the same information as the edges as a result of the assumptions of the BDS process but are not generally equivalent representations of Inline graphic.

As a result of the linear nature of Inline graphic it is straightforward to rewrite the likelihood in Equation (9) in terms of the critical-time representation of the sampled tree. Defining

graphic file with name Equation11.gif (10)

the probability flow Inline graphic can be rewritten as the following ratio:

graphic file with name Equation12.gif (11)

This relationship allows us to rewrite the likelihood by expressing the product over the edges as two separate products, one over the start of each edge and the other over the end of each edge which in turn allows us to rearrange and cancel terms to obtain an alternative likelihood expression. Edges begin (value of Inline graphic) at either: 1) the tree origin, 2) a birth event resulting to two lineages, or 3) an ancestral sampling event. Edges end (values of Inline graphic) at either: 1) a birth event, 2) an ancestral sampling event, 3) a terminal sampling event, or 4) the present day. Hence we have:

graphic file with name Equation13.gif (12)

Note Inline graphic. While Equations (9) and (12) are numerically identical, the critical time expression is more convenient for application as it requires numerically evaluating only a single function Inline graphic as given by Equation (10).

Step 5. Conditioning the likelihood

While Equation (12) is equal to the basic model likelihood for the phylogeny Inline graphic, it is often appropriate to condition the tree likelihood on the tree exhibiting some property, for example the condition there being at least sampled lineage. Imposing a condition on the likelihood is done by multiplying by a factor Inline graphic. Various conditioning schemes are considered in section A4 and listed in Table S3 with the value of Inline graphic ranging in complexity from a constant to a general function of the model parameters. The resulting likelihood expression for the general BDS model is:

graphic file with name Equation14.gif (13)

Many Existing Models are Special Cases of this General BDS Model

A large variety of previously published BDS models in epidemiology and macroevolution are special cases of the general model presented here (for a summary of the models we investigated, see Supplementary Table S2. Indeed, we can obtain the likelihood of these models by adding mathematical constraints (i.e., simplifying assumptions) to the terms in Equation (13). Our work thus not only provides a consistent notation for unifying a multitude of seemingly disparate models, it also provides a concrete and numerically straightforward recipe for computing their likelihood functions. We recognize that there are many valid approaches for deriving tree likelihoods for BDS models with share many similarities with our own (e.g., Nee et al. 1994; Maddison et al. 2007; Gernhard 2008; Morlon et al. 2011; Lambert and Stadler 2013; Lambert 2018; Laudanno et al. 2020; Louca and Pennell 2020b) and do not claim ours is superior to these; however, we have found our technique to be intuitive and flexible. We have implemented the single-type BDS likelihood in the R package Inline graphic (Louca and Doebeli 2018), including routines for maximum-likelihood fitting of BDS models with arbitrary functional forms of the parameters given a phylogeny and routines for simulating phylogenies under the general BDS models (functions Inline graphic, Inline graphic and Inline graphic).

Figure 1 summarizes the simplifying assumptions that underlie common previously published BDS models; these assumptions generally fall into four categories: 1) assumptions about the functional form of birth, death, and sampling rates over time, 2) assumptions pertaining to the sampling of lineages, 3) the presence of mass-extinction events, and 4) the nature of the tree-conditioning as given by Inline graphic. Here, we provide a brief overview of the type of previously invoked constraints which are consistent (or not) with our unified framework; for full details on each specific case, we refer readers to the Supplementary material. While we illustrate these constraints within the single-type context, analogous assumptions can be made within the multitype context examined in the following section.

Figure 1.

Figure 1.

Submodel assumptions. Rate, sampling, mass extinction, and conditioning assumptions of existing submodels of the general time-variable BDS process. The key points are that i) each of the previously developed models we considered can be obtained by adding specific combinations of constraints to the various parameters of the general BDS model; and ii) that there are many plausible, and potentially biologically informative combinations of constraints that have not been considered by researchers in epidemiology or macroevolution.

In regards to rate assumptions, many early BDS models (Stadler 2009; Stadler 2010; Stadler et al. 2012) assumed that the birth, death, and sampling rates remained constant over time. This is mathematically and computationally convenient since an analytical solution can easily be obtained for Inline graphic. In the epidemiological case, holding Inline graphic constant, however, implies that the number of susceptible hosts is effectively constant throughout the epidemic and/or that the population does not change its behavior over time; this is an unrealistic assumption given seasonal changes or changes in response to the disease itself. As such, this assumption is only really valid for small time periods or the early stages of an epidemic. This is useful for estimating the basic reproductive number, Inline graphic, of the SIR model (Box 1) but not for the effective reproductive number Inline graphic at later time points (Stadler et al. 2012).

A similarly tractable, but more epidemiologically relevant, model is known as the “birth–death-skyline” variant (Stadler and Bonhoeffer, 2013; Gavryushkina et al., 2014), in which rates are piecewise-constant functions through time (like the constant rate model, there is also an analytical way to calculate the likelihood of this model; see Appendix). The BDS skyline model has been implemented under a variety of additional assumptions in the Bayesian phylogenetics software BEAST2 (Bouckaert et al. 2019). The BDS skyline model has also been extended by (Kühnert et al., 2014) to infer the the parameters of an underlying stochastic SIR model. In this case the diversification model parameters Inline graphic are random variables that emerge from stochastic realizations of the epidemiological model given by Inline graphic, see Equation (B1). Finally, the birth–death skyline model with piecewise constant rates can also be applied in the macroevolutionary case when no sampling occurs through time, Inline graphic (Stadler 2011).

In addition to imposing constraints on the temporal variation in the rates, previously derived submodels have considered a variety of different assumptions about the nature of the sampling process. Most notably, in macroevolutionary studies, sampling of molecular data typically occurs only at the present day (Stadler 2009; Stadler 2011; Morlon et al. 2011) whereas past Poissonian sampling can be introduced to include the sampling of fossil data (Heath et al. 2014). In epidemiology, concerted sampling at the present day is likely biologically unrealistic (Stadler et al. 2012), though in some implementations of the models, such a sampling scheme has been imposed. These concerted sampling attempts prior to the present day as well as mass extinction events can be incorporated via the inclusion of Dirac distributions in the sampling and death rates, respectively. Finally, previous models often multiply the likelihood by a factor Inline graphic in order to condition on a particular observation (e.g., observing at least one lineage or exactly Inline graphic lineages), enumerate indistinguishable trees (e.g., accounting for possible orientations or unlabeled trees) (Gavryushkina et al. 2013; Gavryushkina et al. 2014; Stadler 2009), or to reflect known uncertainties. The “fossilized-birth–death” likelihood derived by Heath et al. (2014) for example, includes a factor that reflects the uncertainty in the attachment and placement of fossils on the macroevolutionary tree. This fossilized-birth–death process has been used to estimate divergence times and to model lineage diversification (Gavryushkina et al. 2017; Landis et al. 2021). Variants of the fossilized-birth–death process, for example including mass extinction events, are feasible and can be derived using our approach. We also note that models similar to the time-variable fossilized-birth–death process have been developed for cases when phylogenetic data is not available (i.e., when only including fossil occurrence data; see Silvestro et al. 2014; Lehtonen et al. 2017); we have not investigated how these models relate to our generalized BDS model but we speculate that it would be possible to also bring these models into a common framework with those that we have discussed. Supplementary material demonstrates how these submodels can be re-derived by either imposing the necessary constraints on the general likelihood formula given in Equation (13) or, alternatively, by starting from the combinations of assumptions and using the five-step procedure outlined above.

The Multitype Birth–Death-Sampling Model

A common extension of the single-type diversification models explored above is to consider cases where the diversification rates (Inline graphic) and probabilities (Inline graphic) vary among lineages as a function of a categorical “lineage type”. This lineage type can be defined in terms of specific (Maddison et al. 2007; Rasmussen and Stadler 2019) or unspecified traits (Beaulieu and O’Meara 2016) or trait combinations (FitzJohn 2012) (for reviews of these models see Morlon 2014; Ng and Smith 2014). Representing these lineage types as colors at nodes and along branches of the tree, we first extend the single-type model above by deriving the likelihood of a fully colored tree with topology Inline graphic where the states along all edges of the phylogeny are known as given by Inline graphic. The resulting likelihood is an extension of the likelihood first developed by Barido-Sottani et al. (2018), where the diversification rates and probabilities are allowed to vary arbitrarily through time. To illustrate that our derivation is indeed quite general, we follow the model developed (independently) by Magnuson-Ford and Otto (2012) and Goldberg and Igić (2012), where the state of lineages can change either anagenetically, with a lineage of type Inline graphic mutating to a type Inline graphic at rate Inline graphic or cladogenetically, with a lineage of type Inline graphic giving rise to a daughter lineage of type Inline graphic at rate Inline graphic. Lineages go extinct at a state-dependent rate Inline graphic and are sampled at rate Inline graphic. As in the single-type model, upon sampling lineages are removed from the population with probability Inline graphic whereas all lineages alive at the present day are sampled with a probability Inline graphic. As discussed in depth by Goldberg and Igić (2012), the other discrete variations of state-dependent diversification models (FitzJohn et al. 2009; Goldberg et al. 2011; FitzJohn 2012) fall out as special cases of this model. (See Ng and Smith 2014; Caetano et al. 2018; Louca and Pennell 2020b for further discussion of the connection between multitype models.)

We use the five-step technique specified above for the single-type case to derive the probability of observing a given colored tree under a general multitype model (see Supplementary material II). We first derive the initial value problem for the probability Inline graphic that an edge Inline graphic of type Inline graphic in the tree at time Inline graphic gives rise to the subsequently observed phylogeny. The edge Inline graphic here refers not to an edge in the topological tree, but to a segment of the tree all of one state between birth, sampling, or mutation events.

graphic file with name Equation15.gif (14)

Equation (16) distinguishes between multiple types of birth events as pictured in Supplementary Figure S1. Birth events may be symmetric, with both daughter lineages inheriting the parental type. The exchangeability of the resulting daughter lineages is reflected in the indicator variable Inline graphic which takes on value of 2 if Inline graphic and 1 otherwise. In contrast asymmetric birth events the resulting daughter lineages differ in type due to caldogenetic change. Importantly, the differential equation for Inline graphic is linear and hence has a known general solution Inline graphic. As in the single-type model Inline graphic is the probability flow (Louca and Pennell 2020b) mapping the probability Inline graphic from the initial state at time Inline graphic to the probability at time Inline graphic.

An analogous initial value problem can be derived for the probability Inline graphic, that a lineage of type Inline graphic alive at time Inline graphic leaves no observed descendants in the sampled tree.

graphic file with name Equation16.gif (15)

This is a nonlinear differential equation and must be solved numerically. Given the solution of Inline graphic and Inline graphic the likelihood for the fully colored tree is characterized by a series of critical times: first, Inline graphic the times at which a lineage of type Inline graphic gives birth to a lineage of type Inline graphic, Inline graphic the ages of tip samples of type Inline graphic, Inline graphic the ages of ancestral samples of type Inline graphic, and Inline graphic the times at which lineages are observed to transition events from type Inline graphic to type Inline graphic. The resulting likelihood is given by:

graphic file with name Equation17.gif (16)

Here, Inline graphic is an arbitrary form of conditioning as in Equation (13) and Inline graphic, a complete list of notation is given in Table S4.

Equation (16) gives the likelihood of a fully colored tree, the tree topology plus the state along each branch and at each node in the tree. This likelihood is a generalization of that presented by (Barido-Sottani et al., 2018, 2020). Maximizing Equation (16) while incrementally adding and removing changes in state along the branches of the tree can be used to identify clades with distinct diversification parameters. This method can be used, for example, to identify transmission clusters within a disease outbreak (Barido-Sottani et al. 2018). This likelihood is distinct from but related to post-traversal likelihood methods developed to infer state-dependent diversification rates given the known state of sampled lineages (e.g., Maddison et al. 2007; Magnuson-Ford and Otto 2012; Stadler and Bonhoeffer 2013). Specifically, these methods give the likelihood Inline graphic where Inline graphic is the state of present-day, Inline graphic, past Inline graphic, and ancestral, Inline graphic, sampled lineages. The relationship between the numerically obtained post-traversal likelihood and the closed-form fully colored likelihood (Equation (16)) is given by:

graphic file with name Equation18.gif (17)

Here, Inline graphic is one specific coloring of the tree Inline graphic (e.g., a maximum parsimony ancestral state reconstruction) that is consistent with the observed states. We include Equation (17) as it clarifies the relationship between these two different approaches that have been used to calculate multitype likelihoods in phylogenetics. Whether or not this is useful for inference is an open question as Inline graphic is challenging to compute (the details of which are beyond the scope of the present paper).

Concluding Remarks

In this article, we have unified a broad class of BDS models that have been widely used both in epidemiology and macroevolution. And in doing so, we have also presented a standardized notation and approach that can be used both for deriving the various submodels that have previously been studied as well as novel combinations of assumptions about the model parameters. The unification of these models clarifies the connections between BDS variants, facilitates the development of new variants tailored to specific scenarios, and provides a structure for understanding how results depend on model assumptions (Kirkpatrick et al. 2002; Lafferty et al. 2015; Louca and Pennell 2020a). And importantly, given the recent discovery of widespread nonidentifiability in birth–death processes fit to extant-only (Louca and Pennell 2020a) and serially sampled (Louca et al. 2021) phylogenetic data, there is a critical need to explore a much broader range of BDS models than were previously considered and the mathematical generalization presented here will be enable this.

Acknowledgments

We would like to thank Sally Otto for her thoughtful comments on this work.

Appendix: Adding assumptions to the general model

In this appendix, we demonstrate how one can obtain the likelihood of submodels with different sets of assumptions by applying constraints to the general likelihood. There are four classes of assumptions that are commonly applied in epidemiological and macroevolutionary studies. First, researchers can make assumptions about the functional form of the birth, death, and sampling rates. Here, we address two such unique assumptions: i) birth, death, and sampling rates are constant (see the Rate Assumptions section, Sections S1.1, S1.2, and S1.5 of the Supplementary material; and ii) birth, death, and sampling rates are piecewise-constant functions of time (see Piecewise-constant Rates section, Section S1.6 of the Supplementary material). The cases where birth, death, and sampling rates are defined by a stochastic or deterministic SIR model are mathematically analogous to the cases of the piecewise-constant and general time-variable models respectively. All additional constraints imposed will depend on the exact compartmental model used and hence we will not discuss them in detail in this section. The second major class of assumptions pertains to sampling. There are four such sampling assumptions: i) sampling happens only at the present day as in a birth–death model (see No Sampling at the Present Section, Sections S1.1, S1.3, and S1.4 of the Supplementary material) or as implemented in the “Birth Death Skyline Contemporary” prior in the BDSKY package in BEAST2; ii) the absence of concerted present-day sampling (see Birth–Death Models section, Section S1.5 of the Supplementary material); iii) the inclusion of ancestral samples with sampled descendants (Sections S1.6 and S1.7 of the Supplementary material; and iv) concerted sampling attempts (CSA) during which all lineages are sampled with a given probability (see Concerted Sampling Attempts section, Section S1.6 of the Supplementary material). The third assumption class considers the presence of mass extinction events (see Mass Extinction section, Section S1.5 of the Supplementary material). The fourth and final major class of assumptions deal with the conditioning of the likelihood. The various conditioning schemes are explored in below and summarized in Supplementary Table S3 available on Dryad.

Rate Assumptions

Constant Rates

  • Model assumptions: Constant diversification rates: Inline graphic, Inline graphic, Inline graphic, and constant removal probability Inline graphic.

  • The IVP for Inline graphic):  
    graphic file with name Equation19.gif
  • The IVP for Inline graphic:  
    graphic file with name Equation20.gif
    In this case the IVP for Inline graphic is a Bernoulli differential equation and has a known analytical solution. As given by Equation 1 in (Stadler, 2010) this solution is given by:
    graphic file with name Equation21.gif (A1)
  • The probability flow:  
    graphic file with name Equation22.gif
  • The likelihood:  
    graphic file with name Equation23.gif (A2)

Piecewise-Constant Rates

  • Model assumptions: Divide time into Inline graphic intervals defined by transition times  Inline graphic. Define rates and removal probabilities constant within a given interval.

    graphic file with name Equation24.gif
  • The IVP and solution for Inline graphic: Given the definitions of Inline graphic, Inline graphic, Inline graphic, and Inline graphic within each time interval the IVP for Inline graphic is identical to that given in Equations (3) and (4). If Inline graphic is the probability density within time interval Inline graphic than Inline graphic.

  • The IVP and solution for Inline graphic: As with Inline graphic, the IVP for Inline graphic is given by Equation (8). With the piecewise-constant rate assumptions, however, the general solution for Inline graphic between Inline graphic is known (similar to Equation (A1)). Defining Inline graphic and Inline graphic we have:
    graphic file with name Equation25.gif
    where Inline graphic for Inline graphic and Inline graphic.
  • The probability flow: We define a probability subflow within each time interval. Specifically, in the Inline graphicth time interval.

    graphic file with name Equation26.gif
    The complete flow can be expressed as a function of the subflows in the following manner:
    graphic file with name Equation27.gif (A3)
    where Inline graphic is the index of the time Inline graphic at or after time Inline graphic, that is, the largest index such that Inline graphic.
  • The likelihood: Given these piecewise definitions we substitute them into the general BDS likelihood (13).

    graphic file with name Equation28.gif
    where we use Inline graphic to denote the piecewise-constant assumption.

    We can simplify several of these products. Let Inline graphic be the number of birth events Inline graphic and Inline graphic the number of sampling events Inline graphic.

    graphic file with name Equation29.gif (A4)
    Let Inline graphic be the number of observed lineages alive at time Inline graphic. Because the number of observed lineages increases with each birth and decreases with each sampled tip, counting the root we have Inline graphic. Substituting the expressions for the into the likelihood and using the definition of Inline graphic we have:
    graphic file with name Equation30.gif (A5)

Sampling Assumptions

Birth–Death Models

  • Model assumptions: The birth–death model assumes that Inline graphic. Note that the probability of sampling a lineage given it is alive at the present day remains as Inline graphic (incomplete sampling).

  • IVP for Inline graphic:  
    graphic file with name Equation31.gif
  • IVP for Inline graphic:  
    graphic file with name Equation32.gif
    Note in this case Inline graphic equals Inline graphic, the probability a lineage leaves no sampled extant descendants. As demonstrated by (Morlon et al., 2011) there exists a general solution to this initial value problem, see Section ?? for more details. This general solution is given by:
    graphic file with name Equation33.gif
  • The probability flow: From (Morlon et al., 2011), the probability flow can be written as the following:
    graphic file with name Equation34.gif
  • The likelihood:  
    graphic file with name Equation35.gif (A6)

No Sampling at the Present

Here, we consider the case when Inline graphic. The likelihood follows exactly as in the general model case. The resulting likelihood expression is given by:

graphic file with name Equation36.gif (A7)

Note that in this case Inline graphic.

Concerted Sampling Attempts

  • Model assumptions: Here, we introduce Inline graphic concerted sampling attempts (CSA) at known points in time, Inline graphic. Like the CSA at the present day, and in contrast to the background Poissonian sampling rate, during the CSA at time Inline graphic every lineages is sampled with a fixed probability Inline graphic. In the derivation of the likelihood below, we must distinguish between three different sampling event types. First, past Poissonian sampling events are those that do not occur during CSAs. Second, past concerted sampling events are those that occur during a CSA at time Inline graphic. Finally, present concerted sampling events are those that occur at the present day Inline graphic. Past concerted sampling attempts can be included in the general model above by adding Inline graphic Dirac distributions to the Poisson sampling rate function. Namely,
    graphic file with name Equation37.gif (A8)
    where Inline graphic is the background Poissonian sampling rate and Inline graphic. The definition of Inline graphic comes from solving the CDF of the exponentially distribution for the “effective sampling rate” such that the probability of a lineage being sampled is Inline graphic.
  • IVP for Inline graphic:  
    graphic file with name Equation38.gif

    The solution to Inline graphic is given by Equations (5) and (6).

  • IVP for Inline graphic: As with Inline graphic, the IVP for Inline graphic is identical to that given for the general model in Equation (8). Except in rare cases the IVP must be solved numerically hence requiring numerical integration over Dirac distributions which can prove to be problematic.

    Note however, that when examining the integrals over the CSAs, a priori, it is a matter of convention whether the Dirac distribution should be considered as “integrated over” when located at the upper integration bound Inline graphic or at the lower integration bound Inline graphic. Whichever convention we chose, we must rigorously obey it so that the ratio Inline graphic correctly evaluates to Inline graphic whenever Inline graphic. Using the former convention, we can rewrite the probability Inline graphic at each concerted sampling time Inline graphic as:
    graphic file with name Equation39.gif
    where Inline graphic denotes the limit as time approaches Inline graphic from below. Hence, the probability Inline graphic at any time Inline graphic can be evaluated numerically by considering the dynamics between successive CSAs and at each CSA separately.
  • The probability flow: The probability flow is given by:
    graphic file with name Equation40.gif

    As with Inline graphic integration over the dirac distributions can be problematic and hence we rewrite this expression separating out these terms. Let Inline graphic be the oldest CSA occurring at or after time Inline graphic, that is, the largest index for which Inline graphic.

    graphic file with name Equation41.gif (A9)
    We define:
    graphic file with name Equation42.gif (A10)
    which means that we can rewrite Equation (A9) as:
    graphic file with name Equation43.gif (A11)
  • The likelihood: The edge representation of Inline graphic is given by:
    graphic file with name Equation44.gif
    The critical time representation of Inline graphic is given by:
    graphic file with name Equation45.gif
     
    graphic file with name Equation46.gif
    where Inline graphic is the number of tip samples (samples without descendants) obtained during the Inline graphicth CSA and Inline graphic is the number of ancestral samples (sequences with descendants). By changing how we enumerate birth, death, and sampling events we can greatly simplify this likelihood. First, let Inline graphic be the number of branching events at or before the Inline graphicth CSA. In other words, Inline graphic is the number of branching events if the tree were trimmed at the Inline graphicth CSA. Then:
    graphic file with name Equation47.gif (A12)
    Second, let Inline graphic be the number of past Poissonian sampling events before time Inline graphic. Then:
    graphic file with name Equation48.gif (A13)
    Finally, let Inline graphic be the number of past lineages sampled during a CSA at or before the CSA at time Inline graphic. Hence, Inline graphic. Then:
    graphic file with name Equation49.gif (A14)
    The likelihood hence simplifies to:
    graphic file with name Equation50.gif

    Let Inline graphic be the number of lineages that cross Inline graphic, that is, the number of lineages alive at time Inline graphic with sampled descendants at some younger age. Note that by this definition Inline graphic. Then Inline graphic is the number of tips in the tree had it been trimmed at age Inline graphic whereas Inline graphic is the number of branching events. Therefore we must have Inline graphic. This allows us to simplify the conditioned likelihood given below.

    graphic file with name Equation51.gif (A15)

Mass Extinction

  • Model assumptions: In addition to the Poisson birth death and sampling events considered in the general model, there are Inline graphic mass extinctions events occurring at times Inline graphic. During the Inline graphicth mass extinction event each lineage goes extinct with probability Inline graphic. As with concerted sampling such mass extinction events can be introduced into the model by adding a set of dirac-delta functions to the Poisson death rate, Inline graphic.

    graphic file with name Equation52.gif (A16)
    where Inline graphic.
  • IVP for Inline graphic: The initial value problem for Inline graphic is identical to that given in equation by Equations (3) and (4) except that Inline graphic is now includes the mass extinction events.

  • IVP for Inline graphic: The IVP for Inline graphic to that given by Equation (8) except where the extinction rate is given by Equation (A16). The solution to Inline graphic is obtained by numerical integration. Given the dirac-delta functions this numerical integration can be carried out in a piecewise manner integrating separately between and over each mass extinction event. Defining Inline graphic as the solution up to but not including the mass extinction event at time Inline graphic, we have:
    graphic file with name Equation53.gif

    The first term reflects the probability that a lineage that does not go extinct during the Inline graphicth mass extinction event leaves no observable offspring (with probability Inline graphic) whereas the second term reflects the fact that all lineages that go extinct during the Inline graphicth mass extinction leave no observed descendants with probability 1.

  • The probability flow: The solution to the IVP is once again given by Inline graphic where:
    graphic file with name Equation54.gif

    As with the CSAs, let Inline graphic be the last index Inline graphic such that Inline graphic. We can separate out the mass extinction terms in the following way.

    graphic file with name Equation55.gif
    where Inline graphic is defined as in Equation (A10).
  • The likelihood: Given these initial value problems the likelihood follows as in the general model.

    graphic file with name Equation56.gif
    As with the CSAs we can use relations analogous to Equations A12A14 to rewrite the likelihood:
    graphic file with name Equation57.gif (A17)
    where Inline graphic is defined as before as the number of lineages present at time Inline graphic.

Alternative Conditioning

Supplementary Table S3 lists a number of possible conditionings, Inline graphic that can be applied to the tree likelihood. First, is the trivial case of no conditioning Inline graphic which gives the probability of the observed tree including the stem edge between time Inline graphic and Inline graphic. To obtain the model likelihood excluding the stem edge, that is, conditioning of the Inline graphic, can be obtained by setting Inline graphic. Recall that the elements of Inline graphic are ordered such that Inline graphic is the first (oldest) birth event.

Acknowledging that one would not reconstruct a phylogeny without any sampled lineages, we can condition the likelihood on observing at least one sampled lineage (either at or before the present day) given the time of origin, Inline graphic. Or as with Inline graphic, conditioning on at least one sampled lineage given the Inline graphic. In order to have at least one sampled lineage and a most recent common ancestor, however, each daughter lineage of the common ancestor must have at least one descendent. Hence, we have Inline graphic. The general birth–death-sampling model assumes that all lineages alive at the present day are sampled with probability Inline graphic. As with concerted sampling attempts (CSAs) prior to the present day, this present day CSA may include the sampling of multiple lineages as well as possibly resulting in no sampled lineages. As with Inline graphic and Inline graphic we can condition the tree likelihood on observing at least one extant lineage at the present day. To do so, we define Inline graphic, the probability that a lineage alive at time Inline graphic has no extant descendants. Conditioning on the time of origin we have: Inline graphic. Conditioning on the time of the most recent common ancestor we have: Inline graphic, where now at least one of the two daughter lineages of the common ancestor has a present day sample. In many cases Inline graphic is modified, however, to condition on both daughter lineages having an extant sampled descendent: Inline graphic.

As an alternative to conditioning on at least one extant sampled descent, tree likelihoods can be conditioned on having exactly  Inline graphic sampled (extant) descendants. Let Inline graphic be the probability a lineage alive at time Inline graphic has exactly Inline graphic descendants. Although a general expression for Inline graphic is unknown, in the case of constant birth, death, and sampling rates (the case in which this form of conditioning has been applied), the expression for Inline graphic is given by (Gernhard, 2008), (Kendall, 1948) and Theorem 3.3 by (Stadler, 2010):

graphic file with name Equation58.gif

where, like Inline graphic, Inline graphic is given by Equation (10) evaluated with where Inline graphic. Given the time of origin we can condition on observing exactly Inline graphic lineages by setting Inline graphic. When Inline graphic is given instead, then the number of descendants of the two daughter lineages must add up to Inline graphic while both daughter lineages must still have at least one descendant (see (Stadler, 2010) Corollary 3.9).

graphic file with name Equation59.gif

While early BDS models often employed such conditioning (Stadler 2009; Stadler 2010), this form of conditioning has not been employed in many later models perhaps because the biological justification for such conditioning is vague.

The final form of conditioning used in the literature, which we will represent simply as Inline graphic, is the multiplication of the BDS likelihood by a constant to account for the enumeration over the possible indistinguishable representations of a given tree. The value of this constant depends on whether the tree considered is “labeled” and “oriented” (Gavryushkina et al. 2013) and whether, as in the derivation here, the vector of birth events, Inline graphic, is (un)ordered. Inclusion of such a constant should have no effect on the maximum likelihood inference of the model parameters given a specified tree. In cases where the constant is a function of the critical times (Heath et al. 2014), it can influence the inference when the parameters and the tree are jointly estimated.

Contributor Information

Ailene MacPherson, Zoology, University of British Columbia, Vancouver V6T 1Z4, Canada; Ecology and Evolutionary Biology, University of Toronto, Toronto M5S 3B2, Canada; Mathematics, Simon Fraser University, Burnaby V5A 1S6, Canada.

Stilianos Louca, Biology, University of Oregon, Eugene 97403, USA; Institute of Ecology and Evolution, University of Oregon, Eugene 97403, USA.

Angela McLaughlin, British Columbia Centre for Excellence in HIV/AIDS, Vancouver V6Z 1Y6, Canada; Bioinformatics, University of British Columbia, Vancouver V6T 1Z4, Canada.

Jeffrey B Joy, British Columbia Centre for Excellence in HIV/AIDS, Vancouver V6Z 1Y6, Canada; Bioinformatics, University of British Columbia, Vancouver V6T 1Z4, Canada; Medicine, University of British Columbia, Vancouver V6T 1Z4, Canada.

Matthew W Pennell, Zoology, University of British Columbia, Vancouver V6T 1Z4, Canada.

Supplementary Material

Data available from the Dryad Digital Repository: https://doi.org/10.5281/zenodo.5028470.

Funding

This work was supported by a Grant for Catalyzing Research Clusters awarded to the UBC Biodiversity Research Centre,NSF DEB Grant [#2028986 award to S.L. and M.W.P.]; a NSERC Discovery Grant to M.W.P.; and in part by the EEB department Postdoctoral Fellowship from the University of Toronto to A.M.; in part by a Canadian Institutes of Health Research (CIHR) doctoral award [#6557] to A.M.c.L..; a Genome Canada Bioinformatics and Computational Biology grant [287PHY], CIHR coronavirus rapid response program grant [440371] to J.B.J. and is grateful to the British Columbia Centre for Excellence in HIV/AIDS for additional funding support.

References

  1. Barido-Sottani J., Vaughan T.G., Stadler T.  2018. Detection of HIV transmission clusters from phylogenetic trees using a multi-state birth–death model. J. R. Soc. Interface  15:20180512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Barido-Sottani J., Vaughan T.G., Stadler, T.  2020. A multitype birth–death model for Bayesian inference of lineage-specific birth and death rates. Syst. Biol.  69:973–986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Beaulieu J.M., O’Meara B.C.  2016. Detecting hidden diversification shifts in models of trait-dependent speciation and extinction. Syst. Biol.  65:583–601. [DOI] [PubMed] [Google Scholar]
  4. Boskova V., Bonhoeffer S., Stadler T.  2014. Inference of epidemiological dynamics based on simulated phylogenies using birth-death and coalescent models. PLoS Comput. Biol.  10:1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bouckaert R., Vaughan T.G., Barido-Sottani, J., Duchêne S., Fourment M., Gavryushkina A., Heled J., Jones G., Kühnert D., Maio N.D., Matschiner M., Mendes F.K., Müller N.F., Ogilvie H.A., du Plessis, Popinga A., Rambaut A., Rasmussen D., Siveroni I., Suchard M.A., Wu C.-H., Xie D., Zhang C., Stadler T., Drummond A.J.  2019. BEAST 2.5: an advanced software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 15:e1006650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Caetano D.S., O’Meara B.C., Beaulieu J.M.  2018. Hidden state models improve state-dependent diversification approaches, including biogeographical models. Evolution  72:2308–2324. [DOI] [PubMed] [Google Scholar]
  7. Drummond A.J., Pybus O.G., Rambaut A., Forsberg R., Rodrigo A.G.  2003. Measurably evolving populations. Trends Ecol. Evol.  18:481–488. [Google Scholar]
  8. Drummond A.J., Rambaut A., Shapiro B., Pybus O.G.  2005. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol. Biol. Evol.  22:1185–1192. [DOI] [PubMed] [Google Scholar]
  9. du Plessis L., Stadler T.  2015. Getting to the root of epidemic spread with phylodynamic analysis of genomic data. Trends Microbiol.  23:383–386. [DOI] [PubMed] [Google Scholar]
  10. Duffy S., Shackelton L.A., Holmes E.C.  2008. Rates of evolutionary change in viruses: patterns and determinants. Nat. Rev. Genetics  9:267–276. [DOI] [PubMed] [Google Scholar]
  11. Etienne R.S., Haegeman B., Stadler T., Aze T., Pearson P.N., Purvis, A., Phillimore A.B. 2012. Diversity-dependence brings molecular phylogenies closer to agreement with the fossil record. Proc. R. Soc. B  279:1300–1309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Etienne R.S., Rosindell J.  2012. Prolonging the past counteracts the pull of the present: protracted speciation can explain observed slowdowns in diversification. Syst. Biol.  61:204–204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Ezard T.H.G., Quental T.B., Benton M.J.  2016. The challenges to inferring the regulators of biodiversity in deep time. Philos. Trans. R. Soc. B  371:20150216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Feller W.  1949. Proceedings of the [First] Berkeley Symposium on Mathematical Statistics and Probability. University of California Press, Berkeley California. [Google Scholar]
  15. FitzJohn R.G.  2012. Diversitree: comparative phylogenetic analyses of diversification in R. Methods Ecol. Evol.  3:1084–1092. [Google Scholar]
  16. FitzJohn R.G., Maddison W.P., Otto S.P.  2009. Estimating trait-dependent speciation and extinction rates from incompletely resolved phylogenies. Syst. Biol.  58:595–611. [DOI] [PubMed] [Google Scholar]
  17. Gavryushkina A., Heath T.A., Ksepka D.T., Stadler T., Welch D., Drummond A.J.  2017. Bayesian total-evidence dating reveals the recent crown radiation of penguins. Syst. Biol.  66:57–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gavryushkina A., Welch D., Drummond A.J.  2013. Recursive algorithms for phylogenetic tree counting. Algorithms Mol. Biol.  8:26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gavryushkina A., Welch D., Stadler T., Drummond A.J.  2014. Bayesian inference of sampled ancestor trees for epidemiology and fossil calibration. PLoS Comput. Biol.  10:1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gernhard T.  2008. The conditioned reconstructed process. J. Theor. Biol.  253:769–778. [DOI] [PubMed] [Google Scholar]
  21. Goldberg E.E., Igić B.  2012. Tempo and mode in plant breeding system evolution. Evolution  66:3701–3709. [DOI] [PubMed] [Google Scholar]
  22. Goldberg E.E., Lancaster L.T., Ree R.H.  2011. Phylogenetic inference of reciprocal effects between geographic range evolution and diversification. Syst. Biol.  60:451–465. [DOI] [PubMed] [Google Scholar]
  23. Grenfell B.T., Pybus O.G., Gog J.R., J. Wood L.N., Daly J.M., Mumford J.A., Holmes E.C.  2004. Unifying the epidemiological and evolutionary dynamics of pathogens. Science  303:327–332. [DOI] [PubMed] [Google Scholar]
  24. Hagen O., Hartmann K., Steel M., Stadler T.  2015. Age-dependent speciation can explain the shape of empirical phylogenies. Syst. Biol.  64:432–440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Heath T.A., Huelsenbeck J.P., Stadler T.  2014. The fossilized birth–death process for coherent calibration of divergence-time estimates. Proc. Natl. Acad. Sci. USA  111:E2957–E2966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Joy J.B., McCloskey R.M., Nguyen T., Liang R.H., Khudyakov Y., Olmstead A., Krajden M., Ward J.W., Harrigan P.R., Montaner J.S.G., Poon A.F.Y.  2016. The spread of hepatitis C virus genotype 1a in North America: a retrospective phylogenetic study. Lancet Infectious Dis.  16:698–702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Keeling M.J., Rohani P.  2008. Modeling infectious diseases: in humans and animals. Princeton University Press. [Google Scholar]
  28. Kendall D.G.  1948. On the generalized “birth-and-death” process. Ann. Math. Stat.  19:1–15. [Google Scholar]
  29. Kingman J.F.C.  1982. On the genealogy of large populations. J. Appl. Prob.  19:27–43. [Google Scholar]
  30. Kirkpatrick M., Johnson T., Barton N.  2002. General models of multilocus evolution.  Genetics  161:1727–1750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kühnert D., Stadler T., Vaughan T.G., Drummond A.J.  2014. Simultaneous reconstruction of evolutionary history and epidemiological dynamics from viral sequences with the birth–death SIR model. J. R. Soc. Interface  11:20131106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lafferty K.D., DeLeo G., Briggs C.J., Dobson A.P., Gross T., Kuris A.M.  2015. A general consumer-resource population model. Science  349:854–857. [DOI] [PubMed] [Google Scholar]
  33. Lambert A.  2018. The coalescent of a sample from a binary branching process. Theor. Popul. Biol.  122:30–35. [DOI] [PubMed] [Google Scholar]
  34. Lambert A., Stadler T.  2013. Birth–death models and coalescent point processes: the shape and probability of reconstructed phylogenies. Theor. Popul. Biol.  90:113–128. [DOI] [PubMed] [Google Scholar]
  35. Landis M.J., Eaton D.A.R., Clement W.L., Park B., Spriggs E.L., Sweeney P.W.  Edwards E.J., Donoghue M.J.  2021. Joint phylogenetic estimation of geographic movements and biome shifts during the global diversification of viburnum. Syst. Biol.  70:67–85. [DOI] [PubMed] [Google Scholar]
  36. Laudanno G., Haegeman B., Rabosky D.L., Etienne R.S.  2020. Detecting lineage-specific shifts in diversification: a proper likelihood approach. Syst. Biol.  70:389–407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Lehtonen S., Silvestro D., Karger D.N., Scotese C., Tuomisto H., Kessler M., Peña C., Wahlberg N., Antonelli A.  2017. Environmentally driven extinction and opportunistic origination explain fern diversification patterns. Sci. Rep.  7:4831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Leventhal G.E., Kouyos R., Stadler T., von Wyl V., Yerly S., Böni J., Cellerai C., Klimkait T., Günthard H.F., Bonhoeffer S.  2012. Inferring epidemic contact structure from phylogenetic trees. PLoS Comput. Biol.  8:e1002413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Louca S.  2020. Simulating trees with millions of species. Bioinformatics.  26:2907–2908. [DOI] [PubMed] [Google Scholar]
  40. Louca S., Doebeli M.  2018. Efficient comparative phylogenetics on large trees. Bioinformatics  34:1053–1055. [DOI] [PubMed] [Google Scholar]
  41. Louca S., McLaughlin A., MacPherson A., Joy J.B., Pennell M.W.  2021. Fundamental identifiability limits in molecular epidemiology. Mol. Biol. Evol. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Louca S., Pennell M.W.  2020a. Extant timetrees are consistent with a myriad of diversification histories. Nature  580:1–4. [DOI] [PubMed] [Google Scholar]
  43. Louca S., Pennell M.W.  2020b. A general and efficient algorithm for the likelihood of diversification and discrete-trait evolutionary models. Syst. Biol.  69:545–556. [DOI] [PubMed] [Google Scholar]
  44. Maddison W.P., Midford P.E., Otto S.P.  2007. Estimating a binary character’s effect on speciation and extinction. Syst. Biol.  56:701–710. [DOI] [PubMed] [Google Scholar]
  45. Magee A.F., Höhna S.  2021. Impact of K-Pg mass extinction event on crocodylomorpha inferred from phylogeny of extinct and extant taxa. bioRxiv 2021.01.14.426715. Available from: https://www.biorxiv.org/content/10.1101/2021.01.14.426715v1. [Google Scholar]
  46. Magnuson-Ford K., Otto S.P.  2012. Linking the investigations of character evolution and species diversification. Am. Nat.  180:225–245. [DOI] [PubMed] [Google Scholar]
  47. Morlon H.  2014. Phylogenetic approaches for studying diversification. Ecol. Lett.  17:508–525. [DOI] [PubMed] [Google Scholar]
  48. Morlon H., Parsons T.L., Plotkin J.B.  2011. Reconciling molecular phylogenies with the fossil record. Proc. Natl. Acad. Sci. USA  108:16327–16332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Morlon H., Potts M.D., Plotkin J B.  2010. Inferring the dynamics of diversification: a coalescent approach. PLoS Biol.  8:e1000493. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Nee S.  2006. Birth-death models in macroevolution. Annu. Rev. Ecol. Evol. Syst.  37:1–17. [Google Scholar]
  51. Nee S., May R.M., Harvey P.H.  1994. The reconstructed evolutionary process. Philos. Trans. R. Soc. B  344:7. [DOI] [PubMed] [Google Scholar]
  52. Ng J., Smith S.D.  2014. How traits shape trees: new approaches for detecting character state-dependent lineage diversification. J. Evol. Biol.  27:2035–2045. [DOI] [PubMed] [Google Scholar]
  53. Pybus O.G., Rambaut A., Harvey P.H.  2000. An integrated framework for the inference of viral population history from reconstructed genealogies. Genetics  155:1429–1437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Rabosky D.L., Lovette I.J.  2008a. Density-dependent diversification in North American wood warblers. Proc. R. Soc. B  275:2363–2371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Rabosky D.L., Lovette I.J.  2008b. Explosive evoltuionary radiation: decreasing specaition or increasing extinction through time?  Evolution  62:1866–1875. [DOI] [PubMed] [Google Scholar]
  56. Rasmussen D.A., Stadler T.  2019. Coupling adaptive molecular evolution to phylodynamics using fitness-dependent birth-death models. eLife  8:1–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Raup D.M.  1985. Mathematical models of cladogenesis. Paleobiology  11:42–52. [Google Scholar]
  58. Romero-Severson E.O., Bulla I., Leitner T.  2016. Phylogenetically resolving epidemiologic linkage. Proc. Natl. Acad. Sci. USA  113:2690–2695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Schluter D., Pennell M.W.  2017. Speciation gradients and the distribution of biodiversity. Nature  546:48–55. [DOI] [PubMed] [Google Scholar]
  60. Silvestro D., Schnitzler J., Liow L.H., Antonelli A., Salamin N.  2014. Bayesian estimation of speciation and extinction from incomplete fossil occurrence data. Syst. Biol.  63:349–367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Stadler T.  2009. On incomplete sampling under birth-death models and connections to the sampling-based coalescent. J. Theor. Biol.  261:58–66. [DOI] [PubMed] [Google Scholar]
  62. Stadler T.  2010. Sampling-through-time in birth-death trees. J. Theor. Biol.  267:396–404. [DOI] [PubMed] [Google Scholar]
  63. Stadler T.  2011. Mammalian phylogeny reveals recent diversification rate shifts. Proc. Natl. Acad. Sci. USA  108:6187–6192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Stadler T.  2013. Recovering speciation and extinction dynamics based on phylogenies. J. Evol. Biol.  26:1203–1219. [DOI] [PubMed] [Google Scholar]
  65. Stadler T., Bonhoeffer S.  2013. Uncovering epidemiological dynamics in heterogeneous host populations using phylogenetic methods. Philos. Trans. R. Soc. B  368:20120198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Stadler T., Kouyos R.D., von Wyl V., Yearly S., Böni J.  2012. Estimating the basic reproductive number from viral sequence data. Mol. Biol. Evol.  29:347–357. [DOI] [PubMed] [Google Scholar]
  67. Stadler T., Kühnert D., Bonhoeffer S., Drummond A.J.  2013. Birth-death skyline plot reveals temporal changes of epidemic spread in HIV and hepatitis C virus (HCV). Proc. Natl. Acad. Sci. USA  110:228–233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Stadler T., Vaughan T.G., Gavryushkin A., Guindon S., Kühnert D., Leventhal G.E., Drummond A.J.  2015. How well can the exponential-growth coalescent approximate constant-rate birth-death population dynamics?  Proc. Biol. Sci.  282:20150420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Strimmer K., Pybus O.G.  2001. Exploring the demographic history of DNA sequences using the generalized skyline plot. Mol. Biol. Evol.  18:2298–2305. [DOI] [PubMed] [Google Scholar]
  70. Vaughan T.G., Leventhal G.E., Rasmussen D.A., Drummond A.J., Welch D., Stadler, T.  2019. Estimating epidemic incidence and prevalence from genomic data. Mol. Biol. Evol.  36:1804–1816. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Volz E.M.  2012. Complex population dynamics and the coalescent under neutrality. Genetics  190:187–201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Volz E.M., Frost S.D.W.  2014. Sampling through time and phylodynamic inference with coalescent and birth–death models. J. R. Soc. Interface  11:20140945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Volz E.M., Kosakovsky Pond S.L., Ward M.J., Leigh Brown A.J., Frost S.D.W.  2009. Phylodynamics of infectious disease epidemics. Genetics  183:1421–1430. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Systematic Biology are provided here courtesy of Oxford University Press

RESOURCES