The expected loss of feature diversity (versus phylogenetic diversity) following rapid extinction at the present

Marcus Overwater; Daniel Pelletier; Mike Steel

doi:10.1007/s00285-023-01988-4

2023 Sep 2;87(3):53. doi: 10.1007/s00285-023-01988-4

PMCID: PMC10475005 PMID: 37658909

Abstract

The current rapid extinction of species leads not only to their loss but also the disappearance of the unique features they harbour, which have evolved along the branches of the underlying evolutionary tree. One proxy for estimating the feature diversity (FD) of a set S of species at the tips of a tree is ‘phylogenetic diversity’ (PD): the sum of the branch lengths of the subtree connecting the species in S. For a phylogenetic tree that evolves under a standard birth–death process, and which is then subject to a sudden extinction event at the present (the simple ‘field of bullets’ model with a survival probability of s per species) the proportion of the original PD that is retained after extinction at the present is known to converge quickly to a particular concave function $φ_{PD} (s)$ as t grows. To investigate how the loss of FD mirrors the loss of PD for a birth–death tree, we model FD by assuming that distinct discrete features arise randomly and independently along the branches of the tree at rate r and are lost at a constant rate $ν$ . We derive an exact mathematical expression for the ratio $φ_{FD} (s)$ of the two expected feature diversities (prior to and following an extinction event at the present) as t becomes large. We find that although $φ_{FD}$ has a similar behaviour to $φ_{PD}$ (and coincides with it for $ν = 0$ ), when $ν > 0$ , $φ_{FD} (s)$ is described by a function that is different from $φ_{PD} (s)$ . We also derive an exact expression for the expected number of features that are present in precisely one extant species. Our paper begins by establishing some generic properties of FD in a more general (non-phylogenetic) setting and applies this to fixed trees, before considering the setting of random (birth–death) trees.

Keywords: Feature diversity, Phylogenetic diversity, Birth–death processes, Extinction

Introduction

Phylogenetic trees provide a way to quantify biodiversity and the extent to which it might be lost in the face of the current mass extinction event. One such biodiversity measure is phylogenetic diversity (PD), which associates to each subset S of extant species, the sum of the branch lengths of the underlying evolutionary tree that connects (just) these species to the root of the tree. This measure, pioneered by Faith (1992), provides a more complete measure of biodiversity than merely counting the number of species in S (see e.g. Miller et al. 2018). Moreover, if new features evolve along the branches of a tree and are never lost, then the resulting features present amongst the species in the subset S (the feature diversity (FD) of S) are directly correlated with the phylogenetic diversity of S (Wicke et al. 2021). However, when features are lost, it has recently been shown mathematically that under simple (deterministic or stochastic) models of feature gain and loss, FD necessarily deviates from PD except for very trivial types of phylogenetic trees (Theorems 2 and 3 of Rosindell et al. (2022)). More generally, the question of the extent to which PD captures feature or functional diversity has been the subject of considerable debate in the biological literature (see Devictor et al. 2010; Mazel et al. 2017, 2018, 2019; Owen et al. 2019; Tucker et al. 2018, 2019).

In this paper, we investigate a related question: under a standard phylogenetic diversification model and a simple stochastic process of feature gain and loss, what proportion of feature diversity is expected to be lost in a mass extinction event at the present? And how does this ratio compare with the expected phylogenetic diversity that will be lost? Although the latter ratio (for PD) has been determined in earlier work, here we provide a corresponding result for FD, and show how it differs from PD when the rate of feature loss is non-zero. Our results suggest that the relative loss of FD under a mass extinction event at the present is greater than the relative loss of PD. We also investigate the number of features that are expected to be found in just one species at the present.

We begin by considering the properties of FD in a more general setting based purely on the species themselves (i.e. not involving any underlying phylogenetic tree or network) and then consider FD on fixed phylogenetic trees before presenting the results for (random) birth–death trees. Some of the results of these earlier sections are applied in the later sections.

General properties of feature diversity without reference to phylogenies

This section considers the generic properties of expected FD for sets of species, and thus, no underlying phylogenetic tree or stochastic process that generates a tree, or model of feature evolution is assumed. We mostly follow the notation from Wicke et al. (2021).

Definitions

Let X be a labelled set of species, and for each $x \in X$ , let $F_{x}$ be a non-empty set of discrete features (e.g. genes, genomic inserts, traits) that are associated with species x.

The collection of ordered pairs $F = \{(x, F_{x}) : x \in X\}$ is called a feature assignment on X.
For any subset A of X, let $F (A) = \bigcup_{x \in A} F_{x}$ and let $μ : F (X) \to \mathbb{R}^{> 0}$ be a function assigning some richness or novelty to a feature $f \in F (X)$ .
The feature diversity of a subset A of a set of species X is defined as
$\begin{matrix} F D (A) = \sum_{f \in F (A)} μ (f) . \qquad (2.1) \end{matrix}$

A default option for $μ$ is to set $μ (f) = 1$ , which simply counts the number of features present. Notice that for any subset A of X we have $F D (A) \leq \sum_{x \in A} F D (\{x\}) = \sum_{x \in A} \sum_{f \in F_{x}} μ (f),$ with equality if and only if no features are shared by any two species in A.
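The definition in Eq. (2.1) and the subadditivity inequality can be illustrated with a minimal Python sketch. The function name `feature_diversity` and the toy assignment `toy` are hypothetical and introduced only for illustration; the default weight is $μ (f) = 1$.

```python
from typing import Callable, Dict, Hashable, Iterable, Set

Feature = Hashable
Species = Hashable


def feature_diversity(
    assignment: Dict[Species, Set[Feature]],
    subset: Iterable[Species],
    mu: Callable[[Feature], float] = lambda f: 1.0,
) -> float:
    """FD(A): total weight mu(f) over features held by at least one species in A."""
    present = set()
    for x in subset:
        present |= assignment[x]
    return sum(mu(f) for f in present)


# Hypothetical toy assignment: feature "a" is shared by species x1 and x2.
toy = {"x1": {"a", "b"}, "x2": {"a", "c"}, "x3": {"d"}}

fd_all = feature_diversity(toy, toy)                      # distinct features in X
fd_sum = sum(feature_diversity(toy, [x]) for x in toy)    # sum of single-species FD
```

Because feature "a" is shared, `fd_all` is strictly smaller than `fd_sum`, in line with the inequality above.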

Feature diversity loss under a ‘Field of Bullets’ model of extinction at the present

Definition 2.1

Consider a sudden extinction event taking place across a set of species (Raup 1993). In the generalised field of bullets (g-FOB) model, each species $x \in X$ either survives the extinction event (with probability $s_{x}$ ) or disappears (with probability $1 - s_{x}$ ), and these survival events are assumed to be independent among the species. We write $s (X)$ (or, more briefly, $s$ ) to denote the vector $(s_{x} : x \in X)$ and we let $\mathcal{X}$ denote the (random) subset of X corresponding to the species that survive the extinction event. If $s_{x} = s$ for all $x \in X$ , then we have the simpler field of bullets (FOB) model.

We define the following quantity:

\begin{matrix} φ_{(F, s)} = \frac{F D (\mathcal{X})}{F D (X)}, \end{matrix}

which is the proportion of feature diversity that survives the extinction event. In the case of the FOB model, we use the same notation $φ_{(F, s)}$ , with $s$ now denoting the common survival probability.

Proposition 2.1

For the g-FOB model,

\begin{matrix} E [φ_{(F, s)}] = \sum_{f \in F (X)} \tilde{μ} (f) (1 - \prod_{x : f \in F_{x}} (1 - s_{x})), \end{matrix}

where $\tilde{μ} (f) = \frac{μ (f)}{\sum_{f \in F (X)} μ (f)}$ are the normalised $μ$ values (which sum to 1).

For the FOB model, the equation in Proposition 2.1 simplifies to

\begin{matrix} E [φ_{(F, s)}] = \sum_{f \in F (X)} \tilde{μ} (f) (1 - {(1 - s)}^{n_{f}}), \qquad (2.2) \end{matrix}

where $n_{f} = | \{x \in X : f \in F_{x}\} |$ is the number of species in X that possess feature f.

Proof

We have $φ_{(F, s)} = \sum_{f \in F (X)} \tilde{μ} (f) \cdot I_{f}$ where $I_{f}$ is the Bernoulli variable that takes the value 1 if at least one species in X with feature f survives the g-FOB extinction event, and 0 otherwise. Applying linearity of expectation and noting that $P (I_{f} = 1) = 1 - \prod_{x : f \in F_{x}} (1 - s_{x})$ gives the result. $□$
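The formula in Proposition 2.1 can be checked numerically on small examples. The following Python sketch (all function names and the toy data are hypothetical) evaluates the closed-form expression and compares it with a brute-force sum over every survival pattern, which is feasible only for a handful of species.

```python
from itertools import product
from math import prod


def expected_phi(assignment, s, mu=lambda f: 1.0):
    """E[phi] under g-FOB via Proposition 2.1; s maps species -> survival probability."""
    features = set().union(*assignment.values())
    total = sum(mu(f) for f in features)
    value = 0.0
    for f in features:
        # Probability that every species carrying f goes extinct.
        all_die = prod(1.0 - s[x] for x in assignment if f in assignment[x])
        value += mu(f) / total * (1.0 - all_die)
    return value


def expected_phi_bruteforce(assignment, s, mu=lambda f: 1.0):
    """Same expectation by summing over all 2^n survival patterns (small n only)."""
    species = list(assignment)
    features = set().union(*assignment.values())
    total = sum(mu(f) for f in features)
    value = 0.0
    for pattern in product([0, 1], repeat=len(species)):
        p = prod(s[x] if y else 1.0 - s[x] for x, y in zip(species, pattern))
        alive = set().union(*[assignment[x] for x, y in zip(species, pattern) if y])
        value += p * sum(mu(f) for f in alive) / total
    return value


# Hypothetical toy data: three species with unequal survival probabilities.
toy = {"x1": {"a", "b"}, "x2": {"a", "c"}, "x3": {"d"}}
surv = {"x1": 0.9, "x2": 0.5, "x3": 0.2}
closed_form = expected_phi(toy, surv)
brute_force = expected_phi_bruteforce(toy, surv)
```

The two values agree up to floating-point error, as the linearity-of-expectation argument in the proof predicts.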

Notice that $E [φ_{(F, s)}] = s$ at $s = 0, 1$ . The behaviour between these two extreme values of s is described next.

Proposition 2.2

Under the FOB model, the following hold:

(i)
$E [φ_{(F, s)}] \geq s$ for all $s \in [0, 1]$ .
(ii)
$E [φ_{(F, s)}] = s$ for a value of $s \in (0, 1)$ if and only if the sets $F_{x}$ in $F$ are pairwise disjoint, in which case $E [φ_{(F, s)}] = s$ for all $s \in [0, 1]$ .
(iii)
If the sets $F_{x}$ are not pairwise disjoint, then $E [φ_{(F, s)}]$ is a strictly concave increasing function of s.

Proof

Part (i): Since $n_{f} \geq 1$ in Eq. (2.2) and $1 - {(1 - s)}^{n_{f}} \geq s$ for all $n_{f} \geq 1$ , the claimed inequality is immediate. Part (ii): If $n_{f} = 1$ then $1 - {(1 - s)}^{n_{f}} = s$ for all $s \in [0, 1]$ , and if $n_{f} > 1$ then $1 - {(1 - s)}^{n_{f}} > s$ for every $s \in (0, 1)$ . Part (iii): By Eq. (2.2),

\begin{matrix} \frac{d}{ds} E [φ_{(F, s)}] = \sum_{f \in F (X)} \tilde{μ} (f) n_{f} {(1 - s)}^{n_{f} - 1} > 0, \end{matrix}

and

\begin{matrix} \frac{d^{2}}{d s^{2}} E [φ_{(F, s)}] = - \sum_{f \in F (X) : n_{f} \geq 2} \tilde{μ} (f) n_{f} (n_{f} - 1) {(1 - s)}^{n_{f} - 2} < 0, \end{matrix}

for all $s \in (0, 1)$ , from which the claimed results follow. $□$
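As a concrete illustration of Proposition 2.2 (a hypothetical two-species example, not taken from the source), let $X = \{x, y\}$ with $F_{x} = \{f, g\}$ , $F_{y} = \{f\}$ and $μ (f) = μ (g) = 1$ , so that $n_{f} = 2$ , $n_{g} = 1$ and $\tilde{μ} (f) = \tilde{μ} (g) = 1 / 2$ . Eq. (2.2) then gives:

```latex
\begin{aligned}
E[φ_{(F, s)}] &= \tfrac{1}{2}\bigl(1 - (1 - s)^{2}\bigr) + \tfrac{1}{2}\, s
             = s + \tfrac{1}{2}\, s (1 - s) \;\geq\; s, \\
\frac{d}{ds} E[φ_{(F, s)}] &= \tfrac{3}{2} - s > 0, \qquad
\frac{d^{2}}{ds^{2}} E[φ_{(F, s)}] = -1 < 0 \quad \text{for } s \in (0, 1).
\end{aligned}
```

The excess over s, namely $\frac{1}{2} s (1 - s)$ , comes entirely from the shared feature f and vanishes only at $s = 0$ and $s = 1$ .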

Approximating $φ_{(F, s)}$ by its expected value

For each $n \geq 1$ , let $X_{n}$ be a labelled set of n species with feature assignment $F_{n}$ . Let $\mathcal{X}_{n}$ denote the (random) set of species that remain after a g-FOB extinction event with survival probability vector $s (X_{n})$ , which assigns each $x \in X_{n}$ a corresponding survival probability $s_{n} (x)$ . Note that we make no assumption regarding how the species in $X_{n}$ and $X_{m}$ are related (e.g. they may be disjoint, overlapping or nested), or any a priori relationship between $s (X_{n})$ and $s (X_{m})$ .

In the following result, we provide a sufficient condition under which $φ_{(F_{n}, s_{n})}$ is likely to be close to its expected value $E [φ_{(F_{n}, s_{n})}]$ (readily computed via Proposition 2.1) when the number of species n is large. This condition allows some species to contribute proportionately more FD than other species do on average, and this proportion can grow with n, provided that it does not grow too quickly.

Proposition 2.3

(a)
For $ϵ > 0$ ,
$\begin{matrix} P (|φ_{(F_{n}, s_{n})} - E [φ_{(F_{n}, s_{n})}]| > ϵ) \leq 2 \exp (- \frac{2 ϵ^{2}}{R_{n}}), \end{matrix}$
where $R_{n} = \sum_{x \in X_{n}} {[\frac{F D (\{x\})}{F D (X_{n})}]}^{2}$ .
(b)
If $R_{n} \to 0$ as $n \to \infty$ then $φ_{(F_{n}, s_{n})} - E [φ_{(F_{n}, s_{n})}] \overset{P}{\to} 0$ .
(c)
Let $av F D (X_{n}) = F D (X_{n}) / n$ (the average contribution of each species to the total FD), and suppose that for each $x \in X_{n}$ , $F D (\{x\}) / av F D (X_{n}) \leq B_{n}$ , where $B_{n}^{2} = o (n)$ (e.g. $B_{n} = \sqrt[3]{n}$ ). We then have the following convergence in probability as $n \to \infty$ :
$\begin{matrix} φ_{(F_{n}, s_{n})} - E [φ_{(F_{n}, s_{n})}] \overset{P}{\to} 0 . \end{matrix}$

Proof

Let $Y_{n} = \{Y_{i} : i \in [n]\}$ be a sequence of Bernoulli random variables where each $Y_{i}$ takes the value 1 if species $x_{i}$ survives and 0 otherwise. For the g-FOB model, the random variables $Y_{i}$ are independent. We can write $φ_{(F_{n}, s_{n})} = h (Y_{n})$ where $h (y_{1}, \dots, y_{n})$ is the ratio $\frac{F D (\{x_{i} \in X_{n} : y_{i} = 1\})}{F D (X_{n})} .$ Observe that for any particular value of $i \in \{1, \dots, n\}$ , if we change $y_{i}$ (from 0 to 1 or vice versa) to give $y_{i}^{'}$ then:

\begin{matrix} | h (y_{1}, \dots, y_{i}, \dots, y_{n}) - h (y_{1}, \dots, y_{i}^{'}, \dots, y_{n}) | \leq \frac{F D (\{x_{i}\})}{F D (X_{n})} . \end{matrix}

We now apply McDiarmid’s inequality (McDiarmid 1989) to obtain (for each $ϵ > 0$ ):

\begin{matrix} P (|φ_{(F_{n}, s_{n})} - E [φ_{(F_{n}, s_{n})}]| \geq ϵ) \leq 2 \exp (\frac{- 2 ϵ^{2}}{R_{n}}) . \qquad (2.3) \end{matrix}

This establishes Part (a). Part (b) now follows immediately, and Part (c) follows from Part (b), since the condition in Part (c) implies that

\begin{matrix} R_{n} = \sum_{x \in X_{n}} {[\frac{F D (\{x\})}{n \cdot av F D (X_{n})}]}^{2} \leq \sum_{x \in X_{n}} {(\frac{B_{n}}{n})}^{2} = \sum_{x \in X_{n}} \frac{B_{n}^{2}}{n^{2}} = B_{n}^{2} / n \to 0, \end{matrix}

as $n \to \infty$ . $□$
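To get a feel for how quickly the bound in Proposition 2.3(a) shrinks, the following Python sketch computes $R_{n}$ and the resulting tail bound. The function names are hypothetical, and the example setting (every species has a single private feature of weight 1, so $R_{n} = 1 / n$ ) is an assumption chosen only because it is easy to verify by hand.

```python
from math import exp


def r_n(fd_single, fd_total):
    """R_n = sum over species of (FD({x}) / FD(X_n))^2."""
    return sum((v / fd_total) ** 2 for v in fd_single)


def mcdiarmid_bound(eps, rn):
    """Upper bound 2*exp(-2*eps^2 / R_n) on P(|phi - E[phi]| >= eps), capped at 1."""
    return min(1.0, 2.0 * exp(-2.0 * eps ** 2 / rn))


def bound_for_disjoint(n, eps):
    """Hypothetical setting: n species, one private feature each, mu = 1."""
    # Here FD({x}) = 1 and FD(X_n) = n, so R_n = 1/n.
    return mcdiarmid_bound(eps, r_n([1.0] * n, float(n)))
```

For small n the bound is vacuous (it is capped at 1), while for large n it decays exponentially in n, which is the content of Part (b) in this special case.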

Remark

Note that Proposition 2.3(c) can fail when the condition stated in Part (c) does not hold, even for the simpler FOB model. We provide a simple example to demonstrate this. Let $X_{n} = \{1, \dots, n\}$ and let $F_{i} = \{f\}$ for $i = 1, \dots, n - 1$ and $F_{n} = \{g\}$ where f and g are distinct features with $μ (f) = μ (g) = 1$ . In this case:

\begin{matrix} φ_{(F_{n}, s)} = \begin{cases} 0, & \text{w.p. } {(1 - s)}^{n} ; \\ \frac{1}{2}, & \text{w.p. } {(1 - s)}^{n - 1} s + (1 - {(1 - s)}^{n - 1}) (1 - s) ; \\ 1, & \text{w.p. } (1 - {(1 - s)}^{n - 1}) s . \end{cases} \end{matrix}

Therefore, $φ_{(F_{n}, s)} - E [φ_{(F_{n}, s)}]$ does not converge in probability to 0 (or to any constant) as $n \to \infty$ .
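The failure of concentration in this example can be made quantitative. The Python sketch below (function names are hypothetical) evaluates the three probabilities from the Remark and the variance of $φ_{(F_{n}, s)}$ ; a short calculation shows that this variance tends to $s (1 - s) / 4$ rather than 0 as n grows.

```python
def phi_distribution(n, s):
    """P(phi = 0), P(phi = 1/2), P(phi = 1) for the two-feature example."""
    none_of_f = (1.0 - s) ** (n - 1)      # all n-1 species carrying f die
    p0 = (1.0 - s) ** n
    p_half = none_of_f * s + (1.0 - none_of_f) * (1.0 - s)
    p1 = (1.0 - none_of_f) * s
    return p0, p_half, p1


def phi_variance(n, s):
    """Variance of phi, computed from the three-point distribution."""
    p0, p_half, p1 = phi_distribution(n, s)
    mean = 0.5 * p_half + p1
    second_moment = 0.25 * p_half + p1
    return second_moment - mean ** 2
```

For large n, whether the single species carrying g survives (probability s) decides between the values 1 and 1/2, so the ratio keeps a non-degenerate two-point limit.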

Consequences for phylogenetic diversity

Proposition 2.2 and Proposition 2.3 provide a simple way to derive certain results concerning phylogenetic diversity, both on rooted trees and on rooted phylogenetic networks (specifically for the subNet diversity measure described in Wicke and Fischer (2018)). To each edge e of a rooted phylogenetic tree (or network) with leaf set X, associate some unique feature $f_{e}$ and give it the value $μ (f_{e}) = ℓ (e)$ , where $ℓ (e)$ is the length of edge e in the tree (or network). Each leaf x is assigned the set $F_{x}$ of features $f_{e}$ for the edges e that lie on a path from the root to x. For any subset Y of X we then have:

\begin{matrix} P D (Y) = F D (Y) . \end{matrix}

It follows that for the simple field of bullets models, PD satisfies the concavity properties described in Proposition 2.2, where the condition that the sets $F_{x}$ are pairwise disjoint corresponds to the tree being a star tree. These results were established for rooted trees by specific tree-based arguments (see e.g. Sect. 5 of Lambert and Steel (2013)), but they directly follow from the more general framework above, and extend beyond trees.

Using this same link between PD and FD, Proposition 2.3 provides a further application to any sequence of rooted phylogenetic trees $T_{n}$ with n leaves and ultrametric edge lengths. For example, the ratio of surviving PD to original PD under the FOB model converges in probability to the expected value of this ratio for a sequence of trees $T_{n}$ if the total PD of $T_{n}$ grows at least as fast as $n L / n^{β}$ , where L is the height of the tree and $0 < β < 1 / 2$ . This condition holds, for example, for Yule trees (Stadler and Steel 2012).
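The identification of PD with FD used in this section can be sketched in a few lines of Python for the tree case. The tree encoding `parent`, the node names and the function names are hypothetical; each edge is named by its child endpoint and plays the role of the feature $f_{e}$ with weight $ℓ (e)$ .

```python
# Hypothetical rooted tree given as child -> (parent, edge length); "root" has no entry.
parent = {
    "u": ("root", 1.0),
    "a": ("u", 2.0),
    "b": ("u", 2.0),
    "c": ("root", 3.0),
}


def edge_features(leaf):
    """F_x: the edges (named by child endpoint) on the path from leaf x to the root."""
    path = set()
    node = leaf
    while node in parent:
        path.add(node)
        node = parent[node][0]
    return path


def pd(leaves):
    """PD(Y) computed as FD(Y) with mu(f_e) equal to the length of edge e."""
    edges = set()
    for x in leaves:
        edges |= edge_features(x)
    return sum(parent[e][1] for e in edges)
```

Leaves a and b share the edge above u, so `pd(["a", "b"])` counts that edge once, exactly as a shared feature is counted once in FD.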

The feature diversity ratio $φ$ for a model of feature evolution on a phylogenetic tree

Consider a rooted binary phylogenetic tree $T_{n}$ , in which each edge has a positive length that corresponds to a temporal duration, with the root $ρ$ of $T_{n}$ being placed at the top of a stem edge at time 0, and with each leaf in the leaf set $X_{n} = \{x_{1}, \dots, x_{n}\}$ of $T_{n}$ being placed at time t (as in Fig. 1). For convenience, we will assume in this section that $μ (f) = 1$ for all features; however, this assumption can be relaxed (e.g. by allowing $μ (f)$ to take values in a fixed interval [a, b] where $a > 0$ according to some fixed distribution, and independently between features) without altering the results significantly.

Fig. 1 — An example of feature gain and loss on a tree and the impact of extinction at the present. *Left:* New features arise (indicated by $+$ ) and disappear (indicated by −) along the branches of the tree. To simplify this example, no features are present at time 0; however, our results do not require this restriction. In total, there are five features present amongst the leaves of the tree at time t, namely ${α, β, γ, δ, ϵ}$ . *Right:* An extinction event at the present (denoted by $†$ ) results in the loss of three of the extant species, leaving just three features being present among the leaves of the resulting pruned tree, namely ${α, β, γ}$ . Thus, the ratio of surviving features to total features is $3 / 5 = 0.6$

We let $F_{ρ}$ denote the (possibly empty) set of features present at time 0 (i.e. at the top of the stem edge), and we assume throughout this section that $| F_{ρ} |$ is bounded by some fixed constant B, independent of n.

On $T_{n}$ , we apply a stochastic process in which (discrete) features arise independently along the branches of this tree at rate r, and each feature that arises is novel (i.e. it has not appeared earlier elsewhere in the tree). Once a feature arises, it is then carried forward in time along the branches of $T_{n}$ (and is passed on to the two lineages arising at any speciation event). In addition, any feature can be lost from a lineage at any point according to a continuous-time pure-death process that operates at rate $ν$ . This model was investigated in a different setting in Huson and Steel (2004) and studied more recently in Rosindell et al. (2022). Under this process, each leaf x of $T_{n}$ will have a (possibly empty) set of features ( $F_{x}$ ). Fig. 1 illustrates the processes described. Note that $| F_{x} |$ (for any $x \in X_{n}$ ) and $F D (X_{n})$ are now random variables.

Let $N_{ℓ}$ denote the number of features at the end of any path P in $T_{n}$ that starts at time 0 and ends at time $ℓ$ . Then $N_{ℓ}$ is described by a continuous-time Markov process that has a constant birth rate and a linear death rate. It is then a classical result (Feller 1950) that $N_{ℓ}$ has expected value $\frac{r}{ν} (1 - e^{- ν ℓ}) + F_{0} e^{- ν ℓ}$ , where $F_{0} = | F_{ρ} |$ (the number of features present at time 0), and $N_{ℓ}$ converges to a Poisson distribution with mean $\frac{r}{ν}$ as $ℓ$ grows. Moreover, if $N_{0} = 0$ then $N_{ℓ}$ has a Poisson distribution for any value of $ℓ > 0$ (Feller 1950, p. 461), and so, regardless of the value of $F_{ρ}$ , the random variable $| F_{x} \setminus F_{ρ} |$ has a Poisson distribution with mean $\frac{r}{ν} (1 - e^{- ν ℓ (x)})$ where $ℓ (x)$ is the length of the path from the root to leaf x. The (random) number of features at any leaf x of $T_{n}$ (i.e. $| F_{x} |$ ) has the same distribution as $N_{ℓ (x)}$ . Note that for distinct leaves x and y of $T_{n}$ , the random variables $| F_{x} |$ and $| F_{y} |$ are not independent.
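The stated mean of $N_{ℓ}$ can be recovered from a simple differential equation, sketched here under the assumption $ν > 0$ (the symbol $m (ℓ) = E [N_{ℓ}]$ is introduced only for this illustration). Features are gained at constant rate r and each existing feature is lost at rate $ν$ , so:

```latex
\begin{aligned}
\frac{d}{dℓ} m(ℓ) &= r - ν\, m(ℓ), \qquad m(0) = F_{0}, \\
m(ℓ) &= \frac{r}{ν}\bigl(1 - e^{-νℓ}\bigr) + F_{0}\, e^{-νℓ}
\;\longrightarrow\; \frac{r}{ν} \quad \text{as } ℓ \to \infty .
\end{aligned}
```

The first term counts features that arose along the path and survived to time $ℓ$ ; the second counts the initial features that have not yet been lost.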

Notational convention: Henceforth we will write $F D (T_{n})$ in place of $F D (X_{n})$ and we will also write $F D (T_{n}^{s})$ in place of $F D (\mathcal{X}_{n})$ when $\mathcal{X}_{n}$ is the subset of the set of leaves $X_{n}$ of $T_{n}$ that survive under an FOB model with survival probability s. As before, $F_{ρ}$ denotes the set of features present at the root vertex $ρ$ at the top of the stem edge.

Lemma 3.1

Set $F_{ρ} = \emptyset$ . Then for any value of $n \geq 1$ the following hold:

(i)
The random variable $F D (T_{n})$ has a Poisson distribution.
(ii)
The expected value of $F D (T_{n})$ satisfies the following bound:
$\begin{matrix} E [F D (T_{n})] \leq \frac{r}{ν} n . \end{matrix}$

Proof

Part (i): We use induction on n. Since $F_{ρ} = \emptyset$ , the result for base case ( $n = 1$ ) holds by the results mentioned in the previous paragraph. Thus, suppose that $n \geq 2$ , and let $T_{n}$ be a binary tree with n leaves and with $F_{ρ} = \emptyset$ . Then:

\begin{matrix} F D (T_{n}) = F D (T^{1}) + F D (T^{2}) + G , \end{matrix}

where the trees $T^{i}$ (with $i = 1, 2$ ) are subtrees of $T_{n}$ obtained by deleting the stem edge and its endpoints (and setting the set of features at the top of the stem edge of $T^{1}$ and $T^{2}$ equal to the empty set), and G is the number of features that arise on the stem edge of $T_{n}$ and are present in at least one leaf of $T_{n}$ .

Notice that $F D (T^{1})$ , $F D (T^{2})$ and G are independent random variables, and, by induction, $F D (T^{1})$ and $F D (T^{2})$ each have a Poisson distribution. Conditional on the number X of features that are present at the end of the stem edge, G has a binomial distribution with parameters X and p, where p is the probability that a single feature present at the end of the stem edge is present in at least one leaf of $T_{n}$ . Since X has a Poisson distribution, and the sum of a Poisson number of independent Bernoulli random variables with a common success probability is again Poisson, G has a Poisson distribution. It follows that $F D (T_{n})$ , being the sum of three independent Poisson variables, also has a Poisson distribution. This establishes the induction step and thus Part (i).

Part (ii): We have:

\begin{matrix} F D (T_{n}) = |\bigcup_{x \in X_{n}} F_{x}| \leq \sum_{x \in X_{n}} | F_{x} | . \end{matrix}

Now, for each $x \in X_{n}$ , we have $E [| F_{x} |] = \frac{r}{ν} (1 - e^{- ν L_{x}}) \leq \frac{r}{ν}$ where $L_{x}$ denotes the length of the path from the top of the stem edge of $T_{n}$ to the leaf x. Thus $E [F D (T_{n})] \leq \frac{r}{ν} \cdot n .$ $□$
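The thinning step used in Part (i) of the proof can be checked directly. In the following sketch, m denotes the mean of the Poisson variable X (a symbol introduced here only for illustration) and p is as in the proof:

```latex
\begin{aligned}
P(G = k) &= \sum_{j \geq k} e^{-m} \frac{m^{j}}{j!} \binom{j}{k} p^{k} (1 - p)^{j - k}
          = e^{-m} \frac{(mp)^{k}}{k!} \sum_{i \geq 0} \frac{(m (1 - p))^{i}}{i!} \\
         &= e^{-m} \frac{(mp)^{k}}{k!}\, e^{m (1 - p)}
          = e^{-mp} \frac{(mp)^{k}}{k!} .
\end{aligned}
```

Hence G has a Poisson distribution with mean mp.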

Example ( $n = 2$ ) Consider the process described on $T_{2}$ with $F_{ρ} = \emptyset$ . Let $ℓ_{0}$ denote the length of the stem edge, and $ℓ$ the length of each of the two pendant edges. Then $F D (T_{2})$ has a Poisson distribution with expected value

\begin{matrix} \frac{r}{ν} (1 - e^{- ν ℓ_{0}}) (1 - {(1 - e^{- ν ℓ})}^{2}) + 2 \frac{r}{ν} (1 - e^{- ν ℓ}) \end{matrix}

and $F D (T_{2}^{s})$ has expected value

\begin{matrix} s^{2} E [F D (T_{2})] + 2 s (1 - s) [\frac{r}{ν} (1 - e^{- ν ℓ_{0}}) e^{- ν ℓ} + \frac{r}{ν} (1 - e^{- ν ℓ})] . \end{matrix}

In particular,

\begin{matrix} \frac{E [F D (T_{2}^{s})]}{E [F D (T_{2})]} = s \cdot [s + 2 (1 - s) \frac{1 - e^{- ν (ℓ_{0} + ℓ)}}{(1 - e^{- ν ℓ_{0}}) (1 - {(1 - e^{- ν ℓ})}^{2}) + 2 (1 - e^{- ν ℓ})}] \end{matrix}

When $ℓ_{0} = 0$ , the right-hand side of this equation equals s, but for all other values it is strictly greater than s. Moreover, by differentiating $\frac{E [F D (T_{2}^{s})]}{E [F D (T_{2})]}$ it can be verified that this ratio is monotone decreasing as $ν$ increases for all positive values of $ℓ$ and $ℓ_{0}$ ; in particular, for $s \in (0, 1)$ , and $ν > 0$ , this ratio is always less than the expected proportion of PD that survives in $T_{2}$ .
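The claims in this example are easy to explore numerically. The Python sketch below evaluates the displayed ratio for $T_{2}$ ; the function names are hypothetical. The comparison function `pd_ratio_t2` is not given in the text: it is derived here under the assumption that the PD of $T_{2}$ is $ℓ_{0} + 2 ℓ$ and the PD of a single surviving leaf is $ℓ_{0} + ℓ$ , which is also the limit of the FD ratio as $ν \to 0$ .

```python
from math import exp


def fd_ratio_t2(s, nu, ell0, ell):
    """E[FD(T_2^s)] / E[FD(T_2)] for stem length ell0 and pendant length ell (r cancels)."""
    stem = 1.0 - exp(-nu * ell0)            # stem features alive at the end of the stem
    pend = 1.0 - exp(-nu * ell)             # features arising on one pendant edge and surviving
    both = stem * (1.0 - pend ** 2) + 2.0 * pend   # E[FD(T_2)] up to the factor r/nu
    single = 1.0 - exp(-nu * (ell0 + ell))  # expected features at one leaf, up to r/nu
    return s * (s + 2.0 * (1.0 - s) * single / both)


def pd_ratio_t2(s, ell0, ell):
    """Expected surviving proportion of PD on T_2 (assumed form, see lead-in)."""
    return s * (s + 2.0 * (1.0 - s) * (ell0 + ell) / (ell0 + 2.0 * ell))
```

For instance, with $s = 0.5$ and $ℓ_{0} = ℓ = 1$ , increasing $ν$ from 1 to 2 lowers the FD ratio, and both values lie strictly between s and the PD ratio.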

A limit result for sequences of trees

The main result of this section is Theorem 3.1, and its proof relies on establishing a sequence of preliminary lemmas.

Lemma 3.2

Let $β > 0$ be a fixed constant. Given a sequence $T_{n}$ of trees with leaf set $X_{n}$ , let $E_{n}$ be the event that $| F_{x} | \leq n^{β}$ for every $x \in X_{n}$ . Then $\lim_{n \to \infty} P (E_{n}) = 1$ .

Proof

We combine the Bonferroni inequality with a standard right-tail probability bound for a Poisson variable. Firstly, observe that:

\begin{matrix} P (E_{n}) \geq 1 - \sum_{x \in X_{n}} P (| F_{x} | > n^{β}) = 1 - n P (| F_{x_{1}} | > n^{β}) . \qquad (3.1) \end{matrix}

Now, $| F_{x_{1}} |$ can be written as the sum of two independent random variables $U_{x_{1}} + V_{x_{1}}$ where $U_{x_{1}}$ has a Poisson distribution with mean $m \leq r / ν$ , and $V_{x_{1}} \leq | F_{ρ} | \leq B$ with probability 1 ( $U_{x_{1}}$ counts the features at $x_{1}$ if $F_{ρ} = \emptyset$ , $V_{x_{1}}$ counts the features in $F_{ρ}$ that remain present at $x_{1}$ , and B is the global bound on $| F_{ρ} |$ described near the start of Sect. 3). The Chernoff bound on the right-hand tail of a Poisson variable (Mitzenmacher and Upfal 2005, p. 97) gives $P (U_{x_{1}} > n^{β}) \leq {(\frac{em}{n^{β}})}^{n^{β}} e^{- m}$ once $n^{β} > m$ . Since $| F_{x_{1}} | > n^{β}$ implies $U_{x_{1}} > n^{β} - B$ , the same bound with $n^{β}$ replaced by $n^{β} - B$ shows that $n P (| F_{x_{1}} | > n^{β}) \to 0$ as n grows. Applying this to the inequality in (3.1) establishes the result. $□$
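The speed at which the union bound in this proof vanishes can be illustrated numerically. The Python sketch below evaluates n times the Chernoff bound for the Poisson part only; it ignores the bounded contribution of the root features (that is, it assumes $F_{ρ} = \emptyset$ ), and the function name and parameter values are hypothetical. The bound is informative only when $n^{β}$ exceeds m.

```python
from math import exp, log


def union_tail_bound(n, beta, m):
    """n * (e*m / n^beta)^(n^beta) * exp(-m), evaluated in log space."""
    x = n ** beta
    # log of the bound: log n + x * (1 + log m - log x) - m
    log_bound = log(n) + x * (1.0 + log(m) - log(x)) - m
    return exp(log_bound)
```

With $β = 1 / 3$ and $m = 2$ , the bound is already below 1 for $n = 10^{3}$ and is astronomically small for $n = 10^{6}$ , reflecting the super-exponential decay in $n^{β}$ .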

Lemma 3.3

Let $(T_{n}, n \geq 1)$ be a sequence of rooted binary trees, with $T_{n}$ having leaf set $X_{n}$ , and suppose that $E [F D (T_{n})] \geq c n$ for some constant $c > 0$ . Then $\frac{F D (T_{n})}{E [F D (T_{n})]}$ converges in probability to 1 as $n \to \infty$ .

Proof

First observe that we can write $F D (T_{n})$ as the sum of two independent random variables, namely $F D_{0} (T_{n}) + K$ , where $F D_{0} (T_{n})$ is the FD value when $F_{ρ} = \emptyset$ , and K is the number of features present at the root that are also present in at least one leaf. In particular, $K \leq | F_{ρ} |$ , which is assumed to be bounded by a constant B (independent of n). Thus

\begin{matrix} Var [F D (T_{n})] = Var [F D_{0} (T_{n})] + Var [K] \leq Var [F D_{0} (T_{n})] + B . \end{matrix}

By Lemma 3.1, $Var [F D_{0} (T_{n})] = E [F D_{0} (T_{n})] \leq \frac{r}{ν} \cdot n$ , so $Var [F D (T_{n})] \leq \frac{r}{ν} \cdot n + B$ , and thus:

\begin{matrix} Var [\frac{F D (T_{n})}{n}] = Var [F D (T_{n})] / n^{2} \to 0, \qquad (3.2) \end{matrix}

as $n \to \infty$ . We now apply Chebyshev’s inequality to obtain:

\begin{matrix} P (|\frac{F D (T_{n})}{E [F D (T_{n})]} - 1| \geq ϵ) \leq Var [\frac{F D (T_{n})}{E [F D (T_{n})]}] / ϵ^{2}, \qquad (3.3) \end{matrix}

and since

\begin{matrix} \frac{F D (T_{n})}{E [F D (T_{n})]} = \frac{F D (T_{n})}{n} \cdot \frac{n}{E [F D (T_{n})]} \end{matrix}

we have:

\begin{matrix} Var [\frac{F D (T_{n})}{E [F D (T_{n})]}] = Var [\frac{F D (T_{n})}{n}] \cdot {(\frac{n}{E [F D (T_{n})]})}^{2} \end{matrix}

and so the term on the right of Eq. (3.3) is bounded above by $Var [\frac{F D (T_{n})}{n}] \cdot \frac{1}{ϵ^{2} c^{2}}$ , which converges to 0 as $n \to \infty$ by Eq. (3.2). $□$
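Chaining the inequalities in this proof gives an explicit rate of convergence (a direct consequence of the bounds above, written out here for convenience):

```latex
P\left( \left| \frac{FD(T_{n})}{E[FD(T_{n})]} - 1 \right| \geq ϵ \right)
\;\leq\; \frac{Var[FD(T_{n})]}{ϵ^{2}\, E[FD(T_{n})]^{2}}
\;\leq\; \frac{\frac{r}{ν}\, n + B}{ϵ^{2} c^{2} n^{2}}
\;=\; O\!\left(\frac{1}{n}\right).
```

The linear lower bound $E [F D (T_{n})] \geq c n$ is what makes the denominator grow faster than the variance in the numerator.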

Lemma 3.4

Let $(T_{n}, n \geq 1)$ be a sequence of rooted binary trees, with $T_{n}$ having leaf set $X_{n}$ , and suppose that $E [F D (T_{n})] \geq c n$ for some constant $c > 0$ . Suppose that for each $x \in X_{n}$ , $F D ({x}) \leq B_{n}$ , where $B_{n}^{2} = o (n)$ . Then $\frac{F D (T_{n}^{s})}{E [F D (T_{n}^{s})]} \overset{P}{\to} 1$ as $n \to \infty$ .

Proof

Let $W_{n} = H (Y_{n})$ , where $Y_{n} = \{Y_{i} : i \in [n]\}$ is the sequence of Bernoulli random variables with $Y_{i} = 1$ if species $x_{i}$ survives the FOB extinction event and $Y_{i} = 0$ otherwise, and $H (y_{1}, \dots, y_{n})$ is the ratio $\frac{F D (\{x_{i} \in X_{n} : y_{i} = 1\})}{E [F D (T_{n}^{s})]}$ , where the numerator is as defined in Eq. (2.1) with $μ (f) = 1$ for all f. Observe that for any particular value of $i \in \{1, \dots, n\}$ , if we change $y_{i}$ (from 0 to 1 or vice versa) to give $y_{i}^{'}$ then:

\begin{matrix} | H (y_{1}, \dots, y_{i}, \dots, y_{n}) - H (y_{1}, \dots, y_{i}^{'}, \dots, y_{n}) | & \leq F D (\{x_{i}\}) / E [F D (T_{n}^{s})] \\ & \leq B_{n} / E [F D (T_{n}^{s})] . \end{matrix}

We now apply McDiarmid’s inequality to obtain (for each $ϵ > 0$ ):

\begin{matrix} P (| W_{n} - 1 | \geq ϵ) \leq 2 \exp (\frac{- 2 ϵ^{2} E {[F D (T_{n}^{s})]}^{2}}{n B_{n}^{2}}) . \qquad (3.4) \end{matrix}

Now, $E [F D (T_{n}^{s})] \geq s \cdot E [F D (T_{n})]$ by Proposition 2.2(i) (taking $μ (f) = 1$ for all f) and, by assumption, $E [F D (T_{n})] \geq c n$ . Thus we obtain:

\begin{matrix} P (| W_{n} - 1 | \geq ϵ) \leq 2 exp (\frac{- 2 ϵ^{2} c^{2} s^{2} n}{B_{n}^{2}}) . \end{matrix}

Therefore, $P (| W_{n} - 1 | \geq ϵ) \to 0$ as $n \to \infty$ . Since this holds for all $ϵ > 0$ , we obtain the claimed result. $□$

We can now state the main result of this section. Recall that $F D (T_{n}^{s})$ is the number of features present among leaves of $T_{n}$ that survive the FOB extinction event.

Theorem 3.1

Let $(T_{n}, n \geq 1)$ be a sequence of binary trees and let features evolve on $T_{n}$ according to the stochastic feature evolution process described. If $E [F D (T_{n})] \geq c n$ for some constant $c > 0$ , and if $\frac{E [F D (T_{n}^{s})]}{E [F D (T_{n})]}$ converges to a constant $c_{s}$ as n grows, then:

\begin{matrix} \frac{F D (T_{n}^{s})}{F D (T_{n})} \overset{P}{\to} c_{s} \end{matrix}

as $n \to \infty$ .

Before we proceed to the proof, we provide the following comments.

Remarks

Theorem 3.1 can fail without the condition $E [F D (T_{n})] \geq c n$ . Fig. 2 provides a simple example of a sequence of trees $T_{n}$ for which $\frac{F D (T_{n}^{s})}{F D (T_{n})}$ does not converge in probability to any constant value as n grows. In this example, $F_{ρ} = \emptyset$ and the tree has height $2 ℓ$ with one leaf having an incident edge of length $ℓ$ and the remaining $n - 1$ leaves having incident edges of length 1/n.
A sufficient condition for the inequality $E [F D (T_{n})] \geq c n$ in Theorem 3.1 to hold is that for some $ϵ > 0$ and $δ > 0$ the proportion of pendant edges of $T_{n}$ of length $\geq ϵ$ is at least $δ$ for all $n \geq 1$ . Briefly, the reason for this is that the expected number of features arising on each of these pendant edges and surviving to the end of this edge is bounded away from 0 and the features associated with distinct pendant edges are always different from each other, and different from any other features arising in the tree.

Fig. 2 — For the tree $T_{n}$ shown (with $ℓ > 1$ fixed and $s \in (0, 1)$ ) the sequence of trees $(T_{n}, n \geq 2)$ has the property that $\frac{F D (T_{n}^{s})}{F D (T_{n})}$ does not converge in probability to any constant value as n grows

Proof of Theorem 3.1

Let $A_{ϵ} (n)$ be the event that $|\frac{F D (T_{n}^{s})}{E [F D (T_{n}^{s})]} - 1| < ϵ$ , and let $E_{n}$ be the event described in Lemma 3.2 with $β = \frac{1}{3}$ . Then, $\lim_{n \to \infty} P (A_{ϵ} (n) | E_{n}) = 1$ , by Lemma 3.4 (taking $B_{n} = n^{1 / 3}$ , so that $B_{n}^{2} = o (n)$ ). Now,

\begin{matrix} P (A_{ϵ} (n)) = P (A_{ϵ} (n) | E_{n}) P (E_{n}) + P (A_{ϵ} (n) | \bar{E_{n}}) P (\bar{E_{n}}), \end{matrix}

and since $P (E_{n}) \to 1$ as $n \to \infty$ (by Lemma 3.2) we have $\lim_{n \to \infty} P (A_{ϵ} (n)) = 1$ . Since this holds for all $ϵ > 0$ , it follows that $\frac{F D (T_{n}^{s})}{E [F D (T_{n}^{s})]} \overset{P}{\to} 1$ as n grows.

Moreover, from Lemma 3.3, we have $\frac{F D (T_{n})}{E [F D (T_{n})]} \overset{P}{\to} 1 .$ Now we can write $\frac{F D (T_{n}^{s})}{F D (T_{n})}$ as follows:

\begin{matrix} \frac{F D (T_{n}^{s})}{F D (T_{n})} = \frac{F D (T_{n}^{s})}{E [F D (T_{n}^{s})]} \cdot \frac{E [F D (T_{n})]}{F D (T_{n})} \cdot \frac{E [F D (T_{n}^{s})]}{E [F D (T_{n})]} \end{matrix}

The first two terms on the right of this equation each converge in probability to 1 as $n \to \infty$ , whereas the third (deterministic) term converges to $c_{s}$ . This completes the proof. $□$

We will apply Theorem 3.1 in the next section to establish a result for FD loss on birth–death trees.

Feature diversity ratios in birth–death trees

In this section, we continue to investigate the stochastic model of feature gain and loss, but rather than considering fixed trees, we will now allow the trees themselves to be stochastically generated, following the simple birth–death processes that are common in phylogenetics. Thus, there will now be three stochastic processes in play: the linear-birth/linear-death process that generates the tree, the constant-birth/linear-death process of feature gain and loss operating along the branches of the tree, and the simple FOB extinction event at the present.

Definitions

Let $T_{t}$ denote a birth–death tree grown from a single lineage for time t with birth and death parameters $λ$ and $μ$ , respectively. We will assume throughout that $λ > μ$ (since otherwise the tree is guaranteed to die out as t becomes large).

On $T_{t}$ , we impose the model of feature gain and loss from the previous section with parameters r and $ν$ . We now apply the FOB model in which each extant species (i.e. each leaf of $T_{t}$ that is present at time t) has a probability $s > 0$ of surviving and $1 - s$ of becoming extinct (independently across the extant species), and we let $T_{t}^{s}$ denote the tree obtained from $T_{t}$ by removing the species at the present that did not survive this process. We refer to $T_{t}^{s}$ as the pruned tree, and to the leaves of $T_{t}$ and $T_{t}^{s}$ that are present at time t as the extant species (or leaves) of these trees (to distinguish them from leaves of $T_{t}$ that lie at the endpoints of extinct lineages). If $s < 1$ , there may now be fewer features present among the (probably reduced number of) extant species in $T_{t}^{s}$ than there were in $T_{t}$ .

Let $F D (T_{t})$ be the discrete random variable that counts the number of features present in at least one of the extant leaves of $T_{t}$ , and let $F D (T_{t}^{s})$ denote the number of these features that are also present in at least one extant leaf in the pruned tree $T_{t}^{s}$ .

Expected feature diversity

Next, we consider the expected values of $F D (T_{t})$ and $F D (T_{t}^{s})$ , where this expectation is across all three processes (the birth–death process that generates $T_{t}$ , feature gain and loss on this tree, and the species that survive the extinction event at the present under the FOB model). Of particular interest is the ratio of these expectations, and their limit as t becomes large. Specifically, let:

\begin{matrix} φ_{FD} (t, s) = \frac{E [F D (T_{t}^{s})]}{E [F D (T_{t})]} \quad \text{and} \quad φ_{FD} (s) = \lim_{t \to \infty} φ_{FD} (t, s) . \end{matrix}

Note that $φ_{FD} (s)$ is a function of five parameters ( $s, r, λ, μ, ν$ ); however, we will show that it is just a function of s and two other parameters. Notice also that once these parameters are fixed, $φ_{FD} (t, s)$ and $φ_{FD} (s)$ are monotone increasing functions of s taking the value 0 at $s = 0$ and 1 at $s = 1$ . Moreover, $φ_{FD} (t, s)$ and $φ_{FD} (s)$ are both independent of r (the rate at which features arise along any lineage) as we formally show shortly.

Relationship to phylogenetic diversity (PD)

Recall that for a rooted phylogenetic tree T with branch lengths, the PD value of a subset S of leaves (PD(S, T)) is the sum of the lengths of the edges of the subtree of T that connect S and the root of the tree.
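The definition of PD can be made concrete with a small sketch. The function name, the dictionary-based tree encoding, and the toy tree below are all illustrative assumptions, not taken from the paper.

```python
def phylogenetic_diversity(parent, length, subset):
    """Sum of the lengths of all edges on paths from the leaves in `subset` to the root.

    parent[v] -> parent of node v (the root has no entry);
    length[v] -> length of the edge joining v to parent[v].
    """
    visited = set()  # nodes whose parent edge has already been counted
    total = 0.0
    for leaf in subset:
        node = leaf
        # Walk towards the root, stopping at the root or at an edge already counted.
        while node in parent and node not in visited:
            visited.add(node)
            total += length[node]
            node = parent[node]
    return total


# Hypothetical toy tree: root -> A (1.0), root -> x (2.0), x -> B (1.0), x -> C (3.0)
parent = {"A": "root", "x": "root", "B": "x", "C": "x"}
length = {"A": 1.0, "x": 2.0, "B": 1.0, "C": 3.0}
print(phylogenetic_diversity(parent, length, {"B"}))       # 3.0
print(phylogenetic_diversity(parent, length, {"B", "C"}))  # 6.0
```

The shared edge above x is counted only once for {B, C}, which is why PD is not additive across species.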

In the special case where $ν = 0$ , and where no features are present at time 0, $F D (T_{t}^{s})$ (conditioned on $T_{t}$ ), has a Poisson distribution with a mean of r times $P D (S_{t}, T_{t})$ , where $S_{t}$ is the (random) set of leaves at time t that survive the extinction event at the present. Consequently, $E [F D (T_{t}^{s}) | T_{t}] = E [r P D (S_{t}, T_{t})] = r E [P D (S_{t}, T_{t})]$ . Similarly, $E [F D (T_{t}) | T_{t}] = r E [P D (L_{t}, T_{t})]$ , where $L_{t}$ is the set of extant leaves of $T_{t}$ . Thus, in this special case we have $φ_{FD} (s) = φ_{PD} (s)$ , where:

\begin{matrix} φ_{PD} (s) = \lim_{t \to \infty} \frac{E [P D (S_{t}, T_{t})]}{E [P D (L_{t}, T_{t})]} . \end{matrix}

The function $φ_{PD} (s)$ was explicitly determined in Lambert and Steel (2013), Mooers et al. (2012) as follows:

\begin{matrix} φ_{PD} (s) = \begin{cases} \frac{ρ s}{ρ + s - 1} \cdot \frac{\ln (s / (1 - ρ))}{\ln (1 / (1 - ρ))}, & \text{if } ρ = μ / λ \neq 0, 1 - s ; \\ - s \ln (s) / (1 - s), & \text{if } ρ = 0 \text{ (i.e. a Yule tree)} ; \\ (1 - s) / \ln (1 / s), & \text{if } ρ = 1 - s . \end{cases} \qquad (4.1) \end{matrix}
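For readers who wish to evaluate Eq. (4.1) numerically, the following is a minimal sketch. The function name `phi_pd` and the tolerance used to detect the boundary case $ρ = 1 - s$ are our own assumptions.

```python
import math


def phi_pd(s, rho):
    """Limiting proportion of expected PD retained, following Eq. (4.1).

    s   : survival probability, 0 < s <= 1
    rho : mu / lambda, 0 <= rho < 1
    """
    if not (0.0 < s <= 1.0) or not (0.0 <= rho < 1.0):
        raise ValueError("need 0 < s <= 1 and 0 <= rho < 1")
    if s == 1.0:
        return 1.0  # nothing is lost when every species survives
    if rho == 0.0:
        # Yule tree case
        return -s * math.log(s) / (1.0 - s)
    if math.isclose(rho, 1.0 - s, abs_tol=1e-12):
        # Removable singularity of the general formula (tolerance is an assumption)
        return (1.0 - s) / math.log(1.0 / s)
    # General case
    return (rho * s / (rho + s - 1.0)) * math.log(s / (1.0 - rho)) / math.log(1.0 / (1.0 - rho))
```

For a Yule tree with $s = 0.5$ the second case reduces to $\ln 2$, and the third case is the continuous extension of the first as $ρ \to 1 - s$.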

Calculating $φ_{FD} (s)$

We first recall a standard result from birth–death theory. Consider a linear birth–death process (starting with a single individual at time 0), with a birth rate $λ$ , a death rate $θ$ . For the individuals present at time t, sample each individual independently with sampling probability $s > 0$ . Let $X_{t}$ ( $t \geq 0$ ) denote the number of these sampled individuals and let $R_{t}^{s} (λ, θ)$ be the probability that $X_{t} > 0$ . Then

\begin{matrix} R_{t}^{s} (λ, θ) = \begin{cases} \frac{s (λ - θ)}{s λ + (λ (1 - s) - θ) e^{(θ - λ) t}}, & \text{if } λ \neq θ ; \\ \frac{s}{1 + λ s t}, & \text{if } λ = θ . \end{cases} \qquad (5.1) \end{matrix}

In particular, as $t \to \infty$ , $R_{t}^{s} (λ, θ)$ converges to 0 if $λ \leq θ$ and converges to the strictly positive value $1 - θ / λ$ if $λ > θ$ (Kendall 1948; Yang and Rannala 1997).
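The two regimes described above can be checked numerically with a short sketch of Eq. (5.1). The function name `survival_prob` is hypothetical, and the use of `math.isclose` to select the critical case $λ = θ$ is our own choice.

```python
import math


def survival_prob(t, s, lam, theta):
    """R_t^s(lam, theta) from Eq. (5.1): probability that at least one sampled
    individual is present at time t, starting from one individual at time 0."""
    if math.isclose(lam, theta):
        # Critical case lam == theta
        return s / (1.0 + lam * s * t)
    growth = math.exp((theta - lam) * t)
    return s * (lam - theta) / (s * lam + (lam * (1.0 - s) - theta) * growth)
```

At $t = 0$ the expression returns s (the single starting individual is sampled with probability s), and for $λ > θ$ the large-t value does not depend on s.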

The number of species at time t in $T_{t}^{s}$ that have a copy of a particular feature f that arose at some fixed time $t_{0} \in (0, t)$ in $T_{t}$ is described exactly by the birth–death process $X_{t - t_{0}}$ with parameters $λ$ and $θ = μ + ν$ and survival probability s at the present. It follows that, as t becomes large, it becomes increasingly certain that none of the species at time t in the pruned tree will contain feature f if $λ \leq μ + ν$ , whereas if $λ > μ + ν$ , there is a positive limiting probability that f will be present in the extant leaves of the pruned tree.

Since $φ_{FD} (s) = φ_{PD} (s)$ (as given by Eqn. (4.1)) for all values of s when $ν = 0$ , in this section we will assume that $ν > 0$ (in addition to our universal assumption that $λ > μ$ ). Our main result provides an explicit formula for $φ_{FD} (s)$ in Part (a), and describes some of its key properties in Parts (b) and (c).

Theorem 5.1

Given $λ > μ$ and $ν > 0$ , let $ρ = \frac{μ + ν}{λ}$ and $β = 1 - \frac{λ - μ}{ν}$ . Then:

(a)
$\begin{matrix} φ_{FD} (s) = s \cdot \frac{I (s)}{I (1)}, \end{matrix}$
where
$\begin{matrix} I (s) = \int_{0}^{1} \frac{dx}{1 - s (\frac{1 - x^{β}}{1 - ρ})}, \quad \text{when } ρ \neq 1 \end{matrix}$
and
$\begin{matrix} I (s) = \int_{0}^{1} \frac{dx}{ν / λ - s \ln x}, \quad \text{when } ρ = 1 . \end{matrix}$
(b)
Conditional on the non-extinction of $T_{t}$ , $\frac{F D (T_{t}^{s})}{F D (T_{t})}$ converges in probability to $φ_{FD} (s)$ as $t \to \infty$ .
(c)
$φ_{FD} (s)$ is an increasing concave function that satisfies $1 \geq φ_{FD} (s) \geq s$ for all s.
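The explicit formula for $φ_{FD} (s)$ in the theorem above can be evaluated by simple numerical quadrature. The sketch below uses a midpoint rule in pure Python. The function names, the number of grid points, and the overflow guard are our own assumptions, and this is not the code used for the figures.

```python
import math


def _integrand(x, s, rho, beta, nu_over_lam):
    """Integrand of I(s) in the theorem, for 0 < x < 1."""
    if math.isclose(rho, 1.0):
        return 1.0 / (nu_over_lam - s * math.log(x))
    expo = beta * math.log(x)  # log of x**beta
    if expo > 700.0:
        # x**beta would overflow a float; the integrand is effectively 0 here
        return 0.0
    return 1.0 / (1.0 - s * (1.0 - math.exp(expo)) / (1.0 - rho))


def _integral_I(s, rho, beta, nu_over_lam, n):
    """Midpoint-rule approximation of I(s) on (0, 1) with n cells."""
    h = 1.0 / n
    return h * sum(_integrand((i + 0.5) * h, s, rho, beta, nu_over_lam) for i in range(n))


def phi_fd(s, lam, mu, nu, n=20000):
    """Approximate phi_FD(s) = s * I(s) / I(1), assuming lam > mu and nu > 0."""
    if not (lam > mu >= 0.0 and nu > 0.0 and 0.0 < s <= 1.0):
        raise ValueError("need lam > mu >= 0, nu > 0 and 0 < s <= 1")
    rho = (mu + nu) / lam
    beta = 1.0 - (lam - mu) / nu
    a = nu / lam
    return s * _integral_I(s, rho, beta, a, n) / _integral_I(1.0, rho, beta, a, n)
```

For example, `phi_fd(0.5, 1.0, 0.0, 1000.0)` should be close to 0.5, and `phi_fd(0.5, 1.0, 0.0, 1e-3)` should be close to the Yule value $\ln 2$ of $φ_{PD} (0.5)$ , in line with the two extremes discussed in the remarks.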

Remarks

(i)
Notice that although $φ_{FD} (s)$ depends on five parameters ( $r, s, λ, μ, ν$ ), Theorem 5.1(a) reveals that just three derived parameters suffice to determine $φ_{FD} (s)$ , namely s and the ratios $ρ_{1} = μ / λ \in [0, 1)$ and $ρ_{2} = ν / λ$ (these determine $ρ$ and $β$ , since $ρ = ρ_{1} + ρ_{2}$ and $β = 1 - 1 / ρ_{2} + ρ_{1} / ρ_{2}$ ). Notice also that $ρ = 1 \Leftrightarrow β = 0$ and $ρ > 1 \Leftrightarrow β > 0$ .
(ii)
Our proof relies on establishing the following exact expression for $E [F D (T_{t}^{s})]$ :
$\begin{matrix} E [F D (T_{t}^{s})] = r e^{(λ - μ) t} \int_{0}^{t} e^{- (λ - μ) τ} R_{τ}^{s} (λ, μ + ν) d τ + F_{0} R_{t}^{s} (λ, μ + ν), \qquad (5.2) \end{matrix}$
where $F_{0}$ is the number of features present at time $t = 0$ .

In particular, this also provides an exact expression for $φ_{FD} (t, s)$ . Notice that the ratio of the expected number of features present in the pruned tree to the expected number of species in the pruned tree is $\frac{E [F D (T_{t}^{s})]}{s e^{(λ - μ) t}}$ , and this ratio converges to $\frac{r (1 - ρ)}{ν} \int_{0}^{1} \frac{dx}{1 - s - ρ + s x^{β}}$ as $t \to \infty$ when $ρ \neq 1$ (via a further analysis of Eq. (5.2) in the Appendix).
(iii)
We saw in Sect. 4.3 that when $ν = 0$ , $φ_{FD} (s) = φ_{PD} (s)$ . At the other extreme, if $λ$ and $μ$ are fixed, and we let $ν \to \infty$ , then $φ_{FD} (s)$ converges to s (since $ρ \to \infty$ and $β \to 1$ in Theorem 5.1(a), so that the integrand of $I (s)$ tends to 1 for every s). Informally, when $ν$ is large compared to $λ$ , most of the features present among the extant leaves of $T_{t}$ will have arisen near the end of the pendant edges incident with these extant leaves (a formalisation of this claim appears in Rosindell et al. (2022)); if we now apply the FOB model then the expected proportion of these features that survive will be close to the expected proportion of leaves that survive, namely s.

Illustrative examples

First, consider a Yule tree (i.e. $μ = 0$ ) grown for time t. Figure 3 (left) plots $φ_{FD} (s)$ for values of $ν / λ \in \{0, 0.5, 1, 2, 10\}$ . When $ν = 0$ , $φ_{FD} (s)$ describes the proportion of expected PD that is retained in the pruned tree, and as $ν$ increases, $φ_{FD} (s)$ converges towards s (the expected proportion of leaves that survive extinction at the present). Figure 3 (right) plots $φ_{FD} (s)$ for birth–death trees with $μ / λ = 0.8$ , showing a similar trend, but with $φ_{FD} (s)$ ranging higher above the line $y = s$ .

Fig. 3 — *Left:* The function $φ_{FD} (s)$ as a function of s for pure-birth Yule trees ( $μ = 0$ ). *Right:* The function $φ_{FD} (s)$ for birth–death trees with $μ / λ = 0.8$ . The curves within each graph show the effect of increasing $ν$ , for values of $ν / λ$ between 0 (top) and 10 (bottom). The top curve also corresponds to the phylogenetic diversity ratio $φ_{PD} (s)$

Simulations

We ran simulations to test the expected relationship between $φ_{FD} (s)$ and s and to obtain estimates of the standard deviation. All simulations were run in R version 4.2.1 (R Core Team 2021). We first simulated 500 Yule trees (age = 100, $λ = 0.055, μ = 0$ , repeated and filtered to keep 500 trees with 250 tips) and 500 birth–death trees (age = 100, $λ = 0.11, μ = 0.088$ , repeated and filtered to keep 500 trees with 250–300 tips) using the sim.bd.age function in the package TreeSim (Stadler 2011). Features were then evolved on each tree, followed by separate extinction events.

Keeping the rate of feature gain fixed ( $r = 0.3$ ), we modelled five different rates of feature loss ( $ν \in \{0, 0.5 λ, λ, 2 λ, 10 λ\}$ ). We simulated feature gain and loss on each edge using the Gillespie algorithm (Gillespie 1976), where the time until the next event (either feature gain or loss) was drawn from an exponential distribution with rate $r + k ν$ , where k is the number of currently existing features at the start of the edge. The type of event was then determined with a Bernoulli draw with probability of a gain equal to $r / (r + k ν)$ . At each split on the tree, all existing features were copied to the descendant edges. Each gain event created a new unique feature, and each loss event randomly selected an existing feature to eliminate from the current edge. At the end of the simulation, the presence of features on each tip of the tree was recorded.
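The per-edge procedure described above can be sketched as follows. This is an illustrative Python version, not the authors' R code: the function name, the integer feature labels and the `next_id` counter are our own assumptions. The sketch recomputes k after every event, which is the standard Gillespie update; whether the original implementation did the same is not stated here.

```python
import random


def evolve_features_on_edge(features, length, r, nu, next_id, rng):
    """Simulate feature gain (rate r) and loss (rate nu per feature) along one edge.

    features : set of integer feature labels present at the start of the edge
    length   : edge length
    next_id  : label to give to the next newly gained feature
    rng      : a random.Random instance
    Returns (features present at the end of the edge, updated next_id).
    """
    current = set(features)
    elapsed = 0.0
    while True:
        k = len(current)
        total_rate = r + k * nu
        if total_rate <= 0.0:
            break  # no event can occur
        elapsed += rng.expovariate(total_rate)  # waiting time to the next event
        if elapsed >= length:
            break  # the next event would fall beyond the end of the edge
        if rng.random() < r / total_rate:
            current.add(next_id)  # gain: a new, unique feature
            next_id += 1
        else:
            # loss: remove one existing feature chosen uniformly at random
            current.remove(rng.choice(sorted(current)))
    return current, next_id
```

At a speciation event, each descendant edge would be started with a copy of the returned set, and `next_id` would be shared across the whole tree so that gained features remain distinct.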

Extinction events were simulated by randomly selecting a proportion $1 - s$ of tips to delete (so that a proportion s of tips survive), with s ranging from 0.05 to 0.95 by intervals of 0.05. The proportion of unique features remaining after extinction events was recorded, and the results are shown in Fig. 4.
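The extinction step can be sketched in the same spirit. The function below is a hypothetical helper, not the authors' code. It assumes that tip feature sets are stored in a list and that a fixed number of tips (the nearest integer to $(1 - s) n$ ) is deleted, rather than each tip being deleted independently.

```python
import random


def retained_feature_fraction(tip_features, s, rng):
    """Proportion of distinct features that remain after deleting a fraction 1 - s of tips.

    tip_features : list of sets, one set of feature labels per extant tip
    s            : proportion of tips that survive
    rng          : a random.Random instance
    """
    n = len(tip_features)
    n_keep = n - round((1.0 - s) * n)
    survivors = rng.sample(range(n), n_keep)  # indices of surviving tips
    before = set().union(*tip_features)
    after = set().union(*(tip_features[i] for i in survivors))
    if not before:
        raise ValueError("no features present before extinction")
    return len(after) / len(before)
```

Averaging this quantity over many trees for each value of s gives an empirical counterpart to $φ_{FD} (s)$ .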

Fig. 4 — Simulation results for remaining feature diversity as a function of s for pure-birth Yule trees ( $μ = 0$ , left) and birth–death trees ( $μ / λ = 0.8$ , right). Results shown for two of the five feature evolution parameter scenarios. Each point represents one simulation on one simulated tree ( $n = 500$ trees). The solid curves are the theoretical relationships between the expected proportion of remaining feature diversity and s for both of the shown feature evolution parameter scenarios

The resulting ratios of remaining features generally tracked expectations (the bias for birth–death trees when $ν / λ = 0$ is likely due to our theoretical results conditioning on t rather than n). The standard deviation (SD), calculated for each $ν$ on both Yule and birth–death trees, was fairly consistent for all $ν$ except $ν = 10 λ$ , where it was noticeably higher, as shown in Table 1. Because the number of total events along an edge (gains and losses) is described by a Poisson distribution, its variance increases with the mean, and this may explain the higher standard deviation at the highest loss rate.

Table 1.

Mean standard deviations of proportions of remaining feature diversity (each value is averaged over the 19 values of s between 0.05 and 0.95)

$ν / λ$	0	0.5	1	2	10
Yule	$2.06 \times 10^{- 2}$	$1.83 \times 10^{- 2}$	$1.84 \times 10^{- 2}$	$1.99 \times 10^{- 2}$	$3.67 \times 10^{- 2}$
Birth–death	$2.33 \times 10^{- 2}$	$2.02 \times 10^{- 2}$	$2.16 \times 10^{- 2}$	$2.52 \times 10^{- 2}$	$5.01 \times 10^{- 2}$


Features that appear in only one extant species

Let $\mathcal{U}_{t}$ denote the number of features that are present in precisely one extant species in $T_{t}$ , and let $U_{t} = E [\mathcal{U}_{t}]$ . The following result describes a simple relationship between $U_{t}$ and $F_{t} = E [F D (T_{t})]$ .

Proposition 1.4

\begin{matrix} \frac{d F_{t}}{dt} = r e^{(λ - μ) t} - (μ + ν) U_{t} . \qquad (5.3) \end{matrix}

Proof

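Let $\mathcal{F}_{t} = F D (T_{t})$ denote the (random) number of features present at time t among the extant leaves of $T_{t}$ , so that $F_{t} = E [\mathcal{F}_{t}]$ , and let $\mathcal{U}_{t}$ denote the (random) number of these features that are present in precisely one extant species, so that $U_{t} = E [\mathcal{U}_{t}]$ . Consider evolving $T_{t}$ for an additional (short) period $δ$ into the future. Then, conditional on $N_{t}$ (the number of leaves of $T_{t}$ present at time t) and $\mathcal{U}_{t}$ :

\begin{matrix} \mathcal{F}_{t + δ} - \mathcal{F}_{t} = \begin{cases} 1, & \text{with probability } r N_{t} δ + o (δ) ; \\ - 1, & \text{with probability } (μ + ν) δ \cdot \mathcal{U}_{t} + o (δ) ; \\ 0, & \text{with probability } 1 - ((μ + ν) \mathcal{U}_{t} + r N_{t}) δ + o (δ) . \end{cases} \end{matrix}

Applying the expectation operator (using $E [\mathcal{F}_{t + δ}] = E [E [\mathcal{F}_{t + δ} | N_{t}, \mathcal{U}_{t}]]$ and $E [N_{t}] = e^{(λ - μ) t}$ ) and letting $δ \to 0$ leads to the equation stated. $□$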
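As a hedged illustration of how this proposition can be used, the relation can be solved for $U_{t}$ . The sketch assumes that no features are present at the root ( $F_{0} = 0$ ), which is a simplifying assumption of ours. In that case Eq. (5.2) with $s = 1$ gives the expected feature diversity $F_{t}$ , and the differential equation (7.2) in the Appendix (again with $s = 1$ ) gives $\frac{d F_{t}}{dt} = (λ - μ) F_{t} + r R_{t}^{1} (λ, μ + ν)$ . Substituting this into the proposition yields:

\begin{matrix} U_{t} = \frac{r e^{(λ - μ) t} - (λ - μ) F_{t} - r R_{t}^{1} (λ, μ + ν)}{μ + ν} . \end{matrix}

As a consistency check, at $t = 0$ we have $F_{0} = 0$ and $R_{0}^{1} (λ, μ + ν) = 1$ , so the right-hand side equals 0, as it should when no features have yet arisen.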

Concluding comments

In this paper, we have considered, in order, three types of data to quantify the expected loss of feature diversity: sets of features across species (without any model of feature evolution or phylogeny), sets of features at the tips of a given phylogenetic tree, and sets of features at the tips of a random (birth–death) tree. The results of the earlier sections also proved helpful in establishing certain results in later sections.

In terms of wider significance to biodiversity conservation, our results and graphs in Sect. 5 suggest that the extent of relative feature diversity loss following extinction at the present is likely to be greater than that predicted by relative phylogenetic diversity loss for any given survival probability $s \in (0, 1)$ .

Of course, our results are based on simple models (of feature gain and loss, and extinction at the present) and so exploring how these results might extend to more complex and realistic biological models would be a worthwhile topic for future work.

Acknowledgements

We thank François Bienvenu and James Rosindell for helpful suggestions on an earlier draft of this manuscript, and Ailene MacPherson for technical advice regarding the simulations. We also thank the two anonymous reviewers for further helpful comments and the New Zealand Marsden Fund (MFP-UOC2005) for supporting this research.

Appendix: Proof of Theorem 5.1

Part (a): Consider a birth–death tree $T_{t}$ with parameters $λ, μ$ , a feature evolution model with parameters $r, ν$ , and a survival probability s for leaves at the present. Let $\mathcal{F}_{t}^{s} = F D (T_{t}^{s})$ , and let $\mathcal{G}_{t}^{s}$ be the random variable that has the same distribution as $\mathcal{F}_{t}^{s}$ with the initial condition $\mathcal{G}_{0}^{s} = 0$ (i.e. no features at the root of the tree). Let $Δ_{t}^{s}$ be the number of features that are present at the root of $T_{t}$ and also in the pruned tree $T_{t}^{s}$ . Then

\begin{matrix} \mathcal{F}_{t}^{s} = \mathcal{G}_{t}^{s} + Δ_{t}^{s} . \end{matrix}

Let $F_{t}^{s} = E [\mathcal{F}_{t}^{s}]$ and $G_{t}^{s} = E [\mathcal{G}_{t}^{s}]$ . The two random variables $\mathcal{G}_{t}^{s}$ and $Δ_{t}^{s}$ are not independent (they are linked by both the underlying tree $T_{t}$ and the pruning event at the present); however, by linearity of expectation:

\begin{matrix} F_{t}^{s} = E [\mathcal{G}_{t}^{s}] + E [Δ_{t}^{s}] = G_{t}^{s} + F_{0} R_{t}^{s} (λ, μ + ν), \qquad (7.1) \end{matrix}

where $F_{0}$ is the number of features present at the root of $T_{t}$ .

Now, consider $\mathcal{G}_{t + δ}^{s}$ , and the events that can occur in the interval $[0, δ)$ :

(a) A new feature arises (with probability $r δ + o (δ)$ );
(b) A speciation event occurs (with probability $λ δ + o (δ)$ );
(c) The lineage (and hence the tree) dies (with probability $μ δ + o (δ)$ );
(d) None of the above occurs (with probability $1 - (r + λ + μ) δ + o (δ)$ ).

Let X be the random variable taking values in $\{a, b, c, d\}$ which denotes which of these four events occurs. In Case (a), the new feature is also present in $T_{t}^{s}$ with probability $R_{t}^{s} (λ, μ + ν)$ and so

\begin{matrix} E [\mathcal{G}_{t + δ}^{s} | X = a] = E [\mathcal{G}_{t}^{s}] + R_{t}^{s} (λ, μ + ν) = G_{t}^{s} + R_{t}^{s} (λ, μ + ν) . \end{matrix}

For Case (b), $E [\mathcal{G}_{t + δ}^{s} | X = b] = E [\mathcal{G}_{t}^{s} + \mathcal{H}_{t}^{s}]$ , where $\mathcal{H}_{t}^{s}$ is an independent copy of $\mathcal{G}_{t}^{s}$ . Thus,

\begin{matrix} E [\mathcal{G}_{t + δ}^{s} | X = b] = 2 G_{t}^{s} . \end{matrix}

For Cases (c) and (d), we have: $E [\mathcal{G}_{t + δ}^{s} | X = c] = 0$ and $E [\mathcal{G}_{t + δ}^{s} | X = d] = G_{t}^{s}$ . Thus, by the law of total expectation,

\begin{matrix} G_{t + δ}^{s} = E [E [\mathcal{G}_{t + δ}^{s} | X]] = G_{t}^{s} + (r R_{t}^{s} (λ, μ + ν) + (λ - μ) G_{t}^{s}) δ + o (δ) . \end{matrix}

Consequently, the function $G_{t}^{s}$ satisfies the first-order linear differential equation:

\begin{matrix} \frac{d}{dt} G_{t}^{s} - (λ - μ) G_{t}^{s} = r R_{t}^{s} (λ, μ + ν), \qquad (7.2) \end{matrix}

subject to the initial condition $G_{0}^{s} = 0$ . Solving Eq. (7.2) gives:

\begin{matrix} G_{t}^{s} = \int_{0}^{t} r e^{(λ - μ) τ} R_{t - τ}^{s} (λ, μ + ν) d τ . \qquad (7.3) \end{matrix}

By making a change of variable we can rewrite Eq. (7.3) as:

\begin{matrix} G_{t}^{s} = r e^{(λ - μ) t} \int_{0}^{t} e^{- (λ - μ) τ} R_{τ}^{s} (λ, μ + ν) d τ . \qquad (7.4) \end{matrix}

Combining this equation and Eq. (7.1) provides the explicit expression for $F_{t}^{s}$ described earlier (Eq. (5.2)).
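The exact finite-time expression can also be evaluated numerically, which gives a way to see how quickly $φ_{FD} (t, s)$ approaches its limit. The following is a minimal sketch using a midpoint rule. The function names, the grid size, and the default values `r=1.0` and `f0=0.0` are our own assumptions.

```python
import math


def survival_prob(t, s, lam, theta):
    """R_t^s(lam, theta) from Eq. (5.1)."""
    if math.isclose(lam, theta):
        return s / (1.0 + lam * s * t)
    return s * (lam - theta) / (
        s * lam + (lam * (1.0 - s) - theta) * math.exp((theta - lam) * t)
    )


def expected_fd(t, s, lam, mu, nu, r, f0=0.0, n=20000):
    """E[FD(T_t^s)] from Eq. (5.2), with the integral approximated by a midpoint rule."""
    h = t / n
    integral = h * sum(
        math.exp(-(lam - mu) * tau) * survival_prob(tau, s, lam, mu + nu)
        for tau in ((i + 0.5) * h for i in range(n))
    )
    return r * math.exp((lam - mu) * t) * integral + f0 * survival_prob(t, s, lam, mu + nu)


def phi_fd_finite(t, s, lam, mu, nu, r=1.0, f0=0.0):
    """Finite-time ratio phi_FD(t, s) = E[FD(T_t^s)] / E[FD(T_t)]."""
    return expected_fd(t, s, lam, mu, nu, r, f0) / expected_fd(t, 1.0, lam, mu, nu, r, f0)
```

When `f0` is zero the factor r cancels in the ratio, which illustrates the independence of $φ_{FD} (t, s)$ from r in that case. For a Yule tree with very small $ν$ and large t, the ratio at $s = 0.5$ should be close to $\ln 2$ .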

We now substitute in the expression for $R_{t}^{s} (λ, θ)$ from Eq. (5.1) (with $t = τ$ and $θ = μ + ν$ ). For $ρ \neq 1$ , we have:

\begin{matrix} G_{t}^{s} = r e^{(λ - μ) t} s (1 - ρ) \int_{0}^{t} \frac{e^{(μ - λ) τ}}{(1 - s - ρ) e^{- λ (1 - ρ) τ} + s} d τ . \qquad (7.5) \end{matrix}

Multiplying the numerator and denominator of the integrand by $e^{λ (1 - ρ) τ}$ gives $\frac{e^{- ν τ}}{1 - s - ρ + s e^{λ (1 - ρ) τ}}$ , and then by making the substitution $x = e^{- ν τ}$ , we obtain:

\begin{matrix} G_{t}^{s} = \frac{r e^{(λ - μ) t} s (1 - ρ)}{ν} \cdot \int_{e^{- ν t}}^{1} \frac{dx}{1 - s - ρ + s x^{β}}, \qquad (7.6) \end{matrix}

for the value of $β$ described in the theorem. Combining Eqs. (7.1) and (7.6) gives:

\begin{matrix} F_{t}^{s} = \frac{r e^{(λ - μ) t} s (1 - ρ)}{ν} \cdot \int_{e^{- ν t}}^{1} \frac{dx}{1 - s - ρ + s x^{β}} + F_{0} R_{t}^{s} (λ, μ + ν) . \qquad (7.7) \end{matrix}

Thus, for $ρ \neq 1$ ,

\begin{matrix} \frac{F_{t}^{s}}{F_{t}^{1}} = \frac{s \int_{e^{- ν t}}^{1} \frac{dx}{1 - s - ρ + s x^{β}} + o (1)}{\int_{e^{- ν t}}^{1} \frac{dx}{- ρ + x^{β}} + o (1)}, \end{matrix}

where the two terms of order o(1) (which converge to zero as t grows) refer to the last term on the right of Eq. (7.7) which is bounded above by the constant $F_{0}$ and so is asymptotically negligible in comparison to the term $e^{(λ - μ) t}$ in Eq. (7.6). This gives the limit for $φ_{FD} (s)$ as stated in Part (a) for fixed $ρ \neq 1$ .

In the case where $ρ = 1$ (which implies $β = 0$ ), we use the corresponding expression for $R_{t}^{s} (λ, θ)$ from Eq. (5.1) with $t = τ$ and $θ = μ + ν$ to obtain:

\begin{matrix} G_{t}^{s} = r s e^{(λ - μ) t} \int_{0}^{t} \frac{e^{- (λ - μ) τ}}{1 + λ s τ} d τ . \end{matrix}

By a similar approach to the above we are led to the second equation in Part (a).

Part (b): Let $T_{t}$ be a birth–death tree with rates $λ, μ$ where $λ > μ$ , let $n (T_{t})$ denote the number of leaves of $T_{t}$ present at time t, and let $E^{'}$ be the event that $n (T_{t}) > 0$ (i.e. the non-extinction of $T_{t}$ ).

Conditional on the event $E^{'}$ , the number of leaves in $T_{t}$ tends to infinity (with probability 1) as $t \to \infty$ (Jagers 1992), and so we can define a sequence of trees $T_{1}, T_{2}, \dots, T_{n}, \dots$ by letting $T_{k}$ denote the tree $T_{τ}$ at the first time $τ = τ (k)$ when $T_{τ}$ has k extant leaves (we ignore leaves of $T_{t}$ that have already become extinct by time $τ$ ).

Next, we establish that $E [F D (T_{n})] \geq c n$ for a constant $c > 0$ (in order to apply Theorem 3.1). The tree $T_{n}$ has n extant pendant edges, and a randomly selected pendant edge in $T_{n}$ has a strictly positive probability p of having length at least $κ > 0$ (dependent on $μ$ and $λ$ ), by Theorem 3.1 of Stadler and Steel (2012). Now, $E [F D (T_{n})]$ is bounded below by the expected total number of features that arise on the n pendant edges and survive to the end of the edge (since all these features will necessarily be distinct from each other, and from other features that arise in the tree). Moreover, for each edge having length at least $κ$ , the expected number of features that arise on this edge and survive to the end of the edge is at least $\frac{r}{ν} (1 - e^{- κ ν})$ . Thus the expected number of features contributed by the pendant edges to $F D (T_{n})$ is at least $n \cdot p \frac{r}{ν} (1 - e^{- κ ν})$ .
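The per-edge bound used here can be checked directly. The following short sketch relies only on the model's assumptions that features arise at rate r along an edge and that each is lost independently at rate $ν$ . A feature that arises at distance u before the end of the edge survives to the end with probability $e^{- ν u}$ , so restricting attention to the final stretch of length $κ$ gives:

\begin{matrix} \int_{0}^{κ} r e^{- ν u} du = \frac{r}{ν} (1 - e^{- κ ν}) . \end{matrix}

Features that arise earlier on a longer edge can only add to this count, which is why the expression is a lower bound rather than an equality.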

Thus, we can now apply Theorem 3.1, since $c_{s} = \lim_{n \to \infty} \frac{E [F D (T_{n}^{s})]}{E [F D (T_{n})]}$ exists, and equals $φ_{FD} (s)$ , so $\frac{F D (T_{n}^{s})}{F D (T_{n})}$ (and thus $\frac{F D (T_{t}^{s})}{F D (T_{t})}$ ) converges in probability to $φ_{FD} (s)$ as n (respectively t) grows.

Part (c): We apply Proposition 2.2. By Part (i) of that result, and conditioning on $T_{t}$ , we obtain $E [F D (T_{t}^{s}) | T_{t}] \geq s F D (T_{t})$ , and so taking expectation again (over the distribution of $T_{t}$ ) gives $E [F D (T_{t}^{s})] \geq s E [F D (T_{t})]$ , and thus $φ_{FD} (s) = \lim_{t \to \infty} \frac{E [F D (T_{t}^{s})]}{E [F D (T_{t})]} \geq s .$

The inequality $φ_{FD} (s) \leq 1$ is clear since, for any choice of $T_{t}$ , we have $F D (T_{t}^{s}) \leq F D (T_{t})$ with probability 1.

For concavity, Proposition 2.2 implies that the conditional expectation $E [\frac{F D (T_{t}^{s})}{F D (T_{t})} \mid T_{t}]$ is concave as a function of s, and (by taking expectation over the distribution of $T_{t}$ ), $E [\frac{F D (T_{t}^{s})}{F D (T_{t})}]$ is also concave as a function of s. Finally, by Part (b) of the current theorem (and since the ratio is bounded between 0 and 1), $E [\frac{F D (T_{t}^{s})}{F D (T_{t})}]$ converges (deterministically) to $φ_{FD} (s)$ and so this function is also concave as a function of s. $□$

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Devictor V, Mouillot D, Meynard C, Jiguet F, Thuiller W, Mouquet N. Spatial mismatch and congruence between taxonomic, phylogenetic and functional diversity: the need for integrative conservation strategies in a changing world. Ecol Lett. 2010;13:1030–1040. doi: 10.1111/j.1461-0248.2010.01493.x.
Faith DP. Conservation evaluation and phylogenetic diversity. Biol Cons. 1992;61(1):1–10. doi: 10.1016/0006-3207(92)91201-3.
Feller W. An introduction to probability theory and its applications. 3. London: Wiley; 1950.
Gillespie D. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J Comput Phys. 1976;22:403–434. doi: 10.1016/0021-9991(76)90041-3.
Huson D, Steel M. Phylogenetic trees based on gene content. Bioinformatics. 2004;20:2044–2049. doi: 10.1093/bioinformatics/bth198.
Jagers P. Stabilities and instabilities in population dynamics. J Appl Probab. 1992;29:770–780. doi: 10.2307/3214711.
Kendall DG. On the generalized birth-and-death process. Ann Math Stat. 1948;19:1–15. doi: 10.1214/aoms/1177730285.
Lambert A, Steel M. Predicting the loss of phylogenetic diversity under non-stationary diversification models. J Theor Biol. 2013;337:111–124. doi: 10.1016/j.jtbi.2013.08.009.
Mazel F, Mooers AO, Riva GVD, Pennell MW. Conserving phylogenetic diversity can be a poor strategy for conserving functional diversity. Syst Biol. 2017;66(6):1019–1027. doi: 10.1093/sysbio/syx054.
Mazel F, Pennell MW, Cadotte MW, Diaz S, Riva GVD, Grenyer R, Leprieur F, Mooers AO, Mouillot D, Tucker CM, Pearse WD. Prioritizing phylogenetic diversity captures functional diversity unreliably. Nat Commun. 2018;9:2888. doi: 10.1038/s41467-018-05126-3.
Mazel F, Pennell MW, Cadotte MW, Diaz S, Riva GVD, Grenyer R, Leprieur F, Mooers AO, Mouillot D, Tucker CM, Pearse WD. Reply to: “Global conservation of phylogenetic diversity captures more than just functional diversity”. Nat Commun. 2019;10:858. doi: 10.1038/s41467-019-08603-5.
McDiarmid C. Surveys in combinatorics, London mathematical society lecture notes series 141. Cambridge: Cambridge University Press; 1989. On the method of bounded differences; pp. 148–188.
Miller JT, Jolley-Rogers G, Mishler BD, Thornhill AH. Phylogenetic diversity is a better measure of biodiversity than taxon counting. J Syst Evol. 2018;56(6):663–667. doi: 10.1111/jse.12436.
Mitzenmacher M, Upfal E. Probability and computing: Randomized algorithms and probabilistic analysis. Cambridge: Cambridge University Press; 2005.
Mooers A, Gascuel O, Stadler T, Li H, Steel M. Branch lengths on birth–death trees and the expected loss of phylogenetic diversity. Syst Biol. 2012;61(2):195–203. doi: 10.1093/sysbio/syr090.
Owen NR, Gumbs R, Gray CL, Faith DP. Global conservation of phylogenetic diversity captures more than just functional diversity. Nat Commun. 2019;10:859. doi: 10.1038/s41467-019-08600-8.
Raup DM. Extinction: bad genes or bad luck? Oxford: Oxford University Press; 1993.
Rosindell J, Manson K, Gumbs R, Pearse W, Steel M (2022) Phylogenetic biodiversity metrics should account for both accumulation and attrition of evolutionary heritage. Technical Report 2022.07.16.499419, BioRxiv
Stadler T. Simulating trees with a fixed number of extant species. Syst Biol. 2011;60:676–684. doi: 10.1093/sysbio/syr029.
Stadler T, Steel M. Distribution of branch lengths and phylogenetic diversity under homogeneous speciation models. J Theor Biol. 2012;297(2):33–40. doi: 10.1016/j.jtbi.2011.11.019.
R Core Team (2021) R: a language and environment for statistical computing (accessed on 7 August 2021)
Tucker CM, Aze T, Cadotte MW, Cantalapiedra JL, Chisholm C, Díaz S. Assessing the utility of conserving evolutionary history. Biol Rev. 2019;94:1740–1760. doi: 10.1111/brv.12526.
Tucker CM, Davies TJ, Cadotte MW, Pearse WD. On the relationship between phylogenetic diversity and trait diversity. Ecology. 2018;99(6):1473–1479. doi: 10.1002/ecy.2349.
Wicke K, Fischer M. Phylogenetic diversity and biodiversity indices on phylogenetic networks. Math Biosci. 2018;298:80–90. doi: 10.1016/j.mbs.2018.02.005.
Wicke K, Mooers A, Steel M. Formal links between feature diversity and phylogenetic diversity. Syst Biol. 2021;70:480–490. doi: 10.1093/sysbio/syaa062.
Yang Z, Rannala B. Bayesian phylogenetic inference using DNA sequences: a Markov chain Monte Carlo method. Mol Biol Evol. 1997;14(7):717–724. doi: 10.1093/oxfordjournals.molbev.a025811.

