Abstract
Phylogenetic diversity is a popular measure for quantifying the biodiversity of a collection Y of species, while phylogenetic diversity indices provide a way to apportion phylogenetic diversity to individual species. Typically, for some specific diversity index, the phylogenetic diversity of Y is not equal to the sum of the diversity indices of the species in Y. In this paper, we investigate the extent of this difference for two commonly-used indices: Fair Proportion and Equal Splits. In particular, we determine the maximum value of this difference under various instances including when the associated rooted phylogenetic tree is allowed to vary across all rooted phylogenetic trees with the same leaf set and whose edge lengths are constrained by either their total sum or their maximum value.
Keywords: Phylogenetic tree, Phylogenetic diversity, Diversity indices, Fair Proportion index, Equal Splits index
Introduction
Phylogenetic diversity (PD) is prominent measure in evolutionary biology to quantify the biological diversity of a collection of species. Intuitively, the phylogenetic diversity of such a collection quantifies how much of the ‘Tree of Life’ is spanned by the species in the collection. Introduced by Faith in 1992 (Faith 1992), PD has been analysed and applied in a variety of contexts for various taxa including plants, bacteria, and mammals (Cadotte et al. 2008; Lozupone and Knight 2007; Safi et al. 2011). On the other hand, (phylogenetic) diversity indices, also called evolutionary distinctiveness measures, quantify an individual species’ contribution to overall phylogenetic diversity, thus providing a convenient way to rank species as, for example, in conservation planning (Redding et al. 2008, 2014). Intuitively, these indices assign species with fewer and more distant relatives higher values than those species with many more and closer relatives.
Diversity indices have been proposed and used as an alternative to PD in conservation prioritisation (Isaac et al. 2007; Redding et al. 2010; Stein et al. 2018). However, although the sum of the diversity indices across all species equates to the total phylogenetic diversity, in general, the sum of the diversity indices of a subset of the species differs from the subset’s phylogenetic diversity. If a biologist or conservationist is switching from the ideal measure of full-tree PD to an approximation by a simple diversity index, it is important for them to understand the scale of the possible difference that this switch might involve.
Using empirical experiments, the authors in Redding et al. (2008) studied how well different diversity indices ‘captured’ PD with respect to tree shape. Their experiments indicate that diversity indices captured more PD on “more unbalanced trees and on trees with many splits near the present” and concluded that “new measures... may provide a transparent alternative to more complicated full-tree approaches.”
In this paper, we rigorously investigate the extent of this difference for two natural and well-known diversity indices, Fair Proportion and Equal Splits. Both of these indices were included in the empirical study in Redding et al. (2008) and are used, for example, in the conservation initiative ‘EDGE of Existence Programme’ established by the Zoological Society of London (Isaac et al. 2007). As well as determining the maximum difference under various scenarios (see below for details), which will inform the practitioner as to the possible extent to which an index can fail to approximate PD, we also characterised the shape and structure of the phylogenetic trees that realised these differences. Such information is potentially useful to the practitioner in that the more their phylogenetic trees of interest resemble an extremal instance, the more attention they may need to pay to the real possibility of the diversity index failing to accurately capture PD.
A version of the investigation in this paper was posed in earlier work (Haake et al. 2008) but, apart from the related simulation study (Redding et al. 2008), it has not been explored further. Furthermore, the work in this paper is related to a recent study in Wicke and Steel (2020) in which the authors consider the extent of the difference between the Fair Proportion and Equal Splits indices of a species. We next formalise the investigation in this paper, ending the introduction with a high-level overview of the main results.
Throughout the paper, X is a non-empty finite set. A rooted phylogenetic X-tree is a rooted tree with leaf set X whose non-leaf vertices have out-degree at least two. For technical reasons, if , we additionally allow a rooted phylogenetic tree to consist of the single vertex in X. If all non-leaf vertices have out-degree exactly two, then is binary. For ease of reading, as all phylogenetic trees in this paper are rooted and binary, we will refer to a “rooted binary phylogenetic tree” as simply a “phylogenetic tree”.
Let be a phylogenetic X-tree with root , and consider a map from the set E of edges of to the non-negative reals. Collectively, we denote and by the ordered pair and refer to as an edge-weighted phylogenetic X-tree. Furthermore, we use to denote the total sum of the edge lengths of . The phylogenetic diversity of a subset Y of X on , denoted , is the sum of the weights of the edges in the (unique) minimal subtree of connecting the vertices in . Observe that . To illustrate, consider the edge-weighted phylogenetic X-tree shown in Fig. 1, where . The minimal subtree of connecting the vertices in is indicated by dashed edges, and so PD of is 18. The PD of is 10.
Fig. 1.

An edge-weighted phylogenetic X-tree . The minimal subtree of connecting the vertices in is indicated with dashed edges
A phylogenetic diversity index for an edge-weighted phylogenetic X-tree is a function that assigns a score to each leaf of such that
Furthermore, if, for each , we can write as a linear function of the edge lengths of , that is,
| 1 |
for some constants that are independent of , we say is a linear diversity index. It is easily checked that an arbitrary function of the form shown in (1) is a phylogenetic diversity index if and only if, for each edge e of , we have
| 2 |
Two well-studied linear diversity indices underlie the results in this paper. Let be an edge-weighted phylogenetic X-tree. The Fair Proportion index (FP) for a leaf , denoted , is the value
where denotes the (unique) path in from the root to x, and n(e) denotes the number of leaves that are at the end of a directed path starting at the root and traversing e. Intuitively, the FP index distributes the length of e evenly amongst its descendant leaves. The Fair Proportion index is also called ‘evolutionary distinctiveness’ (Isaac et al. 2007). The second index is the Equal Splits index (ES) which for a leaf is the value
where if e is the pendant edge of incident with x, and if is a non-pendant edge of , then is the product of the out-degrees of the non-leaf vertices (including v) on the directed path from v to x. Since all phylogenetic trees in this paper are binary, is always a power of 2. In particular, if is a non-pendant edge of , then is , where m is the number of edges from v to x. The Fair Proportion and Equal Splits indices were introduced in Redding (2003) and Redding and Mooers (2006), respectively, and a direct comparison of these two indices was investigated recently in Wicke and Steel (2020).
As examples of the Fair Proportion and Equal Splits indices, consider the edge-weighted phylogenetic X-tree in Fig. 1. The FP indices for and are
and
The ES indices for and are
and
For an edge-weighted phylogenetic X-tree and diversity index , although the sum of the diversity indices across all taxa in X equates to , the sum of the diversity indices across all taxa in a proper subset Y of X will typically not equal . For example, in Fig. 1, , but
To quantify this, set
| 3 |
We refer to as the diversity difference of Y (relative to and ). In this paper, we are interested in the extent of this difference for the Fair Proportion and Equal Splits indices.
The following lemma shows that the diversity difference is always non-negative provided satisfies a natural distribution property. This property says that, for an edge-weighted phylogenetic X-tree and all , the value can be written as a non-negative linear function of the lengths of edges in the path from the root of to x.
Lemma 1.1
Let be an edge-weighted phylogenetic X-tree with root , and let be a linear diversity index such that, for each ,
| 4 |
for some non-negative constants that are independent of . If , then
Proof
Let . Then
and
where denotes the minimal subtree of connecting the leaves in Y and . Since is non-negative for all and edges e of , it follows by (2) that . Hence as required.
We call a phylogenetic diversity index that satisfies (4) a descendant diversity index. It is easily checked that the Fair Proportion and Equal Splits indices are examples of descendant diversity indices.
In this paper, for each of the Fair Proportion and Equal Splits indices, we will determine, for all positive integers k, the maximum value of the diversity difference over all subsets of X of size k under the following instances:
-
(i)
a fixed phylogenetic X-tree whose edge lengths are fixed;
-
(ii)
a fixed phylogenetic X-tree whose edge lengths are constrained by either their (total) sum or their maximum value; and
-
(ii)
across all phylogenetic X-trees whose edge lengths are constrained by either their (total) sum or their maximum value.
In particular, for (i) and (ii), we give polynomial-time algorithms for finding these maximum values for an arbitrary descendant diversity index while, for (iii), we characterise the edge-weight phylogenetic X-trees and subsets of size k that realise this maximum value for FP and ES. For (iii), it turns out that, in the case that the edge lengths are constrained by their sum, the class of phylogenetic trees that maximise the diversity difference under ES is always a subclass of the phylogenetic trees that maximise the diversity difference under FP. The corresponding characterisations are stated as Theorems 4.2 and 4.1, respectively. However, in the case that the edge lengths are constrained by their maximum value, the class of phylogenetic trees that maximise the diversity difference under FP and ES coincide (Theorem 4.7).
The paper is organised as follows. The next section contains some preliminaries that are used throughout the paper. Section 3 considers maximising the diversity difference on a fixed phylogenetic tree, while Sect. 4 considers maximising the diversity difference across all phylogenetic trees. The last section, Sect. 5, consists of a brief discussion.
Preliminaries
Let be a phylogenetic X-tree, and let t and w be vertices of . If P is a (directed) path in from t to w, the length of P, denoted |P|, is the number of edges in P. We denote this path by . Furthermore, we sometimes refer to a path in from an edge to w, in which case, we mean from v to w, and denote this path by . Note that if (u, v) is an edge of
, then u is ancestral to v.
Subtrees. Let be a phylogenetic X-tree with root , and let be a subset of . The minimal subtree of connecting the vertices in is denoted by . Furthermore, if , the restriction of to , denoted by , is the phylogenetic -tree obtained from by suppressing all non-root vertices of degree two.
Let be a phylogenetic X-tree and let be an edge of . The subset of leaves that are descendants of v is called a cluster of and is denoted by C(v). The set X as well as each of the singleton subsets of X are clusters of every phylogenetic X-tree, and so such clusters are called the trivial clusters of . Hence a cluster of is non-trivial if . Thus if has a non-trivial cluster, then . Furthermore, the rooted subtree obtained from by deleting e and whose root is v is a pendant subtree of .
Cherries and chains. A 2-element subset of X, say , is a cherry of a phylogenetic X-tree if a and b have the same parent. Now let be a subset of X such that and, for all , let denote the parent of . We call a chain of if there is an ordering of the elements of , say , such that
is a (directed) path in , in which case, is the first parent of the chain and is the last parent of the chain. Note that we still refer to as a chain of if is a cherry and is a path in . The edge set of consists of the pendant edges incident with a leaf in as well as the edges in the path .
Subtree prune and regraft. Let be a phylogenetic X-tree. For the purposes of the upcoming operation, view the root of adjoined to the original root via a pendant edge. Let (u, v) be an edge of such that . Let be the phylogenetic X-tree obtained from by deleting (u, v) and suppressing u, and then reattaching the pendant subtree by subdividing an edge, f say, in the component of containing with a new vertex and adjoining with a new edge . We say that has been obtained from by a rooted subtree prune and regraft operation. More specifically, in this operation, we have pruned C(v) and regrafted it to f. In the special case f is the pendant edge incident with the root of , we say that C(v) is pruned and regrafted to . (Effectively, this corresponds to the subtree being regrafted above the root.) At the completion of this operation, we no longer view the root as being joined via a pendant edge, and so the root of (labelled ) is the unique vertex of of in-degree zero after deleting the temporary root and its incident edge.
Diversity indices. Let be an edge-weighted phylogenetic X-tree with edge set E, and let be a descendant diversity index. Then, for each ,
for some non-negative constants that are independent of . For a subset Y of X, we denote the contribution of an edge of to by , that is,
if is non-empty, and otherwise. Thus
It will be clear in the context which subset of X and which descendant diversity index is referring to. Observe that . Moreover, if either or , then .
Maximising the diversity difference on a fixed tree
In this section, we consider the problem of determining the maximum value of the diversity difference for a fixed phylogenetic tree for when is an arbitrary descendant diversity index.
Fixed edge lengths
We first consider the instance of a fixed phylogenetic tree with fixed edge lengths. Let be an edge-weighted phylogenetic X-tree, and let be an arbitrary linear diversity index. Now let be the edge-weighted phylogenetic X-tree obtained from by adding, for each , the value
to the length of the pendant edge incident with x. For all non-pendant edges e, we have . Note that the use of the value is simply to have a constant large enough so that all pendant edges have non-negative lengths.
Lemma 3.1
Let be an edge-weighted phylogenetic X-tree, let be a linear diversity index, let k be a non-negative integer, and let Y be a subset of X of size k. Then
Proof
Now,
Lemma 3.1 is the basis of the following algorithm which finds the maximum value of the diversity difference under a linear diversity index for a given edge-weighted phylogenetic tree. It is well known that Step 2 of the algorithm, that is, finding a subset Y of X of size k such that for all subsets of X of size k, takes quadratic time in the size of X (Pardi and Goldman 2005; Steel 2005).
MaxDiversityDiff
Input: An edge-weighted phylogenetic X-tree , a linear diversity index , and a non-negative integer k.
Output: A subset Y of X of size k that maximises across all subsets of X of size k.
Let be the map defined by setting for all pendant edges e, where x is the leaf incident with e, and for all non-pendant edges. Construct .
Find a subset Y of X of size k such that for all subsets of X of size k.
Return Y and .
The next theorem is an immediate consequence of the discussion prior to the description of MaxDiversityDiff and the fact that the construction of from takes linear time (in the size of X).
Theorem 3.2
Let be an edge-weighted phylogenetic X-tree, let k be a positive integer, and let be a linear diversity index. Then applying MaxDiversityDiff to , , and k correctly returns a subset Y of X of size k that maximises
across all subsets of X of size k in time .
Maximum sum of edge lengths
Now consider the problem of maximising the diversity difference on a fixed phylogenetic tree whose edge lengths are constrained by their (total) sum. More particularly, we consider the following optimisation problem:
Fixed Tree and Total Weight
Instance: A positive integer k, a phylogenetic X-tree , a positive real number m, and a descendant diversity index .
Question: Find a subset Y of X of size k and a map on the set E of edges of with that maximise
We will make use of the next lemma in this subsection as well as in Sect. 4.
Lemma 3.3
Let be an instance of Fixed Tree and Total Weight, and let and be a solution to this problem. Let be a map on the set E of edges of with and let Y be a subset of X of size k such that
If and are distinct edges of such that and , then
-
(i)
, and
-
(ii)
, where , , and for all .
Proof
Let be such a map. For each , let and, for each edge e of , define and to be the contribution of e to and , respectively, where is as defined in (ii). To prove the lemma, it suffices to show that
Since , it follows that . Now, for each ,
If
then, by linearity,
a contradiction to the maximality of . Using a symmetric argument, it follows that
Hence
This completes the proof of the lemma.
By repeated applications of Lemma 3.3(ii), we obtain the following corollary.
Corollary 3.4
Let be an instance of Fixed Tree and Total Weight, and let and be a solution to this problem. Then there exists a weighting of the edge set E of with for some edge and a subset Y of X of size k such that
It follows by Corollary 3.4 that, given , k, m, and as in its statement, we can find an edge weighting that maximises across all subsets of X of size k as follows. For all , let denote the edge-weighted phylogenetic X-tree, where is the edge weighting of in which and for all . Now, for each , apply Theorem 3.2 and, more particularly, MaxDiversityDiff to , k, and . The maximum of the values returned by these applications gives the desired value (as well as a subset of X of size k realising this value). Since the total number of edges in is , we have the next theorem.
Theorem 3.5
The problem Fixed Tree and Total Weight can be solved in time, where |X| is the size of the leaf set of .
Maximum edge length
Next consider the problem of maximising the diversity difference on a fixed phylogenetic X-tree whose edge lengths are constrained by some maximum value, say . We begin with a lemma that reduces the problem to an earlier problem.
Lemma 3.6
Let be a phylogenetic X-tree, let , and let be a non-negative real. If is a map on the set E of edges of such that for all , and is a descendant diversity index, then
where is the map defined by for all .
Proof
Let , and let and denote the contribution of e to and , respectively. To prove the lemma, it suffices to show that .
Since and is linear,
thereby completing the proof of the lemma.
By Lemma 3.6, we may assume that all edges of have length . Thus the problem is reduced to maximising the diversity difference on a fixed phylogenetic tree with fixed edge lengths. Therefore, we can find the maximum value in time by applying Theorem 3.2 and, in particular, MaxDiversityDiff, to , a non-negative integer k, and a descendant diversity index .
Maximising the diversity difference across all trees
In contrast to the computational results in last section, in this section we characterise the edge-weighted phylogenetic X-trees that maximise the diversity difference for FP and ES. If the size of the subset of interest is |X|, the diversity difference is zero for all edge-weighted phylogenetic X-trees. Thus, throughout this section, we will impose the condition that .
Maximum sum of edge lengths
We first consider the problem of maximising the diversity difference across all edge-weighted phylogenetic trees whose edge lengths are constrained by their (total) sum:
All Trees and Total Weight
Instance: A positive integer , a positive real number m, and a diversity index.
Question: Find an edge-weighted phylogenetic X-tree with and a subset Y of X that maximises
We begin with the Fair Proportion index.
Theorem 4.1
Let (k, X, m, FP) be an instance of All Trees and Total Weight. Let be an edge-weighted phylogenetic X-tree with and let Y be a subset of X of size k. Suppose that and is a solution to this instance of All Trees and Total Weight. Then
if and only if has an edge such that , , and , in which case,
where .
To illustrate Theorem 4.1, a generic edge-weighted phylogenetic X-tree optimising as in the statement of this theorem is shown in Fig. 2(i).
Fig. 2.

(i) An illustration of Theorem 4.1, where , , and . All edges have weight zero except the edge (u, v) which has weight m. The “triangles” represent pendant subtrees whose leaf sets are . (ii) An illustration of Theorem 4.2, where , , and is non-empty. All edges have weight zero except the edge (u, v) which has weight m. The “triangles” represent pendant subsets whose leaf sets are
Proof of Theorem 4.1
First note that if and Y satisfy the conditions described in the statement of the theorem, then it follows that
To show that the only if direction holds, let be an edge-weighted phylogenetic X-tree, and suppose that together with a subset Y of size k maximises across all edge-weighted phylogenetic X-trees with and subsets of X of size k. For each edge of , let and, for convenience, . Thus
if and if . Let denote the edges of in which . Note that, for such edges, is non-empty. We next show that by showing that if , then
contradicting the maximality of .
Suppose . Since FP is descendant, it follows by Lemma 3.3(i) that
for all . Therefore
where the second-to-last inequality holds as the size of can be no more than the sum of the sizes of and . Moreover, it is strict when since (i) if there is no directed path containing and , then and are disjoint and, in particular, is non-empty and (ii) if, without loss of generality, there is a directed path from to , then and, as
again is non-empty.
Hence and so, to maximise the diversity difference, , , and , in which case,
This completes the proof of the theorem.
The next theorem is the analogue of Theorem 4.1 for the Equal Splits index.
Theorem 4.2
Let (k, X, m, ES) be an instance of All Trees and Total Weight. Let be an edge-weight phylogenetic X-tree with and let Y be a subset of X of size k, and suppose that and is a solution to this instance of All Trees and Total Weight. Then
if and only if has an edge such that , is a chain whose first parent is v, and is non-empty, in which case,
where .
A generic edge-weighted phylogenetic X-tree optimising as in the statement of Theorem 4.2 is shown in Fig. 2(ii). The proof of Theorem 4.2 takes a similar approach to that of Theorem 4.1 but is slightly more involved. We begin with a lemma.
Lemma 4.3
Let be a phylogenetic X-tree, and let . Let be an edge of , and suppose that and are both non-empty. Let be the phylogenetic X-tree obtained from by replacing the pendant subtree with a phylogenetic tree on leaf set C(v), where is a chain whose first parent is v and is a pendant subtree below the chain. If is not a chain whose first parent is v in , then
where and .
Proof
Suppose that is not of the same form as . Then there is a such that either z is in a cherry of , where or z is a leaf of a pendant subtree of whose two maximal subtrees each contain an element of Y. Let be the phylogenetic X-tree that is obtained from by pruning z and regrafting to the edge e. Relabel the vertex v as and the newly created vertex in the subdivided edge v (so that C(v) is unchanged). It is now easily checked that if , then . Moreover, for some , we have . In particular,
The lemma now follows by repeating this process until we have constructed a phylogenetic tree in the same form as .
Proof of Theorem 4.2
If and Y satisfy the conditions described in the statement of Theorem 4.2, then it is easily checked that
To complete the proof, let be an edge-weighted phylogenetic X-tree, and suppose that together with a subset Y of X of size k maximises across all edge-weighted phylogenetic X-trees with and subsets of X of size k. For each edge of , define to be the contribution of e to . Thus
if and if . Note that, if , then is non-empty. Let denote the edges of in which . We next show that by showing that if then
Say . Since ES is descendant, it follows by Lemma 3.3(i) that
| 5 |
for all . Hence
where and , and the last inequality holds by Lemma 4.3. Since , it follows by (5) that is non-empty, so . That is,
Hence , and so to maximise the diversity difference, a single edge say of has length and, by Lemma 4.3, is a chain with first parent v, and is non-empty, in which case,
This completes the proof of the theorem.
Maximum edge length
We next consider the problem of maximising the diversity difference across all phylogenetic trees whose edge lengths are constrained by some maximum value. By Lemma 3.6, we may assume all edges of have the same weight. Thus, without loss of generality, we will assume that all edges have weight 1 and so, for simplicity, we write for . To this end, for a descendant diversity index , the contribution of an edge e of to is denoted by . We begin with two lemmas and a corollary.
Lemma 4.4
Let be a phylogenetic X-tree, let such that , and let . If C is a non-trivial cluster of such that either or , then there exists a phylogenetic X-tree such that
Proof
Let v be a non-leaf vertex of , and suppose that or . Without loss of generality, we may assume that C(v) is maximal with this property. Let (u, v) be the edge of directed into v and note that, for (u, v) and all edges on a path from v to a leaf, the contribution of these edges under either FP or ES to is zero regardless of whether or . Let w be the child vertex of u that is not v. By maximality and , if , then and, if , then . Let E denote the edge set of , let P denote the set of edges of on the path from its root to u, and let . Depending on whether or , we will compare with two other phylogenetic X-trees.
Let denote the phylogenetic X-tree obtained from by pruning a and regrafting it to . Let denote the set of edges of , let denote the root of , let q denote the child of that is not a, and let denote the set of edges of on the path from q to u. Let denote the phylogenetic X-tree obtained from by pruning a and regrafting it to (u, w). Let denote the set of edges of , let denote the parent of a in , and let denote the set of edges of on the path from its root to .
First suppose that is FP. If , then
Furthermore, as , and and are both non-empty,
and . Hence . On the other hand, if , then it is easily checked that
and
as has a non-empty intersection with Y and . Thus , and so the lemma holds if is FP.
Now suppose that is ES. If , then
Since and are both non-empty, , and so if is ES. Furthermore, if , then
Hence, as ,
as since and are both non-empty. Therefore . This completes the proof of the lemma.
An immediate consequence of Lemma 4.4 is the next corollary.
Corollary 4.5
Let be a phylogenetic X-tree, let k be a positive integer such that , let Y be a subset of X of size k, and let . Suppose that and Y maximises
across all phylogenetic X-trees and all subsets of X of size k. If is a cherry of , then . In particular, has no non-trivial cluster C such that either or .
Lemma 4.6
Let be a phylogenetic X-tree, let k be a positive integer such that , let Y be a subset of X of size k, and let . Suppose that and Y maximises
across all phylogenetic X-trees and all subsets of X of size k. If is a chain of , in which is a cherry, then
-
(i)
and
-
(ii)
for some , we have and .
Proof
By Corollary 4.5, satisfies (i). Suppose that does not satisfy (ii). Let be the least index such but . Let be the phylogenetic X-tree obtained from by interchanging and , that is, pruning and regrafting it to the edge directed into the parent of so that is a chain of . Let and denote the parents of and in , respectively, and let and denote the parents of and in , respectively.
If is the Fair Proportion index, then
This contradiction to the maximality of and Y implies that the lemma holds when is FP.
Now suppose that is the Equal Splits index. Let P denote the path in from the root to and let denote the path in from the root to . Then
For all , we have , and the contribution of to is . So
This contradiction to the maximality of and Y implies that the lemma holds when is ES, thereby completing the proof of the lemma.
The next theorem is illustrated in Fig. 3. In particular, an edge-weighted phylogenetic X-tree optimising as in the statement of this theorem is shown in Fig. 3, where . It is interesting to note that the outcome of Theorem 4.7 is the same for FP and ES, although the proof of the theorem requires FP and ES to be considered separately.
Fig. 3.

An illustration of Theorem 4.7, where , so , and
Theorem 4.7
Let k be a positive integer such that , let be a positive real, and let . Then an edge-weighted phylogenetic X-tree with for all and a subset Y of X of size k maximises
across all phylogenetic X-trees with for all and all subsets of X of size k if and only if for all and X is a chain such that , and, for some , and .
Proof
Let be an edge-weighted phylogenetic X-tree with for all and let Y be a subset of X of size k. Suppose that and Y maximise across all edge-weighted phylogenetic X-trees with for all and all subsets of X of size k. Recall that, by Lemma 3.6, we may assume that all edges have weight .
We first show that does not have two distinct cherries. Suppose that has two such cherries. Then has a vertex v in which C(v) contains exactly two cherries. By Lemma 4.6 and the maximality of and Y, the set C(v) consists of two disjoint chains and each of which satisfies properties (i) and (ii) in the statement of Lemma 4.6. Let and let .
First suppose that is FP. Without loss of generality, we may assume that
Say and . Let be the phylogenetic X-tree obtained from by pruning and regrafting it to the edge (u, v) directed into v. This operation is shown in Fig. 4, where is drawn above . Let and denote the parents of in and , respectively. Then it is easily seen that
Furthermore, as ,
Thus . It follows that , and so we may assume that .
Fig. 4.
Illustrating the pruning and regrafting of the leaf in the proof of Theorem 4.7 for FP
Say and . Now let be the phylogenetic X-tree obtained from by pruning and regrafting it to (u, v). let and denote the parents of in and , respectively. Then
Also, as and ,
Thus , contradicting the maximality of and Y. Therefore , and so we may assume that .
Without loss of generality, we may now assume that . Let and denote the parents of and in , respectively. Let be the phylogenetic X-tree obtained from by pruning and regrafting it to the edge . Let denote the parent of in . Then
and
as . Repeating this process for each of , we obtain a phylogenetic tree X-tree with one less cherry than , and
a contradiction to maximality. We deduce that has exactly one cherry if is FP.
Now suppose that is ES. Without loss of generality, we may assume that and . Let be the phylogenetic X-tree obtained from by pruning and regrafting it to the edge , where is the parent of in (as depicted in Fig. 5). Then, noting that is not in Y,
Furthermore, since the length of the each path from an edge to is reduced by 1 in forming as edge is suppressed,
and. since the length of the each path from an edge to is increased by 1 in forming as edge is added,
But, as ,
Therefore
It now follows that . This contradiction to maximality implies that has exactly one cherry if is ES. It now follows by Lemma 4.6 that, if , then to maximise we have that X is a chain and Y is a subset of X of size k as described in the statement of the theorem.
Fig. 5.
Illustrating the pruning and regrafting of the leaf in the proof of Theorem 4.7 for ES
Discussion
The results for Fair Proportion and Equal Splits in the last section are strikingly similar. Indeed, they are exactly the same when considering the outcomes of Theorem 4.7. While the proof of Theorem 4.7 eventually required separating into two parts to independently consider FP and ES, the two approaches taken were alike. This suggests that there is probably a natural class of phylogenetic diversity indices which these results are representative of. If so, what is this class? The class of descendant diversity indices is unlikely to be sufficient. However, what if we additionally required the following property? Let be an edge-weighted phylogenetic X-tree and let be a descendant diversity index. Let be an edge of , and let such that . If the number of edges from v to y is at most the number of edges from v to x, then the contribution of to is at most the contribution of to .
Funding
Open Access funding enabled and organized by CAUL and its Member Institutions
Footnotes
Charles Semple, was supported by the New Zealand Marsden Fund.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Faith DP. Conservation evaluation and phylogenetic diversity. Biol Conserv. 1992;61:1–10. doi: 10.1016/0006-3207(92)91201-3. [DOI] [Google Scholar]
- Cadotte MW, Cardinale BJ, Oakley TH. Evolutionary history and the effect of biodiversity on plant productivity. Proc Natl Acad Sci. 2008;105:17012–17017. doi: 10.1073/pnas.0805962105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haake C-J, Kashiwada A, Su FE. The Shapely value of phylogenetic trees. J Math Biol. 2008;56:479–497. doi: 10.1007/s00285-007-0126-2. [DOI] [PubMed] [Google Scholar]
- Isaac NJB, Turvey ST, Collen B, Waterman C, Baillie JEM. Mammals on the EDGE: Conservation priorities based on threat and phylogeny. PLoS One. 2007;2:e296. doi: 10.1371/journal.pone.0000296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lozupone CA, Knight R. Global patterns in bacterial diversity. Proc Natl Acad Sci. 2007;104:11436–11440. doi: 10.1073/pnas.0611525104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pardi F, Goldman N. Species choice for comparative genomics: being greedy works. PLoS Genet. 2005;1:e71. doi: 10.1371/journal.pgen.0010071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Redding DW (2003) Incorporating genetic distinctness and reserve occupancy into a conservation prioritisation approach. Master’s thesis. University of East Anglia, Norwich
- Redding DW, DeWolff CV, Mooers AO. Evolutionary distinctiveness, threat status, and ecological oddity in primates. Conserv Biol. 2010;24:1052–1058. doi: 10.1111/j.1523-1739.2010.01532.x. [DOI] [PubMed] [Google Scholar]
- Redding DW, Hartmann K, Mimoto A, Bokal D, DeVos M, Mooers AO. Evolutionary distinctive species often capture more phylogenetic diversity than expected. J Theor Biol. 2008;251:606–615. doi: 10.1016/j.jtbi.2007.12.006. [DOI] [PubMed] [Google Scholar]
- Redding DW, Mazel F, Mooers AO. Measuring evolutionary isolation for conservation. PLoS One. 2014;9:e113490. doi: 10.1371/journal.pone.0113490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Redding DW, Mooers AO. Incorporating evolutionary measures into conservation prioritization. Conserv Biol. 2006;20:1670–1678. doi: 10.1111/j.1523-1739.2006.00555.x. [DOI] [PubMed] [Google Scholar]
- Safi K, Cianciaruso MV, Loyola RD, Brito D, Armour-Marshall K, Diniz-Filho JAF. Understanding global patterns of mammalian functional and phylogenetic diversity. Phil Trans R Soc B. 2011;366:2536–2544. doi: 10.1098/rstb.2011.0024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steel M. Phylogenetic diversity and the greedy algorithm. Syst Biol. 2005;54:527–529. doi: 10.1080/10635150590947023. [DOI] [PubMed] [Google Scholar]
- Stein RW, Mull CG, Kuhn TS, Aschliman NC, Davidson LNK, Joy JB, Smith GJ, Dulvy NK, Mooers AO. Global priorities for conserving the evolutionary history of sharks, rays and chimaeras. Nat Ecol Evol. 2018;2:288–298. doi: 10.1038/s41559-017-0448-4. [DOI] [PubMed] [Google Scholar]
- Wicke K, Steel M. Combinatorial properties of phylogenetic diversity indices. J Math Biol. 2020;80:687–715. doi: 10.1007/s00285-019-01438-0. [DOI] [PubMed] [Google Scholar]


