Skip to main content
Springer logoLink to Springer
. 2024 Jul 9;86(8):103. doi: 10.1007/s11538-024-01332-x

Phylogenetic Diversity Indices from an Affine and Projective Viewpoint

V Moulton 1, A Spillner 2, K Wicke 3,
PMCID: PMC11233365  PMID: 38980452

Abstract

Phylogenetic diversity indices are commonly used to rank the elements in a collection of species or populations for conservation purposes. The derivation of these indices is typically based on some quantitative description of the evolutionary history of the species in question, which is often given in terms of a phylogenetic tree. Both rooted and unrooted phylogenetic trees can be employed, and there are close connections between the indices that are derived in these two different ways. In this paper, we introduce more general phylogenetic diversity indices that can be derived from collections of subsets (clusters) and collections of bipartitions (splits) of the given set of species. Such indices could be useful, for example, in case there is some uncertainty in the topology of the tree being used to derive a phylogenetic diversity index. As well as characterizing some of the indices that we introduce in terms of their special properties, we provide a link between cluster-based and split-based phylogenetic diversity indices that uses a discrete analogue of the classical link between affine and projective geometry. This provides a unified framework for many of the various phylogenetic diversity indices used in the literature based on rooted and unrooted phylogenetic trees, generalizations and new proofs for previous results concerning tree-based indices, and a way to define some new phylogenetic diversity indices that naturally arise as affine or projective variants of each other or as generalizations of tree-based indices.

Keywords: Phylogenetic tree, Phylogenetic diversity, Shapley value, Fair Proportion index Cluster system, Split system

Introduction

Evolutionary isolation metrics or phylogenetic diversity indices provide quantitative measures of biodiversity and are increasingly popular tools to prioritize species for conservation (Isaac et al. 2007; Redding et al. 2008, 2014; Redding and Mooers 2006; Tucker et al. 2016; Vane-Wright et al. 1991). These indices quantify the importance of a species to overall biodiversity by assessing its unique and shared evolutionary history as indicated by its placement in an underlying phylogeny. Preserving phylogenetic diversity and the “Tree of Life” has become an integral component of conservation considerations (see, e.g., the “Phylogenetic Diversity Task Force”1 initiated by the IUCN). Indeed, conservation initiatives like the EDGE of Existence programme2 (Gumbs et al. 2023; Isaac et al. 2007) incorporate phylogenetic diversity indices in their identification of species that are both evolutionary distinct and globally endangered. Moreover, the “guide to phylogenetic metrics for conservation, community ecology and macroecology” by Tucker et al. (2016) has been cited more than 700 times since its publication, thus demonstrating an even more widespread interest and application of phylogenetic tools, and in particular different phylogenetic diversity indices, within conservation settings.

Mathematically, with a multitude of phylogenetic diversity indices at hand, there is now an increasing interest in understanding how the different indices relate to each other. Much of the previous work in this direction has focused on comparing and analyzing different indices derived from rooted phylogenetic trees (Bordewich and Semple 2024; Manson 2024; Manson and Steel 2023; Wicke and Steel 2020). Phylogenetic diversity indices have also been defined for unrooted trees (Haake et al. 2008; Wicke and Steel 2020), and an exploration of the relationship between indices derived via rooted and unrooted phylogenetic trees is presented by Wicke and Steel (2020).

As one might expect, phylogenetic diversity indices for rooted and unrooted trees are closely related. To illustrate this, consider the much studied fair proportion index (Isaac et al. 2007; Redding 2003). For the rooted phylogenetic tree with edge weights in Fig. 1a, the value FPr(x) of the rooted fair proportion index for a species xX (here and throughout this manuscript, X denotes a non-empty finite set of taxa or species) is computed by adding, over all edges that are contained in the path from the root r to the leaf labeled by x, the weight of the edge divided by the total number of species for which the path from the root to the leaf labeled by that species also contains that edge. For example, for species e there are three edges in the path from r to e and we obtain

FPr(e)=33+22+41=6. 1

In Wicke and Steel (2020) the fair proportion index has also been defined for unrooted phylogenetic trees. Consider the unrooted phylogenetic tree with edge weights in Fig. 1b. The removal of an edge breaks the tree into two subtrees. The value FPu(x) of the unrooted fair proportion index for a species xX is one half of the value obtained by adding, over all edges in the unrooted tree, the weight of the edge divided by the number of species that lie in the same subtree as x after removal of the edge. For example, for species e we obtain

FPu(e)=12·34+24+73+64+22+14+41=316. 2

As can be seen in Fig. 1, xXFPr(x)=xXFPu(x)=25, which is the total weight of the edges of the phylogenetic tree from which the values are computed. Among other natural requirements, this property called completeness (formally defined in Sect. 2), should be preserved when relating phylogenetic diversity indices for rooted and unrooted trees.

Fig. 1.

Fig. 1

a A rooted phylogenetic tree on the set X={a,b,c,d,e} of species. The root vertex is r and all edges are weighted. The table gives the value FPr(x) of the fair proportion index on this rooted tree for each xX. b The unrooted phylogenetic tree with weighted edges on the same set X of species obtained by suppressing the root of the tree in (a). The table gives the value FPu(x) of the fair proportion index on this unrooted tree for each xX

To better understand how this can be systematically achieved, in this paper we consider indices from the viewpoint of affine and projective clustering. This way of thinking about clustering has its origins in Dress (1997), and since then has become a useful tool in phylogenetic combinatorics (see, e.g., Dress 2012, Ch. 9 and Kleinman et al. 2013). More specifically, in this paper we extend the study of phylogenetic diversity indices into the more general setting of collections of clusters (subsets of a set) and collections of splits (bipartitions of a set). These settings correspond to affine and projective viewpoints of clustering, respectively (see Sect. 5). Considering collections of clusters and splits in general can be beneficial since it allows for the representation of data that is not tree-like or where it is difficult to determine the correct topology for a phylogenetic tree. Indeed, phylogenetic diversity indices have already been introduced for collections of splits (see, e.g., Abhari et al. 2024).

To illustrate this way of thinking, as hinted above, collections of clusters naturally arise when computing the rooted fair proportion index. In particular, clusters arise from rooted phylogenetic trees by taking, for each edge, the subset of species for which the path from the root to that species contains the edge (e.g., in Fig. 1a the edge with weight 3 next to the root gives rise to the cluster {c,d,e}). Thus, the sum used to compute the fair proportion index of e in Eq. (1) is just the sum of the values ω(C)|C| taken over all clusters C that contain e, where ω(C) is the weight of the edge giving rise to cluster C and |C| denotes the number of species in C. Similarly, we can interpret Eq. (2) in terms of splits, using the fact that splits arise from unrooted phylogenetic trees by taking, for each edge, the split obtained by removing the edge and considering the subsets of species in the two resulting subtrees (e.g., in Fig. 1b the edge with weight 7 gives rise to the split {{a,b},{c,d,e}}). Then the sum used to compute the unrooted fair proportion index of e in Eq. (2) is just the sum of the values λ(S)2|A| taken over all splits S coming from the tree, where λ(S) is the weight of the edge giving rise to S and A is the part in S that contains e. More generally, the sums used to compute FPr and FPu can be applied to any collection of weighted clusters or splits, respectively (for example, the values for FPu computed for a collection of weighted splits visualized by the network in Fig. 2a are shown in the second column in Fig. 2b).

Fig. 2.

Fig. 2

a A network visualizing a collection S of weighted splits on the set X={a,f,h,m,r,s} of six owl populations (see Fig. 12 in the Appendix for more details on this data set). Each band of parallel edges in this network corresponds to a split of X and the length of the edges in the band corresponds to the weight of the split. b The values of four different phylogenetic diversity indices FPu, SV, Ψτ and ΨR considered in this paper, computed for the six owl populations. The populations are ranked by the values of FPu

Thinking about phylogenetic diversity indices in an affine and projective way, leads us to two key questions that we will consider in this paper:

  • (i)

    How do properties of tree-based phylogenetic diversity indices extend to indices defined via collections of clusters and splits?

  • (ii)

    How can the relationships between collections of clusters and collections of splits be exploited to relate cluster- and split-based phylogenetic diversity indices?

In this contribution, we give answers to both of these questions, introducing the concept of phylogenetic diversity indices based on collections of clusters and splits, and giving characterizations for some of these indices in terms of their special properties. We also present a general framework to systematically relate cluster- and split-based phylogenetic diversity indices via a process that is commonly used in phylogenetic combinatorics. This provides concise proofs for generalizations of previous results for trees as well as ways to define new indices.

The rest of this paper is structured as follows. We first illustrate our new concepts and results by focusing on a few well-known tree-based phylogenetic diversity indices, namely the fair proportion index, the Shapely value (Haake et al. 2008; Shapley 1953), and the equal splits index (Redding and Mooers 2006), before we look into some new split-based phylogenetic diversity indices. More specifically, in Sect. 2 we formally define cluster-based phylogenetic diversity indices and present some key properties that such indices may have. Then, in Sect. 3, we present a characterization of the general cluster-based fair proportion index. In Sect. 4 we consider the Shapley value, SV (the values of SV for the six owl populations considered in Fig. 2a are given in Fig. 2b). In particular, we present a characterization of the Shapley value and use its relationship to the fair proportion index to describe the first building block of our framework. In Sect. 5 we then give the complete framework, and illustrate some of its applications in Sect. 6 using the fair proportion index and a split-based phylogenetic diversity index, Ψτ, related to the equal splits index as examples. Then, in Sect. 7, we introduce a family of new split-based phylogenetic diversity indices, ΨR, that generalize the phylogenetic diversity index for unrooted phylogenetic trees given by Wicke and Steel (2020, Sec. 5.2) (the values of the indices Ψτ and ΨR for the six owl populations considered in Fig. 2a are also given in Fig. 2b). We conclude in Sect. 8 discussing some potential interesting directions for future work.

Cluster-Based Indices

Let X be a non-empty finite set. We denote the power set of X by P(X). We call a non-empty subset CX a cluster on X and call a non-empty collection CP(X)\{} a cluster system on X. In this section we introduce the concept of a phylogenetic diversity index on a cluster system, and illustrate some basic properties of these indices by considering a generalization of the fair proportion index for rooted trees that we discussed in the introduction.

To motivate the definition of a phylogenetic diversity index on a cluster system, we briefly look again at rooted phylogenetic trees. Fixing a rooted phylogenetic tree T on a set X of species, a phylogenetic diversity index Φ on T assigns, to each weighting3ω of the edges in T, a vector Φ(ω)RX. To give an example, let Φ be the fair proportion index on the rooted phylogenetic tree in Fig. 1a. Then, for the weighting ω of its edges given in Fig. 1a, we can write

Φ(ω)=(5,4,7,3,6), 3

or, in more detail, (Φ(ω))(a)=5,(Φ(ω))(b)=4,,(Φ(ω))(e)=6.

As described in the introduction, each edge in a rooted phylogenetic tree on X is associated with a cluster on X. In Fig. 3a the clusters associated with the edges of the rooted phylogenetic tree in Fig. 1a are given, where each cluster is weighted by the length of the corresponding edge. Note that this cluster system C has a special property, namely it is a hierarchy, that is, CC{,C,C} holds for all C,CC. In particular, as we see in this example, hierarchies are essentially those cluster systems that can be represented by a rooted phylogenetic tree on X [see, e.g., Semple and Steel (2003, Thm. 3.5.2) for a more precise statement of this fact using the concept of a rooted X-tree].

Fig. 3.

Fig. 3

a The weighted clusters on X corresponding to the edges of the rooted phylogenetic tree in Fig. 1a. b The matrix Γ from Eq. (5) for the fair proportion index on C, where C is the cluster system consisting of the clusters given in (a)

Bearing these facts in mind, for an arbitrary cluster system C on X, we consider the space L(C) consisting of all weightings ω:CR. We then define a phylogenetic diversity index on C to be a map Φ:L(C)RX. For example, following the intuitive description in the introduction, we define the fair-proportion index on a cluster system C on X by putting, for each ωL(C) and all xX,

(FP(ω))(x)=CC:xCω(C)|C|. 4

It can then be checked that (4) applied to the weighted cluster system in Fig. 3a yields precisely the vector we saw in (3).

We now introduce three key properties of cluster-based indices which generalize properties of tree-based indices described in the literature. We will illustrate these properties for the fair proportion index and, as we shall see, these properties are also shared by some of the other phylogenetic diversity indices that we consider later on.

Let C be a cluster system on X. A phylogenetic diversity index Φ on C is additive if

  • (A)

    Φ(ω1+ω2)=Φ(ω1)+Φ(ω2) for all ω1,ω2L(C),

and Φ is homogeneous if

  • (H)

    Φ(a·ω)=a·Φ(ω) for all ωL(C) and all aR.

Properties (A) and (H) together mean that Φ is a linear map, in which case we call Φ linear. Phylogenetic diversity indices considered in the literature are usually linear. This may be due to useful consequences of linearity such as, for example, that applying a linear phylogenetic diversity index to a weighting obtained by taking the average over several different edge weightings of a fixed rooted phylogenetic tree amounts to averaging the values of the phylogenetic diversity index. In this paper, most (but not all) results assume linearity of the phylogenetic diversity indices involved. To avoid any confusion, we will always explicitly state which properties we assume.

Note that every linear phylogenetic diversity index Φ on C corresponds to a |C|×|X|-matrix Γ=ΓΦ=(γ(C,x)) such that

(Φ(ω))(x)=CCω(C)·γ(C,x) 5

for all ωL(C) and all xX. The entries of the matrix Γ are usually assumed to be non-negative (see, e.g., Manson and Steel 2023, Def. 1). Again, our framework also applies when this assumption is violated. In Sect. 7 we will come back to this point. In Fig. 3b we give, as an example, the matrix Γ corresponding to the fair proportion index on the cluster system in Fig. 3a.

Finally, we call a phylogenetic diversity index Φ on C complete if

  • (C)

    xX(Φ(ω))(x)=CCω(C) holds for all ωL(C).

For tree-based phylogenetic diversity indices, completeness is often required as part of their definition (see, e.g., Bordewich and Semple 2024; Wicke and Steel 2020). For example, we have seen in the introduction for the fair proportion index on a rooted phylogenetic tree that xXFPr(x) equals the total weight of the edges in the tree. Property (C) expresses this fact in terms of clusters. Note that a linear phylogenetic diversity index Φ on C is complete if and only if xXγ(C,x)=1 for all CC (cf. Wicke and Steel 2020, Eq. (2) and Wicke 2020, Eq. (1)), where Γ=(γ(C,x)) is the matrix from Eq. (5).

We next show that the fair proportion index satisfies all three of the above properties.

Lemma 2.1

The fair proportion index is a complete, linear phylogenetic diversity index on C for any cluster system C on X.

Proof

As we have seen in the example in Fig. 3, the fair proportion index can be described by a matrix Γ=(γ(C,x)) where the row associated with a cluster CC contains |C| entries equal to 1|C| and |X|-|C| entries equal to 0.

We conclude this section with an example of a weighted cluster system C on X={a,b,c,d} that is not a hierarchy and which illustrates the possible consequences of restricting C to some hierarchy. The cluster system C and the weighting ω are given in Fig. 4a. Figure 4b gives the matrix Γ=ΓΦ corresponding to the fair proportion index Φ on C. We have Φ(ω)=32,3,52,2. Now consider the hierarchies C1=C\{{a,b}} and C2=C\{{b,c},{b,c,d}} and the fair proportion index Φi on Ci, i{1,2}. Note that the matrix ΓΦi is obtained from Γ by removing the rows corresponding to clusters in C\Ci and the weighting ωi is just the restriction of ω to Ci. This yields Φ1(ω1)=32,1,12,1 and Φ2(ω2)=1,52,52,2. As can be seen, the rankings of the elements in X obtained by Φ1 and Φ2 are different and, thus, need not coincide with the ranking obtained by considering the whole cluster system C.

Fig. 4.

Fig. 4

a A weighted cluster system C on X={a,b,c,d} that is not a hierarchy. b The matrix Γ from Eq. (5) for the fair proportion index Φ on C

A Characterization of the Fair Proportion Index

In general, it is of interest to characterize phylogenetic diversity indices in terms of their key properties, as this can help to understand better how they are related to one another. In this section, as an illustration for cluster-based indices, we shall present a characterization of the fair proportion index. This generalizes the characterization of the fair proportion index on rooted phylogenetic trees given by Manson and Steel (2023, Thm. 6).

Our characterization will require three properties. The first two properties concern linear phylogenetic diversity indices Φ on a cluster system C on X, and are given in terms of the matrix corresponding to Φ. For all CC, let ch(C) denote the set of those CC with CC such that there is no CC with CCC. We emphasize that even though a cluster in C may receive the weight 0, it is still considered as present in C and, therefore, the sets ch(C) for CC do not change when such a weighting is encountered.

We say that Φ satisfies the neutrality condition if

  • (NC)

    the entries of the matrix ΓΦ in Eq. (5) are such that γ(C,x)=γ(C,y) holds for all CC with ch(C)= and all x,yC.

A property similar to (NC) was introduced by Manson and Steel (2023) for rooted X-trees. In addition, we say that Φ is a descendant diversity index if

  • (DD)

    Φ is complete, all entries of the matrix ΓΦ in Eq. (5) are non-negative and, for all CC, γ(C,x)=0 if xC.

Property (DD) was introduced by Bordewich and Semple (2024) for the special case where the cluster system C is a hierarchy (using the equivalent description of hierarchies in terms of rooted X-trees).

The third property is a bit more complicated, and thus we first motivate it using rooted trees as in Manson and Steel (2023). Let T be a rooted phylogenetic tree on X with edge weights and let T be the rooted phylogenetic tree on X obtained by collapsing one of the edges of T. This is illustrated in Fig. 5. In addition, let Φ and Φ be phylogenetic diversity indices on T and T, respectively. Both Φ and Φ yield a vector in RX for all weightings of the edges of T and T, respectively. The topology of the rooted phylogenetic trees, however, may have an impact on how the weights of the edges are used to compute these vectors by Φ and Φ, respectively. Therefore, since the topologies of T and T differ, the vector in RX that we obtain by Φ for T will usually not coincide with the vector that we obtain by Φ in the limit, as the weight of the edge in T tends to 0 (keeping the weights of all other edges in T in constant).

Fig. 5.

Fig. 5

Collapsing the edge with weight 6 in the rooted phylogenetic tree T on X={a,b,c,d} yields the rooted phylogenetic tree T on X

With this in mind, let C be a cluster system on X and let CC be such that C=C\{C} is non-empty. A phylogenetic diversity index Φ on C is downward continuous with respect to a phylogenetic diversity index Φ on C if

  • (DC)
    for all ωL(C) we have
    limω(C)0Φ(ω)=Φ(ω), 6

where ωL(C) is the weighting with ω(D)=ω(D) for all DC. Note that when forming the cluster system C we remove the cluster C from the cluster system C but not the elements in C from the set X. In particular, both Φ and Φ yield vectors in RX.

With the properties (NC), (DD) and (DC) in hand, we now present our characterization of the fair proportion index.

Theorem 3.1

Suppose we have, for each cluster system C on X, a phylogenetic diversity index ΦC on C. Then the following are equivalent:

  • (i)

    For all cluster systems C on X, ΦC is the fair proportion index on C.

  • (ii)

    For all cluster systems C on X, ΦC is a descendant diversity index that satisfies the neutrality condition and is downward continuous with respect to ΦC\{C} for all CC such that C\{C}.

Before proving this theorem, to provide some intuition for its statement, consider the cluster system C={{a,b},{a,b,c,d},{a,b,c,d,e}} on X={a,b,c,d,e}, and let Φ be the linear phylogenetic diversity index with the following matrix ΓΦ

abcde{a,b}1/21/2000{a,b,c,d}1/41/41/41/40{a,b,c,d,e}2/52/51/151/151/15.

In addition, let C={a,b}, C=C\{C}, and let Φ be the linear phylogenetic diversity index on C whose matrix ΓΦ is obtained by deleting the row corresponding to C from ΓΦ. Then both Φ and Φ satisfy properties (NC) and (DD). Moreover, Φ is downward continuous with respect to Φ. But, clearly, Φ is not the fair proportion index on C. Hence, it is not enough to look at a phylogenetic diversity index Φ on a cluster system C and phylogenetic diversity indices Φ on cluster systems C\{C} for some CC. Instead we need to look at all cluster systems on X.

Proof of Theorem 3.1

We first show that (i) implies (ii). Consider a cluster system C on X and put Φ=ΦC. By assumption, Φ is the fair proportion index on C. Thus, in view of Lemma 2.1, Φ is linear and complete. Moreover, as illustrated by the example in Fig. 3b, it follows immediately from the definition of the fair proportion index in (4) that Φ is a descendant diversity index and satisfies the neutrality condition.

It remains to establish downward continuity. Consider a cluster CC and assume that C=C\{C}. Put Φ=ΦC. Let Γ=ΓΦ and Γ=ΓΦ be the matrices whose entries satisfy Eq. (5) for Φ and Φ, respectively. By assumption, Φ is the fair proportion index on C and Φ is the fair proportion index on C. Therefore, it follows again from the definition of the fair proportion index in (4) that deleting the row corresponding to the cluster C from the matrix Γ yields the matrix Γ. But this immediately implies that Eq. (6) holds for all ωL(C), as required.

Next we show that (ii) implies (i). Let C be a cluster system on X. By assumption, Φ=ΦC is a descendant diversity index and, therefore, linear. Let Γ=ΓΦ be the matrix whose entries satisfy Eq. (5) for Φ. In view of the definition of the fair proportion index in (4), it suffices to show that the entries of Γ satisfy

γ(C,x)=1|C|forxC0forxC

for all CC and all xX. We use induction on |C| to show this.

To establish the base case of the induction, assume |C|=1. Consider CC and xX. In view of |C|=1 we have ch(C)=. Thus, in view of the assumption that Φ is a descendant diversity index and satisfies the neutrality condition, we have γ(C,x)=1|C| for all xC and γ(C,x)=0 for all xX\C, as required.

Next assume |C|2. Consider CC and put C=C\{C}. By the assumption that Φ is downward continuous with respect to Φ=ΦC, the matrix Γ=ΓΦ whose entries satisfy Eq. (5) for Φ is obtained by deleting the row corresponding to cluster C from Γ. Thus, by induction, we have

γ(D,x)=1|D|forxD0forxD

for all DC\{C} and all xX. Since this holds for all CC, this finishes the inductive proof.

The Shapely Value

The Shapely value is a well-known phylogenetic diversity index that can be computed using rooted phylogenetic trees and that has its origins in game theory. Interestingly, to understand a generalization of this index in the cluster setting, it is necessary to consider mappings on slightly more general spaces than those used in the definition of cluster-based phylogenetic diversity indices in Sect. 2. In this section, we shall explain this, and then give a characterization of a cluster-based version of the Shapely value.

As before, let X be a finite non-empty set. A game is a map g:P(X)R. The elements of X are referred to as the players in this context and the value g(C) for some CP(X) can be interpreted as the gain when the players in C form a coalition. One aspect of analyzing such a game is to quantify, for each player xX, the value v(x)R of the player with respect to the game (see, e.g., Branzei et al. 2008 for a more detailed exposition of these concepts).

Formally speaking, we are thus interested in maps v from RP(X) to RX, and the Shapley value is one specific such map v given by

(v(g))(x)=1|X|!·MP(X):xM[(|M|-1)!·(|X|-|M|)!·(g(M)-g(M\{x}))]. 7

This map was originally proposed by Shapley (1953).

In a biological context, the players of Shapley’s game are species and from a rooted phylogenetic tree T on X with edge weights we obtain a game g by setting g(M)=PD(M) for each MP(X), where PD(M) is the phylogenetic diversity of M. The value PD(M) is defined as the total weight of those edges in T that lie on a path from the root to some species in M (Faith 1992). For example, for the rooted phylogenetic tree in Fig. 1a we obtain

PD({a,b,d})=3+2+4+3+2+1=15.

We now explain a way to generalize these considerations to cluster systems C on X. First we need to define the phylogenetic diversity of a subset of X relative to a weighted cluster system. Let ωL(C). Then the phylogenetic diversity of a subset M of X with respect to ω is defined as

PD(M)=PDω(M)=CC:MCω(C). 8

To further explore properties of the Shapley value in the context of our work, it will be convenient to consider the set

PD(C)={gRP(X):there existsωL(C)withg=PDω},

that is, the set of games in RP(X) for which there is some ωL(C) which gives rise to this game.

The following lemma states two key structural properties of the set PD(C) for any cluster system C on X. To prove this lemma, we define, for all CP(X), the game gC:P(X)R obtained by putting

gC(M)=1ifCM0ifCM=. 9

Lemma 4.1

Let C be a cluster system on X. Then PD(C) is a linear subspace of RP(X) that has dimension |C|.

Proof

In view of (8), PD(C) is the linear span of the games gC for CC defined in (9):

PDω(M)=CC:MCω(C)=CCω(C)·gC(M)

Thus, it suffices to show that the games gC, CC, are linearly independent. To see this, consider the square matrix A whose rows and columns are each in one-to-one correspondence with the elements of P(X)\{}. For all C,MP(X)\{} the entry of A in the row corresponding to C and the column corresponding to M is 1 if CM and is 0 otherwise. A is the so-called intersection matrix of P(X)\{} and it is known that A has full rank (see, e.g., Jukna 2011, p. 216). Thus, in particular, the rows corresponding to CC are linearly independent.

Now, as explained above, for a cluster system C on X, we restrict in (7) to games g=PD in PD(C). More specifically, we define the Shapley value relative to the cluster system C as the map SV:PD(C)RX obtained by putting

(SV(PD))(x)=1|X|!·MP(X):xM(|M|-1)!·(|X|-|M|)!·(PD(M)-PD(M\{x})) 10

for all PDPD(C) and all xX. Note that, in view of Lemma 4.1, PD(C) may be a proper subspace of RP(X) (i.e., the set of all games). As we will see below, any characterization of the Shapley value relative to a cluster system must take this into account (see also Dubey 1975 for a more general discussion of this aspect).

The sharp-eyed reader will have noticed that the Shapley value relative to a cluster system C is not a phylogenetic diversity index on C, as the latter is defined as a map from L(C) to RX. However, we can resolve this issue by slightly generalizing our cluster-based definition of phylogenetic diversity indices. Let L be a linear subspace of RP(X). Then we define a phylogenetic diversity index on L to be a map Φ:LRX. This encompasses then the Shapley value as a phylogenetic diversity index on L=PD(C) for all cluster systems C on X. Moreover, viewing L(C) as the linear subspace

L={ωRP(X):ω(C)=0for allCC},

it also encompasses phylogenetic diversity indices on C as defined in Sect. 2. In fact, we can say even more about these relationships, which we will return to in the next section.

For the remainder of this section, we focus on giving a characterization of the Shapley value relative to a cluster system. This will involve the following two properties. We say that a phylogenetic diversity index Φ on a linear subspace L of RP(X) satisfies Pareto efficiency if

  • (PE)

    xX(Φ(ω))(x)=ω(X) for all ωL.

Remark 4.2

The properties of completeness and Pareto efficiency are tightly linked. Let C be a cluster system on X and note that CCω(C)=PDω(X) holds for all ωL(C). Therefore, every complete phylogenetic diversity index Φ on  L(C) corresponds to a phylogenetic diversity index Φ on PD(C) that satisfies Pareto efficiency, where Φ is obtained such that the diagram in Fig. 6 commutes, that is,

Φ(PDω)=Φ(ω) 11

for all ωL(C).

Fig. 6.

Fig. 6

This diagram depicts the relationship between a phylogenetic diversity index Φ on L(C) for a cluster system C on X and a phylogenetic diversity index Φ on PD(C) as described by Eq. (11)

We say that a phylogenetic diversity index Φ on a linear subspace L of RP(X) satisfies group proportionality (cf. Haake et al. 2008) if

  • (GP)

    (Φ(a·gC))(x)=a|C|ifxC0ifxC,    for all CP(X)\{} and all aR

with gC the game as defined in (9). Note that a similar characterization to that given in the following theorem was established by Wicke and Steel (2020, Thm. 7) for the special case of cluster systems that form a hierarchy.

Theorem 4.3

Let C be a cluster system on X. The Shapley value is the unique phylogenetic diversity index on PD(C) that is additive and satisfies Pareto efficiency and group proportionality.

Proof

Assume that Φ is the Shapley value on PD(C). It is known (see, e.g., Aumann 1994) that Φ satisfies Pareto efficiency for all ωRP(X) and is additive for all ω1,ω2RP(X). Thus, these two properties hold, in particular, for all ω,ω1,ω2PD(C)RP(X).

To establish that Φ also satisfies group proportionality, consider xX, CC and aR. We calculate the value (Φ(a·gC))(x) using Formula (10) (similar calculations are used in the proofs of Haake et al. 2008, Thm. 4 and Coronado et al. 2018, Thm. 1):

If xC we have gC(M)-gC(M\{x})=0 for all MP(X), implying Φ(a·gC))(x)=0, as required. So assume that xC, put c=|C|, m=|M|, and put n=|X|. Then, in view of the fact that only MP(X) with MC={x} contribute to (Φ(a·gC))(x), we have

(Φ(a·gC))(x)=an!·m=1n-c+1(m-1)!·(n-m)!·n-cm-1=a·(n-c)!·(c-1)!n!·j=c-1n-1jc-1=a·(n-c)!·(c-1)!n!·nc=ac,

as required, where we used the formula for the sum along a diagonal in Pascal’s triangle to obtain the first equality in the second line.

Uniqueness now follows from the fact that, in view of the proof of Lemma 4.1, PD(C) is the linear span of {gC:CC}.

Interestingly, as shown by Fuchs and Jin (2015), the vector in RX that results from computing the Shapley value on the game PD obtained from an edge-weighted rooted phylogenetic tree always coincides with the vector that we obtain by computing the fair proportion index on the rooted phylogenetic tree. In fact, this is a particular instance of (11). The following corollary of Theorem 4.3 makes this more precise.

Corollary 4.4

Let C be a cluster system on X, Φ be the fair proportion index on L(C), and Φ be the Shapley value on PD(C). Then

Φ(ω)=Φ(CCω(C)·gC)=Φ(PDω)

holds for all ωL(C).

Proof

This follows immediately from the definition of the fair proportion index together with the fact that, by Theorem 4.3, the Shapley value is additive and satisfies group proportionality.

It is remarked in the discussion by Coronado et al. (2018) that Corollary 4.4 can also be derived using arguments based on so-called phylogenetic networks (for more on the connection between such networks and diversity indices see Sect. 8). Moreover, the fact that the Shapley value on PD(C) satisfies Pareto efficiency means that it apportions the phylogenetic diversity of X among the elements of X. In view of Corollary 4.4 this then also holds for the fair proportion index on L(C) and, in view of Remark 4.2, this corresponds to the fact that the fair proportion index is complete, as can be seen in the example in Fig. 3a:

PDω(X)=CCω(C)=25=xX(FP(ω))(x).

An Affine and Projective Framework for Phylogenetic Diversity Indices

As mentioned in the introduction, the notion of phylogenetic diversity indices has also been considered on unrooted phylogenetic trees (Haake et al. 2008; Wicke and Steel 2020) and, just as rooted phylogenetic trees can be encoded by a collection of clusters, unrooted phylogenetic trees on a set X of species can be encoded by a collection S of bipartitions, or splits, of X (see, e.g., Steel 2016, Ch. 2). In the area of phylogenetic combinatorics, the interplay between collections of clusters and collections of splits has been studied in terms of affine and projective models of clustering, respectively, in analogy with the interplay between affine and projective geometry in classical geometry (Dress 2012, p. 207; see also Dress 1997). One of the key ideas that we will exploit from this theory is that we can map a collection S of splits of X in a natural way to a cluster system C(S) on X (defined in (13) below) and, in this way, derive split-based indices from cluster-based indices. In this section, we will make this more precise, and illustrate the resulting framework using the fair proportion index and the Shapely value as examples.

First, we formally define the concepts mentioned above. A split S of X is a bipartition of X into two non-empty subsets A and B, that is, AB=X and AB=. We denote such a split as an unordered pair A|B=B|A. A split system S on X is a non-empty set of splits of X. By S(X) we denote the set of all splits of X and, for a split system SS(X), we denote by L(S) the set of all weightings λ:S(X)R with λ(S)=0 for all SS(X)\S. In addition, we denote by PD(S) the set of all weightings PD:P(X)R that can be written as

PD(M)=PDλ(M)=A|BS:AM,BMλ(A|B) 12

for some λL(S). The value PDλ(M) is usually called the phylogenetic diversity of M with respect to the weighting λ of the splits in S (see, e.g., Spillner et al. 2008).

Figure 7 gives an overview of the various spaces we shall consider and the maps between them. In addition to the maps already introduced in Fig. 6 in Sect. 4, we also consider, for split systems S on X, maps τ from L(S) to L(C) where C is the cluster system

C(S)=SSS 13

on X mentioned above. In particular, we are interested in maps τ for which various parts of the diagram in Fig. 7 commute.

Fig. 7.

Fig. 7

A diagram of the various maps we consider to study relationships between phylogenetic diversity indices. The left part of the diagram we have already seen in Fig. 6. In analogy to this, the right part of the diagram depicts phylogenetic diversity indices Ψ and Ψ on L(S) and PD(S), respectively, where S is a split system on X. Finally τ associates with each weighting λ of the splits in S a weighting ω=τ(λ) of the clusters in a cluster system C=C(S) that arises from S by (13)

As an illustration of this setup, we now revisit the relationship between the fair proportion index and the Shapely value. Let S be a split system on X. Then the Shapley value on PD(S) is defined as in (10). Equivalently, as shown by Haake et al. (2008) for trees and by Volkmann et al. (2014) for split systems in general, the Shapley value on PD(S) can also be computed as

(SV(PDλ))(x)=A|BS:xA|B||X|·|A|·λ(A|B). 14

for all λL(S) and all xX.

Now consider the map τ:L(S)L(C(S)) defined by putting, for λL(S),

(τ(λ))(A)=|B||X|·λ(A|B)and(τ(λ))(B)=|A||X|·λ(A|B) 15

for all A,BC(S) such that A|B is a split in S. For example, consider the split system S with weighting λ in Fig. 8a. Using Formula (14), we obtain SV(a)=112 in this example and we also have FP(a)=112 for the fair proportion index as defined in (4) applied to the cluster system C(S) with weighting ω=τ(λ). We conclude this section by showing that this is not a coincidence.

Fig. 8.

Fig. 8

a A split system S on X={a,b,c,d,e} with weighting λ. b The associated cluster system C(S) on X as defined in (13) and the weighting ω=τ(λ) as defined in (15)

Theorem 5.1

Let S be a split system on X, Φ be the fair proportion index on L(C(S)) and Ψ be the Shapley value on PD(S). If τ is as defined in (15), then

Φ(τ(λ))=Ψ(PDλ) 16

for all λL(S).

Proof

Let λL(S) and put ω=τ(λ). Since the maps Φ, τ, Ψ and PD are all linear, it suffices to show Eq. (16) for the case that one element of S, say S=A|B has weight 1 (i.e. λ(A|B)=1), whereas λ(S)=0 for all SS. Then we have ω(A)=|B|/|X|, ω(B)=|A|/|X|, and ω(C)=0 for all CC(S) with CA,B. Now let xX, and assume without loss of generality that xA. Then,

(Φ(ω))(x)=CC(S):xCω(C)|C|=ω(A)|A|=|B||X||A|=|B||X|·|A|.

On the other hand, in view of (14) we have

(Ψ(PDλ))(x)=A|BS:xA|B||X|·|A|λ(S)=|B||X|·|A|

as well. This completes the proof.

Complete Diversity Indices

In this section we shall consider Fig. 7 once again, considering an alternative definition for the map τ that can be used to translate, for any split system S on X, the property of completeness from a cluster-based index Φ on L(C(S)) to an associated split-based index Ψ=Ψτ(Φ) on L(S). In particular, we will see that this immediately implies the completeness of the fair proportion index on unrooted phylogenetic trees that was established by Wicke and Steel (2020) (for example, see Fig. 1b in the introduction). In addition, we illustrate the application of these considerations to a generalization of the so-called equal splits index that appears in Wicke and Steel (2020).

We begin by proving a result concerning completeness. Let S be a split system on X. A phylogenetic diversity index Ψ on L(S) is complete if

(C’)

xX(Ψ(λ))(x)=SSλ(S) holds for all λL(S).

Define the map τ:L(S)L(C(S)) by putting

(τ(λ))(C)=12·λ(C|(X-C)) 17

for all CC(S). The basic idea is to distribute the weight λ(S) of a split S=A|BS evenly on the two corresponding clusters A,BC(S). The following results, however, also hold if the weight is distributed non-evenly, that is, when putting (τ(λ))(A)=p·λ(A|B) and (τ(λ))(B)=(1-p)·λ(A|B) for some 0<p<1 with p12.

With the map τ defined in (17), we obtain, for a phylogenetic diversity index Φ on L(C(S)), the phylogenetic diversity index Ψ=Ψτ(Φ) on L(S) by putting Ψ(λ)=Φ(τ(λ)) for all λL(S).

Theorem 6.1

Let S be a split system on X and Φ a complete linear phylogenetic diversity index on L(C(S)). If τ is as defined in (17), then Ψτ(Φ) is a complete linear phylogenetic diversity index on L(S).

Proof

Let Φ be a complete linear phylogenetic diversity index on L(C(S)). We first show that Ψτ is linear. Let λ1,λ2L(S) and aR. Then, noting that τ is linear, we have

(Ψτ(Φ))(a·λ1+λ2)=Φ(τ(a·λ1+λ2))=Φ(a·τ(λ1)+τ(λ2))=a·Φ(τ(λ1))+Φ(τ(λ2))=a·(Ψτ(Φ))(λ1)+(Ψτ(Φ))(λ2),

as required.

It remains to show that Ψτ is complete. Let λL(S). Then we have

xX((Ψτ(Φ))(λ))(x)=xX(Φ(τ(λ)))(x)=CC(S)(τ(λ))(C)=CC(S)12·λ(C|X-C)=SSλ(S),

as required.

The following Corollary 6.2 includes, as a special case, the completeness of the fair proportion index on unrooted phylogenetic trees that was established by Wicke and Steel (2020, Thm. 10). To see this, it suffices to consider, for an unrooted phylogenetic tree on X, the split system consisting of those splits of X that can be obtained by removing an edge from the tree.

Corollary 6.2

Let S be a split system on X and Φ be the fair proportion index on L(C(S)). If τ is as defined in (17), then Ψτ(Φ) is a complete linear phylogenetic diversity index on L(S) and we have

((Ψτ(Φ))(λ))(x)=A|BS:xAλ(S)2·|A| 18

for all λL(S) and all xX.

Proof

In view of Lemma 2.1, Theorem 6.1 implies that Ψτ(Φ) is a complete linear phylogenetic diversity index on L(S). Moreover, (18) follows from (4), (13), and (17).

We now turn our attention to a generalization of the equal splits index, a phylogenetic diversity index that was introduced in the setting of rooted phylogenetic trees by Redding and Mooers (2006). We first define our generalization for cluster systems C on X. For all CC, let cl(C) denote the set of those xC that are not contained in any cluster Cch(C). Then put

mx(C)=0ifxC,1|ch(C)|+|cl(C)|ifxcl(C),Cch(C)mx(C)|ch(C)|+|cl(C)|otherwise, 19

for all xX and all CC. Note that mx(C)=1/|C| if xC and ch(C)= (as in this case |ch(C)|=0 and |cl(C)|=|C|). Also note that mx(C) is defined recursively. In particular, when computing mx(C) in the third case it is assumed that mx(C) for each CC with CC has been computed already. The equal splits index is then defined by putting

(ES(ω))(x)=CCmx(C)·ω(C) 20

for all ωL(C) and all xX. As an example, consider the cluster system C with weighting ω in Fig. 9a. For the cluster C=X we have ch(X)={{a,b,c},{c,d}} and cl(X)={e}, which yields, by (19), me(X)=13. The resulting values of the equal splits index are given in Fig. 9b.

Fig. 9.

Fig. 9

a A cluster system C on X={a,b,c,d,e} with weighting ω and the quantities mx(C) as defined for all xX and CC in (19). b The equal splits index ES(x) for all xX obtained from C and ω by (20)

The equal splits index on L(C) is linear with the corresponding |C|×|X|-matrix Γ in Eq. (5) having the entries γ(C,x)=mx(C). Moreover, as can be seen in the example in Fig. 9a, the sum of the entries in each row of Γ equals 1. The next theorem establishes that this is always the case.

Theorem 6.3

For all cluster systems C on X the equal splits index is a complete linear phylogenetic diversity index on L(C).

Proof

Let C be a cluster system on X. We already noted above that the equal splits index is linear. Thus, it remains to establish that the equal splits index is complete. More specifically, it suffices to show that

xXmx(C)=1 21

for all CC. To this end, put desc(C)=|Cch(C)C| for all CC, that is, desc(C) equals the total number of elements in X contained in the clusters in ch(C). We show (21) by induction on desc(C).

For the base case desc(C)=0 we have ch(C)= and thus cl(C)=C, implying |cl(C)|=|C| and

xXmx(C)=xCmx(C)+xX\Cmx(C)=|C|·1|C|+0=1,

as required.

Next assume desc(C)>0. By the definition of ch(C), we have desc(C)<desc(C) for all Cch(C). Thus,

xXmx(C)=xX\Cmx(C)+xcl(C)mx(C)+xC\cl(C)mx(C)=0+|cl(C)||ch(C)|+|cl(C)|+xC\cl(C)Cch(C)mx(C)|ch(C)|+|cl(C)|=|cl(C)||ch(C)|+|cl(C)|+Cch(C)xC\cl(C)mx(C)|ch(C)|+|cl(C)|=|cl(C)||ch(C)|+|cl(C)|+Cch(C)xXmx(C)|ch(C)|+|cl(C)|=|cl(C)||ch(C)|+|cl(C)|+|ch(C)|·1|ch(C)|+|cl(C)|=1,

where the equality in the fourth line holds in view of the fact that mx(C)=0 for all xX\(C\cl(C)) and for all Cch(C), and the equality in the fifth line holds by induction.

Our final result in this section, which is an immediate consequence of Theorem 6.1 and Theorem 6.3, summarizes how we obtain, via the map τ defined in (17), a complete linear split-based phylogenetic diversity index from the cluster-based equal splits index.

Corollary 6.4

Let S be a split system on X and Φ be the equal splits index on L(C(S)). If τ is as defined in (17), then Ψτ(Φ) is a complete linear phylogenetic diversity index on L(S).

We conclude this section coming back to the biological example in Fig. 2a and compute the phylogenetic diversity index Ψτ=Ψτ(Φ) from Corollary 6.4 for this example. From the split system S on X={a,f,h,m,r,s} with weighting λ given in Fig. 12 in the Appendix we first compute the cluster system C=C(S) on X with weighting ω=τ(λ) (in Fig. 13 in the Appendix we present the Hasse diagram for the 20 clusters in C, where the weight ω(C) obtained by (17) is given below each cluster C in the diagram). Then we compute the matrix Γ=ΓΦ for the equal splits index Φ on L(C) (see Fig. 14 in the Appendix) from which we obtain the values of the phylogenetic diversity index Ψτ given in Fig. 2b. For comparison purposes, we also compute the Shapley value SV as defined in (14) for the split system S on X with weighting λ using the program SplitsTreeCE (Huson and Bryant 2005).

Fig. 12.

Fig. 12

The split system S={S1,,S10} on X={a,f,h,m,r,s} and its weighting λ represented by the network in Fig. 2a. The elements in X represent sampling locations of spotted owls in western North America (a = Aguascalientes, Mexico; f = San Fransisco Peaks, AZ; h = Huachuca Mountains, AZ; m = Marin County, CA; r = Capitol Reef National Park, UT; s = San Bernardino Mountains, CA). The weighted split system is computed from the pairwise genetic distances between these populations given in Table S1 of Volkmann et al. (2014) using the implementation of NeighborNet (Bryant and Moulton 2004) in SplitsTree (Huson and Bryant 2005). The same methodology was applied by Volkmann et al. (2014) to all 32 populations in the data set. Here six populations are selected for illustration purposes. The total weight of all splits in S is 1.980 (all weights rounded to three decimal places)

Fig. 13.

Fig. 13

The Hasse diagram for the clusters in the cluster system C(S) on X={a,f,h,m,r,s} computed from the split system S on X in Fig. 12. The number below each cluster is the weight of the cluster obtained by the map τ defined in (17) from the weights of the splits in S

Fig. 14.

Fig. 14

The matrix Γ for the equal splits index on L(C) for the cluster system C in Fig. 13

As can be seen in Fig. 2b, the sum of the values of the index Ψτ yields the total weight 1.980 of all splits in S, as it should be for a complete phylogenetic diversity index. The ranking of the six populations given by SV is the same as the ranking given by FPu computed in the introduction. The ranking given by Ψτ slightly deviates from it but also ranks populations s and m at the top and population f at the bottom.

The fact that the network in Fig. 2a is not a tree implies that using a tree-based phylogenetic diversity index necessarily involves a (potentially arbitrary) decision which of the splits in S are used to compute the tree-based index. More formally, we would first need to restrict to some subset SS such that any two splits A|B,C|DS are compatible, that is, at least one of the intersections AC, AD, BC, and BD is empty. A collection of pairwise compatible splits is called a compatible split system. Intuitively, compatible split systems correspond to unrooted phylogenetic trees.

To illustrate that the choice of a compatible subset of S really has an impact on the ranking of the six owl populations, we consider two compatible subsets S1 and S2 that are maximal with respect to set inclusion. Using the index Ψτ, we obtain the ranking (from highest to lowest) smhraf based on S1 and the ranking smahrf based on S2 (for details see Figs. 15 and 16 in the Appendix). Clearly, these two rankings are different, and in fact they also differ from the ranking obtained when considering all splits in S (cf. Fig. 2b). Interestingly, in this example, the ranking given in Fig. 2b for FPu and SV does not change when restricting to any maximal compatible subset of S. This could be due to the fact that the trivial splits in S (i.e. splits A|B with |A|=1 or |B|=1) carry more weight than the non-trivial splits, and both FPu and SV are less heavily influenced by the non-trivial splits than Ψτ. In future work, it could be interesting to further investigate the differences in rankings obtained from these and other split-based diversity indices.

Fig. 15.

Fig. 15

The matrix Γ for the equal splits index on L(C1) for the cluster system C1 obtained from the subset S1={S1,S2,S3,S4,S5,S6,S7,S8} of the split system S in Fig. 12. Using the map τ defined in (17) to compute the weights of the clusters in C1 from the weights of the splits in S1, we obtain Ψτ(s)=0.450, Ψτ(m)=0.431, Ψτ(h)=0.380, Ψτ(r)=0.267, Ψτ(a)=0.202, and Ψτ(f)=0.156 (all values rounded to three decimal places)

Fig. 16.

Fig. 16

The matrix Γ for the equal splits index on L(C2) for the cluster system C2 obtained from the subset S2={S1,S2,S3,S4,S5,S9,S10} of the split system S in Fig. 12. Using the map τ defined in (17) to compute the weights of the clusters in C2 from the weights of the splits in S2, we obtain Ψτ(s)=0.388, Ψτ(m)=0.355, Ψτ(a)=0.296, Ψτ(h)=0.280, Ψτ(r)=0.220, and Ψτ(f)=0.164 (all values rounded to three decimal places)

A Generalization of the Pauplin Index

In Wicke and Steel 2020, (Sec. 5.2) a phylogenetic diversity index for unrooted phylogenetic trees is introduced that is related to the formula for the total weight of the edges given by Pauplin (2000). In this section, we describe how the viewpoint suggested by our framework leads to a new family of split-based phylogenetic diversity indices.

Let n=|X|3 and θ=x0,x1,,xn-1 be an ordering of the elements in X. Define the split system

Sθ={{xi,,xj}|X\{xi,,xj}:0ij<n-1}.

A split system S on X is circular if there exists an ordering θ of the elements in X such that SSθ. If S=Sθ for some ordering θ of the elements in X we call S a full circular split system on X. Clearly, a circular split system contains at most n2 splits. Moreover, every compatible split system is circular (Bandelt and Dress 1992). Volkmann et al. (2014) considered the Shapley value and another phylogenetic diversity index on weighted circular split systems (see also the more recent work by Abhari et al. (2024)).

Let X2 denote the set of all 2-element subsets of X. For a circular split system S on X we consider the |S|×n2-matrix M(S) whose rows correspond to the splits in S and whose columns correspond to the 2-element subsets in X2. The entry of M=M(S) corresponding to S=A|BS and {x,y}X2 is defined as

m(S,{x,y})=1if|{x,y}A|=|{x,y}B|=10otherwise.

The matrix M(S) describes how a weighting λL(S) gives rise to pairwise distances between the elements in X:

dλ({x,y})=SSλ(S)·m(S,{x,y}), 22

or, more compactly, dλ=λ·M, where λ is viewed as a row vector with |S| entries and dλ as a row vector with n2 entries. The rows of M(S) are linearly independent (Bandelt and Dress 1992). Thus, every weighting λ yields a distinct dλ.

Since the matrix M(S) has full rank, there exists an n2×|S|-matrix R that is right inverse to M(S), that is, M(S)·R yields the |S|×|S|-identity matrix. The matrix R need not be unique, however. Any such matrix corresponds to a linear estimator of a weighting of the splits in S from pairwise distances between the elements in X (see e.g. Pardi and Gascuel (2012) for a discussion of such estimators for unrooted phylogenetic trees). Fixing such a matrix R, we define a phylogenetic diversity index ΨR on L(S) by putting

(ΨR(λ))(x)=12·yX\{x}SSdλ({x,y})·r({x,y},S)

for all xX and all λL(S). In view of (22), we have ΨR(λ)=λ·ΓR for the |S|×|X|-matrix Γ=ΓΨR whose entries are

γ(S,x)=12·yX\{x}SSm(S,{x,y})·r({x,y},S).

Moreover, since R is right inverse to M(S), we have

xXγ(S,x)=12·xXyX\{x}SSm(S,{x,y})·r({x,y},S)={x,y}X2SSm(S,{x,y})·r({x,y},S)=1

for all SS. The next lemma summarizes these basic facts about ΨR.

Lemma 7.1

Let S be a circular split system on X with |X|3. Then, for every matrix R that is right inverse to the matrix M(S), ΨR is a complete linear phylogenetic diversity index on L(S).

Consider, as an example, the split system S={S1,,S5} on X={a,b,c,d}, for which the matrix M(S) is given in Fig. 10a. The split system S is compatible and, thus, circular. There are infinitely many matrices R that are right inverse to the matrix M(S) and they can be described by five parameters p1,,p5R as shown in Fig. 10b. The matrix ΓΨR for the resulting complete linear phylogenetic diversity index ΨR is given in Fig. 10c. This index has only a single parameter rR with r=12(p1++p5). If all entries in ΓΨR are required to be non-negative, we need to restrict this parameter to 0r14. The Pauplin index in Wicke and Steel (2020, Sec. 5.2) corresponds to r=18.

Fig. 10.

Fig. 10

a The matrix M(S) for the split system S={S1,,S5} on X={a,b,c,d}. b The matrices R that are right inverse to M(S). c The matrix ΓΨR for the phylogenetic diversity index ΨR on L(S)

We now present the main result of this section which focuses on full circular split systems.

Theorem 7.2

Let S be a full circular split system on X with S=Sθ for the ordering θ=x0,x1,,xn-1 of X. Then the matrix M(S) has a unique right inverse matrix R and the matrix Γ=ΓΨR for the complete linear phylogenetic diversity index ΨR on L(S) has the entry

graphic file with name 11538_2024_1332_Equ47_HTML.gif

for the split S={xi,,xj}|X\{xi,,xj}S and the element x=xkX.

Proof

Since S is a full circular split system, the matrix M=M(S) is a square matrix. Hence, M has a unique right inverse matrix R which is just the usual inverse matrix of M. Moreover, as shown by Chepoi and Fichet (1998), the matrix R has the entry

r({y,z},S)=12if{y,z}={a,(b+1)modn}or{y,z}={(a-1)modn,b}-12if{y,z}={a,b}or{y,z}={(a-1)modn,(b+1)modn}0otherwise 23

for {y,z}X2 and S={xa,,xb}|X\{xa,,xb}S.

Consider the split S={xi,,xj}|X\{xi,,xj}S and the element x=xkX. By symmetry, it suffices to consider the following three cases.

Case 1: i<k<j. Consider yX\{x} with m(S,{x,y})=1. Then, by the definition of the matrix M, we have yX\{xi,,xj}. This implies, in view of (23), that there exist precisely two splits SS with r({x,y},S)=12 and precisely two splits SS with r({x,y},S)=-12. Hence, we have

SSm(S,{x,y})·r({x,y},S)=0,

implying that γ(S,x)=12·0=0, as required.

Case 2: i=k<j. Consider again yX\{x} with m(S,{x,y})=1. If yx(i-1)modn, we have SSm(S,{x,y})·r({x,y},S)=0 using the same argument as in Case 1. Otherwise, there exist precisely two splits SS with r({x,y},S)=12 but only one split SS with r({x,y},S)=-12, implying

SSm(S,{x,y})·r({x,y},S)=12

and, thus, γ(S,x)=12·12=14, as required.

Case 3: i=k=j. Then we have m(S,{x,y})=1 for all yX\{x}. If y{x(k-1)modn,x(k+1)modn} we again have SSm(S,{x,y})·r({x,y},S)=0 by the argument used in Case 1. Otherwise, we have SSm(S,{x,y})·r({x,y},S)=12 by the argument used in Case 2, and, thus, γ(S,x)=12·(12+12)=12, as required.

While Theorem 7.2 focuses on full circular split systems, it also suggests two specific phylogenetic diversity indices Ψ1 and Ψ2 for any circular split system that is not full. Consider, as an example, again the split system S in Fig. 10a. We have SSθ for the ordering θ=a,b,c,d. To obtain the complete linear phylogenetic diversity index Ψ1 on L(S), we restrict the matrix Γ obtained for Sθ by Theorem 7.2 to those rows that correspond to splits in S. The resulting matrix ΓΨ1 is then the matrix in Fig. 10c with r=0. To obtain the complete linear phylogenetic diversity index Ψ2 on L(S) we restrict the matrix R obtained for Sθ by Theorem 7.2 to those columns that correspond to splits in S. The resulting matrix R is a matrix that is right inverse to M(S) and we put Ψ2=ΨR. More specifically, the matrix R equals the matrix in Fig. 10b with p1=p3=12 and p2=p4=p5=0 and the matrix ΓΨ2 is the matrix in Fig. 10c with r=12.

We conclude this section looking again at the example of owl populations in Fig. 2a. The corresponding split system S in Fig. 12 in the Appendix is circular with the ordering θ=a,f,r,h,s,m of the elements in X. In Fig. 11 we give the matrices ΓΨR obtained from the right inverse matrices R of M(S). They contain five parameters r1,,r5R. Only by putting r1=r4=14 and r2=r3=r5=0, however, all entries of ΓΨR are non-negative and then ΓΨR equals the restriction of the matrix Γ obtained for Sθ by Theorem 7.2 to those rows that correspond to splits in S. Using the weighting λ of the splits given in Fig. 12 in the Appendix, we obtain the values of ΨR given in Fig. 2b, which yields the same ranking of the six owl populations as FPu and SV.

Fig. 11.

Fig. 11

The matrices ΓΨ for the phylogenetic diversity indices Ψ=ΨR obtained from the matrices R that are right inverse to the matrix M(S) for the split system S on X={a,f,h,m,r,s} represented by the network in Fig. 2a. The splits S1,,S10 are listed in Fig. 12 in the Appendix

Conclusion

In this paper, we have presented a framework for phylogenetic diversity indices defined on linear spaces coming from weighted cluster and split systems. Using some examples of popular tree-based phylogenetic diversity indices from the literature, we have shown that this framework can yield generalizations of these indices for cluster and split systems as well as allowing us to gain a better understanding of their interrelationships.

Note that in our framework presented in Fig. 7, by associating to any split system S on X the cluster system C(S) on X and then considering maps τ, we have focused on deriving split-based indices from cluster-based indices. In the affine and projective clustering approach, however, there are also ways to associate to any cluster system C on X a split system S(C) on X (see, e.g., Dress 2012, Sec. 9.3). Thus, it could be interesting to investigate how this fact might be used to derive cluster-based indices from split-based indices.

In our results, we have considered cluster and split systems in general, special examples of which include hierarchical cluster systems, compatible split systems (which correspond to rooted and unrooted phylogenetic trees, respectively) and circular split systems. There are, however, various other special classes of cluster and split systems that could be interesting to investigate within our framework. For example, it would also be interesting to consider diversity indices coming from weak hierarchies, a special type of cluster system introduced by Bandelt and Dress (1989). The advantage of considering such specialized cluster and split systems is that they can be efficiently computed from biological data, making them potentially more useful for applications.

In the literature, various approaches have been proposed to generalize tree-based phylogenetic diversity indices using phylogenetic networks, a graph-theoretical generalization of phylogenetic trees (Coronado et al. 2018; Volkmann et al. 2014; Wicke and Fischer 2018). Such networks are essentially directed, acyclic, graphs with a single root and whose set of leaves corresponds to some collection of species. The fair proportion index, for example, is generalized in terms of such networks by Coronado et al. (2018). In general, phylogenetic networks give rise to cluster systems (see, e.g., Steel 2016, Sec. 10.3.4) by, for example, taking the set of leaves that lie below a vertex or an edge in the network (just as with rooted phylogenetic trees). Thus, it could be interesting to explore how phylogenetic diversity indices defined in terms of phylogenetic networks, such as, for example, those considered by Wicke and Fischer (2018), fit into our cluster based framework. This could also be interesting to investigate for split networks such as the one presented in Fig. 2a, which are a certain type of undirected phylogenetic network (see, e.g., Dress 2012, Sec. 4.4).

With the different ways of defining diversity indices via clusters and splits and translating between the two viewpoints, it could also be interesting to analyze under which circumstances different indices give exactly the same score and thus also identical rankings of the taxa. For example, Wicke and Steel (2020) characterized precisely when the fair proportion index and the equal splits index for rooted binary phylogenetic trees coincide. Thus, it would be interesting to characterize which conditions a weighted cluster system (resp. weighted split system) has to satisfy in order to obtain similar results for pairs of cluster- or split-based indices.

In another direction, it could be interesting to apply our framework to establish properties and generalizations of other tree-based phylogenetic diversity indices that we did not consider in this paper. Indeed, as we have demonstrated, sometimes expressing indices in terms of clusters or splits can lead to more concise proofs for showing that they have certain properties. For example, it would be interesting to consider some of the questions asked in Section 6 of Wicke and Steel (2020) within our new framework.

Finally, concerning the generalization of the Pauplin index presented in Sect. 7, we saw in the examples in Figs. 10 and 11 that even when a circular split system S is not full there may exist right inverse matrices R for M(S) such that for the complete linear phylogenetic diversity index ΨR the matrix ΓΨR has non-negative entries. Can we characterize when this is the case and, more specifically, give the number of parameters in the matrix ΓΨR? As a potential direction for further generalization, one could consider split systems S for which the matrix M(S) has full rank, which are known as linearly independent split systems. There exist such split systems that are not circular (Bryant and Dress 2007). Can Theorem 7.2 be generalized in some way to all maximum sized linearly independent split systems?

Acknowledgements

All authors thank Schloss Dagstuhl – Leibniz Centre for Informatics—for hosting the Seminar 19443 Algorithms and Complexity in Phylogenetics in October 2019, where this work was initiated. The authors also thank the two anonymous reviewers for their helpful comments on a previous version of this paper.

Appendix

See Figs. 12, 13, 14, 15 and 16.

Data availability

No data was generated.

Footnotes

3

In biological applications, weights are usually assumed to be non-negative. Our framework also applies when this assumption is violated.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. Abhari N, Colijn C, Mooers A, Tupper P. Capturing diversity: split systems and circular approximations for conservation. J Theor Biol. 2024;578:111689. doi: 10.1016/j.jtbi.2023.111689. [DOI] [PubMed] [Google Scholar]
  2. Aumann RJ (1994) The Shapley value. In: Game-theoretic methods in general equilibrium analysis. Springer, Berlin, pp 61–66
  3. Bandelt H-J, Dress AW. Weak hierarchies associated with similarity measures-an additive clustering technique. Bull Math Biol. 1989;51(1):133–166. doi: 10.1007/BF02458841. [DOI] [PubMed] [Google Scholar]
  4. Bandelt H-J, Dress A. A canonical decomposition theory for metrics on a finite set. Adv Math. 1992;92:47–105. doi: 10.1016/0001-8708(92)90061-O. [DOI] [Google Scholar]
  5. Bordewich M, Semple C. Quantifying the difference between phylogenetic diversity and diversity indices. J Math Biol. 2024;88(4):1–25. doi: 10.1007/s00285-024-02059-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Branzei R, Dimitrov D, Tijs S. Models in cooperative game theory. Berlin: Springer; 2008. [Google Scholar]
  7. Bryant D, Dress A. Linearly independent split systems. Eur J Comb. 2007;28(6):1814–1831. doi: 10.1016/j.ejc.2006.04.007. [DOI] [Google Scholar]
  8. Bryant D, Moulton V. Neighbor-net: an agglomerative method for the construction of phylogenetic networks. Mol Biol Evol. 2004;21(2):255–265. doi: 10.1093/molbev/msh018. [DOI] [PubMed] [Google Scholar]
  9. Chepoi V, Fichet B. A note on circular decomposable metrics. Geom Dedicata. 1998;69:237–240. doi: 10.1023/A:1004907919611. [DOI] [Google Scholar]
  10. Coronado TM, Riera G, Rosselló F (2018) The fair proportion is a Shapley value on phylogenetic networks too. In: Enjoying natural computing. Springer, Berlin, pp 77–87
  11. Dress A (1997) Towards a theory of holistic clustering. In: Mathematical hierarchies and biology. American Mathematical Society, Providence, pp 271–289
  12. Dress A. Basic phylogenetic combinatorics. Cambridge: Cambridge University Press; 2012. [Google Scholar]
  13. Dubey P. On the uniqueness of the Shapley value. Int J Game Theory. 1975;4:131–139. doi: 10.1007/BF01780630. [DOI] [Google Scholar]
  14. Faith DP. Conservation evaluation and phylogenetic diversity. Biol Cons. 1992;61(1):1–10. doi: 10.1016/0006-3207(92)91201-3. [DOI] [Google Scholar]
  15. Fuchs M, Jin EY. Equality of Shapley value and fair proportion index in phylogenetic trees. J Math Biol. 2015;71(5):1133–1147. doi: 10.1007/s00285-014-0853-0. [DOI] [PubMed] [Google Scholar]
  16. Gumbs R, Gray CL, Böhm M, Burfield IJ, Couchman OR, Faith DP, Forest F, Hoffmann M, Isaac NJB, Jetz W, Mace GM, Mooers AO, Safi K, Scott O, Steel M, Tucker CM, Pearse WD, Owen NR, Rosindell J. The EDGE2 protocol: advancing the prioritisation of evolutionarily distinct and globally endangered species for practical conservation action. PLoS Biol. 2023;21(2):e3001991. doi: 10.1371/journal.pbio.3001991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Haake C-J, Kashiwada A, Su FE. The Shapley value of phylogenetic trees. J Math Biol. 2008;56(4):479–497. doi: 10.1007/s00285-007-0126-2. [DOI] [PubMed] [Google Scholar]
  18. Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2005;23(2):254–267. doi: 10.1093/molbev/msj030. [DOI] [PubMed] [Google Scholar]
  19. Isaac NJ, Turvey ST, Collen B, Waterman C, Baillie JE. Mammals on the EDGE: conservation priorities based on threat and phylogeny. PLoS ONE. 2007;2(3):e296. doi: 10.1371/journal.pone.0000296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Jukna S. Extremal combinatorics: with applications in computer science. Berlin: Springer; 2011. [Google Scholar]
  21. Kleinman A, Harel M, Pachter L. Affine and projective tree metric theorems. Ann Comb. 2013;17:205–228. doi: 10.1007/s00026-012-0173-2. [DOI] [Google Scholar]
  22. Manson K. The robustness of phylogenetic diversity indices to extinctions. J Math Biol. 2024;89(1):5. doi: 10.1007/s00285-024-02098-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Manson K, Steel M. Spaces of phylogenetic diversity indices: combinatorial and geometric properties. Bull Math Biol. 2023;85:78. doi: 10.1007/s11538-023-01183-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Pardi F, Gascuel O. Combinatorics of distance-based tree inference. Proc Natl Acad Sci. 2012;109(41):16443–16448. doi: 10.1073/pnas.1118368109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Pauplin Y. Direct calculation of a tree length using a distance matrix. J Mol Evol. 2000;51:41–47. doi: 10.1007/s002390010065. [DOI] [PubMed] [Google Scholar]
  26. Redding DW (2003) Incorporating genetic distinctness and reserve occupancy into a conservation priorisation approach. Master’s thesis, University of East Anglia, Norwich, UK
  27. Redding DW, Mooers AØ. Incorporating evolutionary measures into conservation prioritization. Conserv Biol. 2006;20(6):1670–1678. doi: 10.1111/j.1523-1739.2006.00555.x. [DOI] [PubMed] [Google Scholar]
  28. Redding DW, Hartmann K, Mimoto A, Bokal D, DeVos M, Mooers A. Evolutionarily distinctive species often capture more phylogenetic diversity than expected. J Theor Biol. 2008;251(4):606–615. doi: 10.1016/j.jtbi.2007.12.006. [DOI] [PubMed] [Google Scholar]
  29. Redding DW, Mazel F, Mooers AØ. Measuring evolutionary isolation for conservation. PLoS ONE. 2014;9(12):e113490. doi: 10.1371/journal.pone.0113490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Semple C, Steel M. Phylogenetics. Oxford: Oxford University Press; 2003. [Google Scholar]
  31. Shapley LS. A value for n-person games. Contrib Theory Games. 1953;2(28):307–317. [Google Scholar]
  32. Spillner A, Nguyen BT, Moulton V. Computing phylogenetic diversity for split systems. IEEE/ACM Trans Comput Biol Bioinf. 2008;5(2):235–244. doi: 10.1109/TCBB.2007.70260. [DOI] [PubMed] [Google Scholar]
  33. Steel M. Phylogeny: discrete and random processes in evolution. Philadelphia: SIAM; 2016. [Google Scholar]
  34. Tucker CM, Cadotte MW, Carvalho SB, Davies TJ, Ferrier S, Fritz SA, Grenyer R, Helmus MR, Jin LS, Mooers AO, Pavoine S, Purschke O, Redding DW, Rosauer DF, Winter M, Mazel F. A guide to phylogenetic metrics for conservation, community ecology and macroecology. Biol Rev. 2016;92(2):698–715. doi: 10.1111/brv.12252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Vane-Wright R, Humphries C, Williams P. What to protect?—systematics and the agony of choice. Biol Cons. 1991;55(3):235–254. doi: 10.1016/0006-3207(91)90030-D. [DOI] [Google Scholar]
  36. Volkmann L, Martyn I, Moulton V, Spillner A, Mooers AO. Prioritizing populations for conservation using phylogenetic networks. PLoS ONE. 2014;9(2):e88945. doi: 10.1371/journal.pone.0088945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Wicke K (2020) Novel aspects of mathematical phylogenetics. Ph.D. thesis, Universität Greifswald
  38. Wicke K, Fischer M. Phylogenetic diversity and biodiversity indices on phylogenetic networks. Math Biosci. 2018;298:80–90. doi: 10.1016/j.mbs.2018.02.005. [DOI] [PubMed] [Google Scholar]
  39. Wicke K, Steel M. Combinatorial properties of phylogenetic diversity indices. J Math Biol. 2020;80(3):687–715. doi: 10.1007/s00285-019-01438-0. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

No data was generated.


Articles from Bulletin of Mathematical Biology are provided here courtesy of Springer

RESOURCES