Skip to main content
Springer logoLink to Springer
. 2025 Aug 22;87(9):134. doi: 10.1007/s11538-025-01509-y

Metaconcepts of Rooted Tree Balance

Mareike Fischer 1,✉,#, Tom Niklas Hamann 1,#, Kristina Wicke 2,#
PMCID: PMC12373716  PMID: 40844631

Abstract

Measures of tree balance play an important role in many different research areas such as mathematical phylogenetics or theoretical computer science. Typically, tree balance is quantified by a single number which is assigned to the tree by a balance or imbalance index, of which several exist in the literature. Most of these indices are based on structural aspects of tree shape, such as clade sizes or leaf depths. For instance, indices like the Sackin index, total cophenetic index, and s^-shape statistic all quantify tree balance through clade sizes, albeit with different definitions and properties. In this paper, we formalize the idea that many tree (im)balance indices are functions of similar underlying tree shape characteristics by introducing metaconcepts of tree balance. A metaconcept is a function Φf that depends on a function f capturing some aspect of tree shape, such as balance values, clade sizes, or leaf depths. These metaconcepts encompass existing indices but also provide new means of measuring tree balance. The versatility and generality of metaconcepts allow for the systematic study of entire families of (im)balance indices, providing deeper insights that extend beyond index-by-index analysis.

Keywords: Tree balance, Rooted tree, Sackin index, Colless index, Total cophenetic index

Introduction

The study of tree balance is an integral part of many different research areas. For example, tree balance is used in evolutionary biology and phylogenetics to study macroevolutionary processes such as speciation and extinction. Additionally, balanced trees are important in computer science, for instance, in the context of search trees (Andersson 1993; Knuth 1998).

In phylogenetics, the balance of a tree is usually quantified by so-called tree (im)balance indices. Intuitively, an (im)balance index is a function that assigns a single numerical value to a tree, assessing some aspect of its shape. The greater (smaller) the value, the more balanced the tree according to the respective (im)balance index. Over the last few decades, there has been a surge in the development of tree (im)balance indices, and numerous such indices are now available (for an overview and categorization see the recent survey by Fischer et al. (2023)). Despite the multitude of different (im)balance indices available, many of them employ similar underlying tree shape characteristics such as balance values, clade sizes, or leaf depths, albeit with different definitions and properties.

The main goal of this paper is to formalize the idea that many (im)balance indices for rooted trees are functions of similar underlying tree shape characteristics by introducing metaconcepts of tree balance. Here, a metaconcept is a function Φf that depends on a function f capturing some aspect of tree shape, such as balance values, clade sizes, or leaf depths. This idea is inspired by a paper on unrooted trees: Fischer and Liebscher (2021) analyzed the balance of unrooted trees and introduced a measure Φf, which can be regarded as a metaconcept for unrooted tree balance. Choosing Φf to be the sum function and f to be a function of split sizes, the authors showed that this metaconcept leads to a family of functions suitable for measuring unrooted tree imbalance if f is strictly increasing. Furthermore, very recently, Cleary et al. (2025) essentially used the idea of a metaconcept based on clade sizes to show that a wide range of clade-size based measures satisfying concavity and monotonicity conditions are minimized by the so-called complete or greedy from the bottom tree (Coronado et al. 2020; Fill 1996) and maximized by the so-called caterpillar tree (both trees are formally defined below).

In this paper, we provide a formal definition of a metaconcept for rooted trees. We then specialize this metaconcept to three classes of metaconcepts suitable to measure tree (im)balance. These metaconcepts are based on certain sequences that can be associated with rooted trees, namely the clade size sequence, the leaf depth sequence, and, in the case of binary trees, additionally the balance value sequence. We rigorously study all metaconcepts, characterize which choices of the function f lead to (im)balance indices, analyze extremal trees and values for the metaconcepts, and investigate further desirable properties such as locality and recursiveness.

Cleary et al. (2025) proved that the clade size metaconcept yields an imbalance index for binary trees if f is strictly increasing and strictly concave. We extend this result to include functions that are strictly increasing and either strictly convex or affine. Moreover, the clade size metaconcept also defines an imbalance index for arbitrary trees if f is additionally either 2-positive, i.e., f(x)>0 for all x2, in the concave and convex cases, or, in the case of affine functions, if f has a non-negative intercept. Further, the leaf depth metaconcept yields an imbalance index for both arbitrary and binary trees if f is strictly increasing and either convex or affine. Finally, the balance value metaconcept defines an imbalance index for binary trees for all strictly increasing functions f without additional constraints.

To help users identify which imbalance index derived from a metaconcept best suits their specific aims, we provide four decision trees (Figures 3 and 4). These decision trees are based on three key properties of the index: (1) whether it applies to binary trees or to arbitrary trees; (2) the underlying structural aspect of the tree it captures (such as balance values, clade sizes, or leaf depths); and (3) the set of binary minimizing trees. For the third criterion, we give four possible options to choose from (cf. Figure 5). Additionally, we provide code for computing the metaconcepts using the R packages treebalance (Fischer et al. 2023) and ape (Paradis and Schliep 2019).

Fig. 3.

Fig. 3

In these decision trees, one first selects the binary minimizing tree(s), followed by the class of metaconcepts from which the resulting (binary) imbalance index should be derived. The leaves of the trees are labeled with the corresponding metaconcept that satisfies the previously chosen properties, assuming the function f is chosen as indicated

Fig. 4.

Fig. 4

In these decision trees, one first selects the metaconcept from which the resulting (binary) imbalance index should be derived, and then chooses the corresponding binary minimizing tree(s). The leaves are labeled with the condition that the function f must satisfy to ensure the previously chosen properties of the resulting (binary) imbalance index

Fig. 5.

Fig. 5

Depicted are all trees that minimize the Sackin index on BT12, including the gfb-tree and the mb-tree, i.e., argminTBT12S(T)=T12gfb,T12mb,T1,T2,T3. Among these, all trees except T3 also minimize the Colless index on BT12, i.e., argminTBT12C(T)=T12gfb,T12mb,T1,T2

We remark that our metaconcepts encompass various existing tree imbalance indices such as the Sackin and Colless indices, and we highlight these connections in the course of the paper. The power of our metaconcepts is that they are naturally more general and versatile than individual indices. Next to leading to new (im)balance indices, the metaconcepts thus also provide a new framework to study the properties (such as extremal trees and values) of whole families of existing imbalance indices holistically, rather than on an individual index basis.

The present manuscript is organized as follows: In Section 2, we present all definitions and notations needed throughout this manuscript and summarize some known results. Section 3 then contains all our results: In Section 3.1, we establish some general results on the underlying tree shape sequences employed in this paper. Section 3.2 then discusses the resulting metaconcepts and their properties in depth. We start with the balance value metaconcept (Section 3.2.1), then turn to the clade size metaconcept (Section 3.2.2), and finally consider the leaf depth metaconcept (Section 3.2.3). In Section 3.2.4, we analyze the locality and recursiveness of all three metaconcepts. We conclude our manuscript with a brief discussion and highlight some directions for future research in Section 4.

Preliminaries

In this section, we introduce all concepts relevant for the present manuscript. We start with some general definitions. We mainly follow the notation of Fischer et al. (2023).

Definitions and Notation

Rooted trees A rooted tree (or simply tree) is a directed graph T=(V(T),E(T)), with vertex set V(T) and edge set E(T), containing precisely one vertex of in-degree zero, the root (denoted by ρ), such that for every vV(T) there exists a unique path from ρ to v and such that there are no vertices with out-degree one. We use VL(T)V(T) to refer to the leaf set of T (i.e., VL(T)={vV(T):out-degree(v)=0}), and we use V˚(T) to denote the set of inner vertices of T (i.e., V˚(T)=V(T)\VL(T)). Moreover, we use n to denote the number of leaves of T, i.e., n=|VL(T)|. Note that ρV˚(T) if n2. If n=1, T consists of only one vertex, which is at the same time the root and its only leaf.

A rooted tree is called binary if all inner vertices have out-degree two, and for every nN1, we denote by BTn the set of (isomorphism classes of) rooted binary trees with n leaves and by Tn the set of (isomorphism classes of) rooted trees with n leaves. We often call a tree TTn an arbitrary tree, but remark that arbitrary trees are also sometimes referred to as non-binary trees in the literature (even though binary trees are also contained in the set of arbitrary trees).

Depth and height The depth δT(v) (or δv for brevity) of a vertex vV(T) is the number of edges on the path from ρ to v, and the height h(T) of T is the maximum depth of any leaf, i.e., h(T)=maxxVL(T)δT(x).

Ancestors, descendants, and (attaching) cherries Let u,vV(T) be vertices of T. Whenever there exists a path from u to v in T, we say that u is an ancestor of v and v is a descendant of u. Note that this implies that each vertex is an ancestor and a descendant of itself. If u and v are connected by an edge, i.e., if (u,v)E(T), we also say that u is the parent of v and v is a child of u. The lowest common ancestor LCAT(u,v) of two vertices u,vV(T) is the unique common ancestor of u and v that is a descendant of every other common ancestor of them. Moreover, two leaves x,yVL(T) are said to form a cherry, if they have the same parent, which is then also called a cherry parent. Finally, by attaching a cherry to a tree T to obtain a tree T, we mean replacing a leaf xVL(T) by a cherry. Notice that T has one more leaf than T.

(Maximal) pending subtrees, clade sizes, and standard decomposition Given a tree T and a vertex vV(T), we denote by Tv the pending subtree of T rooted in v and use nT(v) (or nv for brevity) to denote the number of leaves in Tv, also called the clade size of v. We will often decompose a rooted tree T on n2 leaves into its maximal pending subtrees rooted in the children of ρ. We denote this decomposition as T=(Tv1,,Tvk), where v1,,vk are the children of the root in T, and refer to it as the standard decomposition of T. If T is binary, we have k=2, and thus T=(Tv1,Tv2). Throughout, subtree will always refer to a pending subtree.

Balance values, (perfectly) balanced vertices, and cophenetic values Now let T be a rooted binary tree and let vV˚(T) be an inner vertex of T with children v1 and v2. The balance value bT(v) (or bv for brevity) of v is defined as bT(v):=|nv1-nv2|. An inner vertex v is called balanced if it fulfills bT(v)1 and perfectly balanced if bT(v)=0. Now, let x,yVL(T) be two leaves of a tree TTn. Then, the cophenetic value of x and y is defined as φT(x,y):=δTLCAT(x,y), i.e., it is the depth of their lowest common ancestor. For an example of these definitions, cf. Fig. 1.

Fig. 1.

Fig. 1

Rooted binary tree T with eight leaves and root ρ. The vertices ρ, u, and v are ancestors of v. The parent of v is u and v is one of two children of u. The descendants of v are v, z, w, x, and y. The lowest common ancestor of x and z is LCAT(x,z)=v. The leaves x and y form a cherry whose parent is w. The pending subtree of u is Tu, which is also one of the two maximal pending subtrees of T. It has five leaves and thus nT(u)=5, i.e., the clade size of u is five. The balance value of u and v is one, i.e., bT(u)=bT(v)=1, hence u and v are balanced. The vertex w is balanced, too, and further, it is perfectly balanced, because bT(w)=0. The root ρ is not balanced as bT(ρ)=2

Important (families of) trees Next, we introduce some specific families of trees that will be important throughout this manuscript (see Figure 2 for examples).

Fig. 2.

Fig. 2

Examples of the special trees considered throughout this manuscript

First, the maximally balanced tree (or mb-tree for brevity), denoted by Tnmb, is the rooted binary tree with n leaves in which all inner vertices are balanced. Recursively, a rooted binary tree with n2 leaves is maximally balanced if its root is balanced and its two maximal pending subtrees are maximally balanced.

Second, the greedy from the bottom tree (or gfb-tree for brevity), denoted by Tngfb, is the rooted binary tree with n leaves that results from greedily clustering trees of minimal leaf numbers, starting with n single vertices and proceeding until only one tree is left as described by Coronado et al. (2020), Algorithm 2.

Third, the fully balanced tree of height h (or fb-tree for brevity), denoted by Thfb is the rooted binary tree with n=2h leaves with hN0, in which all leaves have depth precisely h. Note that for h1, we have Thfb=Th-1fb,Th-1fb. Moreover, for hN0, Thfb=Thmb=Thgfb.

Fourth, the caterpillar tree (or simply caterpillar), denoted by Tncat, is the rooted binary tree with n leaves that fulfills either n=1, or n2 and additionally has exactly one cherry.

Finally, the star tree, denoted by Tnstar, is the rooted tree with n leaves that either satisfies n=1, or n2 and additionally has a single inner vertex (the root), which is adjacent to all leaves.

Notice that all trees introduced above are unique (up to isomorphism) and have the property that all their pending subtrees are (smaller) trees of the same type. Moreover, we remark that the caterpillar is generally regarded as the most unbalanced (binary) tree, whereas the fully balanced tree is considered the most balanced binary tree when it exists, i.e., for leaf numbers that are powers of two. For other leaf numbers, both the maximally balanced tree and the greedy from the bottom tree are often regarded as the most balanced binary trees, whereas the star tree is usually considered to be the most balanced arbitrary tree.

Imbalance index, locality and recursiveness We next introduce the concept of a tree imbalance index. First, following Fischer et al. (2023), a (binary) tree shape statistic is a function t:Tn(BTn)R that depends only on the shape of T but not on the labeling of vertices or the length of edges. Based on this, a tree imbalance index is defined as follows:

Definition 2.1

((Binary) imbalance index (Fischer et al. (2023))) A (binary) tree shape statistic t is called an imbalance index if and only if

  • (i)

    the caterpillar Tncat is the unique tree maximizing t on its domain Tn(BTn) for all n1,

  • (ii)

    the fully balanced tree Thfb is the unique tree minimizing t on BTn for all n=2h with hN0.

If the domain of t is BTn, we often call t a binary imbalance index to highlight this fact.

Given two trees, say T,TTn(BTn), and an imbalance index, say t, we say that T is more balanced than T (with respect to t) if t(T)<t(T). More generally, when we say that a tree T minimizes an imbalance index, we mean that it minimizes it among all trees with the same leaf number as T. Analogously, when we compare a tree T to a family of trees (such as the ones defined above), we always compare it to the family’s representative that has the same number of leaves.

We note that in addition to imbalance indices, balance indices also exist. A balance index is minimized by the caterpillar tree and maximized by the fb-tree. Since a balance index can be obtained from an imbalance index (and vice versa) by multiplying by -1, and given that the majority of known indices are formulated as measures of imbalance, we focus exclusively on imbalance indices in this work.

Further, two imbalance indices φ1 and φ2 are considered equivalent, if for all trees T1,T2Tn(BTn), the following holds: φ1(T1)<φ1(T2)φ2(T1)<φ2(T2). In other words, equivalence means that φ1 and φ2 rank trees in the same order from most balanced to least balanced.

We next turn to two desirable properties of imbalance indices, namely locality and recursiveness.

Definition 2.2

(Locality (Mir et al. (2013); Fischer et al. (2023))) Let TTn(BTn) be a tree and let v be a vertex of T. Further, let T be obtained from T by replacing the subtree Tv rooted in v by a (binary) tree Tv with the same leaf number and also rooted in v. An imbalance index t is called local if it fulfills

t(T)-t(T)=t(Tv)-tTvfor allvV(T).

In other words, if t is local and two trees T and T differ only in a pending subtree, then the differences of their t-values is equal to the differences of the subtrees’ t-values.

Next, we introduce the recursiveness of a tree shape statistic.

Definition 2.3

(Recursiveness (based on Fischer et al. (2023))) A recursive tree shape statistic of length xN1 is an ordered pair (λ,r), where λRx and r is an x-vector of symmetric functions each mapping a multiset of x-vectors to R. In this definition, x is the number of recursions that are used to calculate the index, the vector λ contains the start value for each of the x recursions, i.e., the values of TT1 if n=1, and the vector r contains the recursions themselves. In particular, ri(T)=λi for n=1, and for T=(T1,,Tk), recursion ri operates on k vectors of length x, namely (r1(T1),,rx(T1)),,(r1(Tk),,rx(Tk)), each representing one of the maximal pending subtrees T1,,Tk and containing their respective values. The recursions are symmetrical functions, i.e., the order of those k vectors is permutable, because we are solely considering unordered trees. If only binary trees are considered, i.e., k=2 for every pending subtree, we use the term binary recursive tree shape statistic.

In the following, we introduce our main concepts: the definition of a general metaconcept, three tree shape sequences, and three classes of metaconcepts – each based on one of these sequences. We begin by defining the sequences.

Balance value sequence, clade size sequence, and leaf depth sequence First, the balance value sequence of a binary tree TBTn is the list of balance values of all its inner vertices, arranged in ascending order. We denote this sequence by BT:=(b1,,bn-1). The i-th entry of B(T) is denoted by BTi. Note that for any TBTn, the length of B(T) is n-1. Also note that B(T)=(0,,0) if and only if T=Thfb (for a formal argument, see Coronado et al. 2020, Corollary 1).

Second, the clade size sequence of a tree TTn is the list of clade sizes of all its inner vertices, arranged in ascending order. We denote this sequence by N(T):=(n1,,n|V˚(T)|), where N(T)i represents the i-th entry of N(T). The length of the clade size sequence for a tree with n2 leaves can range from 1 to n-1. Specifically, the sequence has length 1 if and only if T is a star tree, and it has length n-1 if and only if T is binary.

Third, the leaf depth sequence of a tree TTn is the list of leaf depths of all its leaves, arranged in ascending order. We denote this sequence by Δ(T):=(δ1,,δn), where Δ(T)i represents the i-th entry of Δ(T). Unlike the clade size sequence, the leaf depth sequence has always length n, regardless of whether the tree is binary.

Balance value metaconcept, clade size metaconcept, and leaf depth metaconcept Next, we define the general metaconcept. In a second step, we derive three classes of metaconcepts from this definition, each based on one of the previously introduced sequences. Let TTTn be a tree, and let Seq(T) be a vertex value sequence on a subset VV(T), i.e., a sequence that assigns each vertex vV a value sv derived from v. Assume that Seq(T) is sorted in ascending order, and let Seq(T)i denote its i-th entry.

Furthermore, let ωN1, c=minTTSeq(T)1, and f:Rc×Rω-1R be a function that depends on an entry of Seq(T) and ω-1 additional values o1(T),,oω-1(T), such as the number of inner vertices, i.e., oi(T)=|V˚(T)|, or the number of leaves of T, i.e., oi(T)=n. Then,

ΦfSeq(T):=sSeq(T)f(s,o1(T),,oω-1(T))

is called the imbalance index metaconcept of order ω. Clearly,

ΦfSeq(T)=vVf(sv,o1(T),,oω-1(T))=i=1|V|f(Seq(T)i,o1(T),,oω-1(T)).

Examples for known balance indices and their interpretations in the framework of metaconcepts of order ω can be found in Tables 1 and 2.

Table 1.

Definitions of imbalance indices that are applicable to arbitrary trees, i.e., those with domain Tn (notice that the s^-shape statistic is applicable to arbitrary trees, but is only an imbalance index on BTn). It is straightforward to see that these imbalance indices are induced by the clade size metaconcept ΦfN and the leaf depth metaconcept ΦfΔ, respectively, when the function f is chosen as specified in the two rightmost columns. The Sackin index and the s^-shape statistic are induced by the first-order metaconcept with id as the identity function. In contrast, the average leaf depth is induced by the second-order metaconcept. Moreover, the total cophenetic index is induced on BTn by the second-order and on Tn by the third-order metaconcept. This is because for binary trees we have |V˚(T)|=n-1, so no further additional value than n is needed. For further details on the total cophenetic index, see Remark 3.14

imbalance index definition CSM ΦfN LDM ΦfΔ
Sackin index Sackin (1972); Shao and Sokal (1990); Blum and François (2005); Fischer (2021) S(T):=xVL(T)δT(x) S(T)=vV˚(T)nT(v) id id
Average leaf depth Kirkpatrick and Slatkin (1993) N¯(T):=1nxVL(T)δT(x) N¯(T)=1n·S(T) fN¯(nv,n)=1n·nv fN¯(δ,n)=1n·δ
s^-shape statistic Blum and François (2006) s^(T):=vV˚(T)lognT(v)-1 fs^(nv)=log(nv-1)
Total cophenetic index Mir et al. (2013) Φ(T):=(x,y)VL(T)2xyφT(x,y) Φ(T)=vV˚(T)\ρnT(v)2 fΦ(nv,n,|V˚(T)|)=nv2-n2|V˚(T)|
Table 2.

Definitions of binary imbalance indices, which are only applicable to binary trees, i.e., those with domain BTn. It is straightforward to see that these binary imbalance indices are induced by the balance value metaconcept ΦfB when the function f is chosen as specified in the right column. The (quadratic) Colless index is induced by the first-order metaconcept where id is the identity function, while the corrected Colless index is induced by the second-order metaconcept.

binary imbalance index definition BVM ΦfB
Colless index Colless (1982); Shao and Sokal (1990) CT:=vV˚TbT(v) id
Corrected Colless index Heard (1992) ICT:=2(n-1)(n-2)·CT; ICT:=0 for n=1,2 fIC(b,n)=2(n-1)(n-2)·b
Quadratic Colless index Bartoszek et al. (2021) QCT:=vV˚TbT(v)2 fQC(b)=b2

We now specialize the general metaconcept to three subclasses. First, let TBTn be a binary tree, and let Seq(T)=B(T) be its balance value sequence. Then, the balance value metaconcept (BVM) of order ω is defined as

ΦfBT:=bBTf(b,o1(T),,oω-1(T)).

Second, if TTn is a rooted tree, and Seq(T){N(T),Δ(T)}, we obtain the clade size metaconcept (CSM) of order ω defined as

ΦfN(T):=nvN(T)f(nv,o1(T),,oω-1(T))

and the leaf depth metaconcept (LDM) of order ω defined as

ΦfΔ(T):=δΔ(T)f(δ,o1(T),,oω-1(T)).

Note that for n2 the minimal balance value is 0, the minimal clade size is 2, and the minimal leaf depth is 1. Hence, the value c in the definition of the metaconcept equals the respective value.

Note that the clade size metaconcept of order 1, when applied with a strictly increasing and strictly concave function f, corresponds to the function Φf in Cleary et al. (2025). Recall that a concave function satisfies f(λx+(1-λ)y)λf(x)+(1-λ)f(y) for all λ(0,1) and all x,yR and xy (and in case of strict concavity, the inequality is also strict). When choosing λ=12 and y=x+2, this yields the inequality 2·f(x)f(x-1)+f(x+1) and hence f(x)-f(x-1)f(x+1)-f(x), i.e., the increments decrease (where again, the inequalities are strict in case of strict concavity). Finally, we will sometimes use the fact that a differentiable function f is (strictly) concave on an interval if and only if its derivative function f is (strictly) decreasing on that interval. Conversely, a (strictly) convex function has (strictly) increasing increments, meaning the inequalities are reversed. Moreover, we call a function f:R0R 2-positive, if f(x)>0 for all x2 and non-negative if f(x)0 for all x0. In the case of an affine function f(x)=m·x+a, we refer to m as the slope and a as the intercept.

Known Imbalance Indices

Tables 1 and 2 define various known imbalance indices. Note that the choice of logarithm base is arbitrary. Additionally, we follow the conventions that 00=0 and that a sum over an empty set equals zero. Note that Fischer et al. (2023) demonstrated that all functions listed in Table 1 satisfy the definition of an imbalance index on Tn, except for the s^-shape statistic, which is only a binary imbalance index. Moreover, all functions listed in Table 2 are binary imbalance indices, too.

Known Results

Before presenting our new results, we first recall some previously established findings. We summarize key results concerning special trees as well as known imbalance indices.

Proposition 2.4

(Coronado et al. (2020), Theorem 1 and Proposition 6) Let nN1. The mb-tree Tnmb and the gfb-tree Tngfb minimize the Colless index on BTn.

Remark 2.5

Notice that for most leaf numbers n, there are trees distinct from the mb-tree Tnmb and the gfb-tree Tngfb that also minimize the Colless index. However, all binary trees with n leaves minimizing the Colless index have been completely characterized by Coronado et al. (2020), Proposition 1 and Proposition 3.

Lemma 2.6

(Fischer (2021), Theorem 2) Let TTn with Inline graphic. Then, T minimizes the Sackin index on BTn if and only if T=Tnfb or T employs precisely two leaf depths, namely hn-1 and hn. Moreover, in this case, S(T)=-2hn+n·(hn+1), which equals hn·2hn if n=2hn.

Remark 2.7

Note that trees with n leaves minimizing the Sackin index as characterized in the lemma above are precisely those trees that can be constructed from the fb-tree of height hn-1 by attaching n-2hn-1 cherries to its leaves. In particular, the gfb-tree and the mb-tree can be constructed in this way. This follows from the fact that the gfb-tree and the mb-tree minimize the Colless index on BTn (Proposition 2.4) and the fact that all trees minimizing the Colless index also minimize the Sackin index on BTn (Coronado et al. (2020), Proposition 9). Note that to construct the gfb-tree, one has to attach the cherries to the fb-tree from left to right (or vice versa) (Cleary et al. (2025), Lemma 4.17).

Proposition 2.8

(Coronado et al. (2020), Theorem 3) For every nN1, let Inline graphic. Then,

cn:=i=1hn-12i·s(2-i·n)

is the minimum value of the Colless index on BTn, where s(x) is the distance from xR to its nearest integer, i.e., s=minzZ|x-z|.

Lemma 2.9

(Fischer (2021), Theorem 1) The caterpillar uniquely maximizes the Sackin index on BTn, and we have STncat=n·(n+1)2-1.

Proposition 2.10

(adapted from Cleary et al. (2025), Corollary 4.4) Let f be strictly increasing and strictly concave. Then, the clade size metaconcept ΦfN is a binary imbalance index. Moreover, the gfb-tree Tngfb uniquely minimizes the clade size metaconcept on BTn.

Proposition 2.11

(adapted from Cleary et al. (2025), Theorem 4.3) Let f be strictly increasing. Then, the caterpillar Tncat uniquely maximizes the clade size metaconcept ΦfN on BTn.

Finally, we recall a result from Cleary et al. (2025) regarding the number of subtrees of the gfb-tree Tngfb for all possible subtree sizes.

Theorem 2.12

(Cleary et al. (2025), Theorem 4.12) Let n1, and let gfbn(i) denote the number of subtrees of Tngfb of size i for i=1,,n. Let Inline graphic. Then, we have:

gfbn(i)=niifi=2hiand if(nmodi)=0or(nmodi)2hi-1,ni-1ifi=2hiand if0<(nmodi)<2hi-1,1ifi2hiand(n-i)mod2hi-1=0,0ifi2hiand(n-i)mod2hi-1>0.

We are now in the position to state our new results.

Results

This section is divided into two subsections. The first subsection examines the underlying tree shape sequences of the metaconcepts, highlighting their differences and similarities. The second subsection analyzes each metaconcept in terms of its minimizing and maximizing trees, as well as its minimum and maximum values. Finally, we investigate their locality and recursiveness.

Tree Shape Sequences

In this subsection, we analyze and compare three sequences derived from a (binary) tree: the balance value sequence, the clade size sequence, and the leaf depth sequence. Each of these sequences serves as the foundation for a specific metaconcept.

A shared property of the three sequences associated with a rooted tree is that they can be computed recursively. We will exploit this property to analyze the recursiveness of our metaconcepts.

To formalize the recursive structure of these sequences, we introduce an operator that allows us to merge two sequences while preserving ascending order. Let Seq1 and Seq2 be two sequences of lengths n1 and n2, respectively. We define their ordered union as Seq1Seq2:=Seq, where Seq is a sequence of length n1+n2 containing all elements of Seq1 and Seq2 arranged in ascending order. For example, (1,4,5,13)(2,2,4,7,8)=(1,2,2,4,4,5,7,8,13). Additionally, for aN, we define Seq1+a as the sequence obtained by increasing each element of Seq1 by a. For example, (1,4,5,13)+1=(2,5,6,14).

Remark 3.1

Let TBTn be a binary tree with standard decomposition T=T1,T2 such that T1 and T2 have n1 and n2 leaves, respectively. Notice that all inner vertices of a maximal pending subtree Ti have the same balance value in Ti as in T and the root of T has balance value n1-n2. Hence,

BT=B(T1)B(T2)(|n1-n2|).

Now, let TTn be an arbitrary tree with standard decomposition T=(T1,,Tk) such that the maximal pending subtree Ti has ni leaves. Note that all inner vertices of a maximal pending subtree Ti have the same clade size in Ti as in T and the root of T has clade size n1++nk=n. Also note that the depth of a leaf in a maximal pending subtree Ti is one less than in T. Thus,

N(T)=N(T1)N(Tk)(n1++nk)=n

and

Δ(T)=Δ(T1)Δ(Tk)+1.

Another shared property of these sequences is that none of them uniquely characterize a (binary) tree. That is, two non-isomorphic (binary) trees can have the same sequence (see Figures 10 and 11 in Appendix A). Moreover, examples exist where two distinct binary trees with n4 leaves have the same/different B and/or the same/different N and/or the same/different Δ. A (unique) minimal example in terms of the leaf number n for each possible pair of sequences is given in Figures 10, 11, 12, 13 and 14 in Appendix A. For an overview, see also Table 3, which indicates the corresponding figures for each case.

Fig. 10.

Fig. 10

Unique minimal example of two binary trees having the same Δ but different B and N, respectively. Specifically, n=6, BT6mb=(0,0,0,1,1)(0,0,0,0,2)=BT6gfb, NT6mb=(2,2,3,3,6)(2,2,2,4,6)=NT6gfb, and ΔT6mb=ΔT6gfb=(2,2,3,3,3,3)

Fig. 11.

Fig. 11

Unique minimal example of two binary trees having the same N and the same B. Note, however, that Δ is not the same. Specifically, n=9, B(T1)=B(T2)=(0,0,0,0,1,1,2,3), N(T1)=N(T2)=(2,2,2,3,4,4,5,9), and Δ(T1)=(2,2,3,4,4,4,4,4,4)(2,3,3,3,3,3,4,5,5)=Δ(T2)

Fig. 12.

Fig. 12

Unique minimal example of two binary trees having the same B, the same N, and the same Δ. Specifically, n=11, B(T1)=B(T2)=(0,0,0,0,0,1,1,2,2,3), N(T1)=N(T2)=(2,2,2,2,3,4,4,5,6,11), and Δ(T1)=Δ(T2)=(2,3,3,3,4,4,4,4,4,5,5)

Fig. 13.

Fig. 13

Unique minimal example of two binary trees having different B but the same N. Specifically, n=11, B(T1)=(0,0,0,0,0,0,1,1,1,3)(0,0,0,0,1,1,1,1,2,2)=B(T2), and N(T1)=N(T2)=(2,2,2,2,3,3,4,5,6,11)

Fig. 14.

Fig. 14

One of 13 minimal examples of two binary trees having the same B but different N. Here, n=13, B(T1)=B(T2)=(0,0,0,0,1,1,1,2,2,2,3,3), and N(T1)=(2,2,2,3,3,3,4,4,4,5,8,13)(2,2,2,2,3,3,4,4,5,6,7,13)=N(T2)

Table 3.

This table provides an overview of where to find a minimal example of two distinct binary trees that either share or differ in two of the three sequences B, N, and Δ. Note that all figures except for Figure 14 show a unique minimal example

First Sequence Second Sequence Figure n
B(T1)=B(T2) N(T1)=N(T2) 11 9
B(T1)B(T2) N(T1)=N(T2) 13 11
B(T1)=B(T2) N(T1)N(T2) 14 13
B(T1)=B(T2) Δ(T1)=Δ(T2) 12 11
B(T1)B(T2) Δ(T1)=Δ(T2) 10 6
B(T1)=B(T2) Δ(T1)Δ(T2) 11 9
N(T1)=N(T2) Δ(T1)=Δ(T2) 12 11
N(T1)N(T2) Δ(T1)=Δ(T2) 10 6
N(T1)=N(T2) Δ(T1)Δ(T2) 11 9

Metaconcepts

In this section, we analyze the three previously introduced metaconcepts. Specifically, we determine which families of the function f ensure that a given metaconcept yields a (binary) imbalance index. Accordingly, we examine the trees that minimize and maximize each metaconcept based on the choice of f and provide formulas for computing their minimum and maximum values. Finally, we analyze the locality and the recursiveness of the metaconcepts.

Throughout this manuscript, we focus exclusively on first-order metaconcepts. However, all results regarding minimizing and maximizing trees extend to higher-order metaconcepts that are equivalent to the first-order case. For examples of such functions, see the following remark.

Remark 3.2

The BVM, the LDM, and the binary CSM of order ω2 with a function of the form f(x,o1,,oω-1)=f1(x)·f2(o1,,oω-1)+f3(o1,,oω-1) are equivalent to the first-order metaconcept with f(x)=f1(x), provided that f2(o1,,oω-1)>0 and the additional values o1,,oω-1 are the same for all trees with the same number of leaves (e.g., oi=n but oih(T)). This equivalence holds because the additional values act as constants, and the number of summands in the calculation of these metaconcepts remains the same for all trees with n leaves.

For arbitrary trees, the number of summands in the CSM varies. Thus, the CSM of order ω2 can only be guaranteed to be equivalent to the first-order metaconcept if the function is of the form f(x,o1,,oω-1)=f1(x)·f2(o1,,oω-1), where f2(o1,,oω-1)>0. In this case, the metaconcept remains equivalent to the first-order metaconcept with function f(x)=f1(x).

Summary of Our Main Results

First, we outline the conditions on the function f that ensure that the respective metaconcept yields a (binary) imbalance index. For a detailed overview, including the minimizing trees on BTn, see Table 4. In the next step, we analyze the locality and the recursiveness of the metaconcepts.

Table 4.

This table provides an overview of our results, indicating for which families of the function f the metaconcepts qualify as (binary) imbalance indices. The column labels refer to four cases, all of which require f to be strictly increasing. A checkmark () indicates that no further conditions on f are needed to satisfy the corresponding property. When additional constraints on f are required, they are explicitly stated in the respective cell. Conversely, a cross (×) indicates that for at least one function in the given family, the metaconcept fails to be a (binary) imbalance index. The entry “depends” means that the binary minimizing tree(s) are not the same for all functions within that family. An entry in square brackets indicates that this result is adapted from Cleary et al. (2025), Corollary 4.4

f strictly increasing and
convex str. concave affine (m>0, aR)
ΦfB imb. index on BTn (Theo. 3.6) (Theo. 3.6) (Theo. 3.6) , C(T) (Rem. 3.5)
min. tree(s) on BTn depends (Rem. 3.3) e.g., Tnmb (Theo. 3.8) depends (Rem. 3.3) argminC(T) (Cor. 3.7)
ΦfN imb. index on BTn × (Rem. 3.3) str. convex: (Cor. 3.20) [] (Prop. 2.10) , S(T) (Rem. 3.14)
imb. index on Tn × (Rem. 3.3) str. convex, 2-positive: (Cor. 3.20) 2-positive: (Cor. 3.16) a0: , a=0S(T) (Rem. 3.14, Prop. 3.21)
min. tree(s) on BTn depends (Theo. 3.17, Prop. 2.10) str. convex: Tnmb (Theo. 3.17) Tngfb (Prop. 2.10) argminS(T) (Prop. 3.21)
ΦfΔ imb. index on BTn × (Rem. 3.3) (Prop. 3.30) × (Rem. 3.3) , S(T) (Rem. 3.29)
imb. index on Tn × (Rem. 3.3) (Prop. 3.30) × (Rem. 3.3) , S(T) (Rem. 3.29)
min. tree(s) on BTn depends (Rem. 3.3) argminS(T) (Prop. 3.30) depends (Rem. 3.3) argminS(T) (Prop. 3.31)

Remark 3.3

In this remark, we provide examples to illustrate some of the cases presented in Table 4.

First, we provide an example showing that the minimizing tree of the BVM can vary for strictly increasing (and possibly strictly concave) functions f. To cover both cases, consider two strictly increasing and strictly concave functions, f1 and f2, defined as follows: f1(x)=log212x+1 and f2(x)=log232x+1.

Let n=5, where three binary trees exist: T5gfb, T1=T2fb,T0fb, and T5cat. For i{1,2}, we have

ΦiBT5gfb=2·fi(0)+2·fi(1),ΦiB(T1)=3·fi(0)+fi(3),ΦiBT5cat=fi(0)+fi(1)+fi(2)+fi(3)

and hence

Φ1BT5gfb1.17,Φ2BT5gfb2.64,Φ1B(T1)1.32,Φ2B(T1)2.46,Φ1BT5cat2.91,Φ2BT5cat5.78.

Thus, for the function f1, the gfb-tree is the unique minimizer, while for f2, tree T1 is the unique minimizer of the BVM when n=5.

Second, we demonstrate that the CSM is not a (binary) imbalance index for all strictly increasing functions f. Specifically, we show that the fb-tree is not the unique minimizing tree on BTn. Consider the tree T2=T5gfb,T3gfb, for which the clade size sequence is

N(T2)=(2,2,2,3,3,5,8).

Similarly, for the fully balanced tree T3fb, we have

NT3fb=(2,2,2,2,4,4,8).

Now, define the function f3 as follows:

f3(x)=xifx3,x+2ifx>3.

With this function, we have

Φ3N(T2)=29<30=Φ3NT3fb.

Thus, T2 attains a smaller value than T3fb, proving that the fb-tree is not always a (unique) minimizer. Consequently, the CSM is not a (binary) imbalance index for all strictly increasing functions f.

Third, we show that the LDM is not a (binary) imbalance index for all strictly increasing (and possibly strictly concave) functions f. Again, to cover both cases, we show that the fb-tree is not the unique tree minimizing the LDM on BTn for a chosen strictly increasing and strictly concave function f4, defined as follows: f4(x)=xx+12. This function is strictly increasing. Moreover, it is strictly concave, because f4(x)=12x2+x+14 is strictly decreasing for xR0. Then, we have for the fully balanced tree T2fb,

Φ4ΔT2fb=4·f4(2)=3.2.

For the caterpillar tree T4cat, we have

Φ4ΔT4cat=f4(1)+f4(2)+2·f4(3)3.18

Thus,

Φ4ΔT2fb>Φ4ΔT4cat,

showing that the LDM is not a (binary) imbalance index for all strictly increasing (and possibly strictly concave) functions f.

Fourth, we show that the minimizing tree for the LDM with a strictly increasing and strictly concave function f also depends on the choice of f. In the previous calculation, we observed that the caterpillar minimizes the LDM for the function f4 defined above. Now, let f5=log2 be another strictly increasing and strictly concave function. Then, we have

Φ5ΔT2fb=4<4.17Φ5ΔT4cat.

Thus, T2fb attains the minimum for the function f5, illustrating that the minimizing tree for the LDM depends on the choice of f.

Choosing a suitable (binary) imbalance index derived from a metaconcept

Before measuring tree balance, three key questions must be addressed: Are the trees binary or arbitrary? Which binary tree(s) should be considered the most balanced? Which aspect of the tree (balance values, clade sizes, or leaf depths) should be used to measure balance? Once these questions are answered, the next step is to determine which imbalance index, derived from which metaconcept, is most suitable.

To support this decision, we provide four decision trees in Figures 3 and 4. Each figure contains two decision trees: one for binary trees and one for arbitrary trees. The decision trees in Figure 3 begin with a choice of binary minimizing tree(s), while those in Figure 4 start with the aspect of the tree to be considered, i.e., the class of metaconcepts, and then proceed to the selection of binary minimizing trees.

In both figures, the notations “C(T)” and “S(T)” indicate that the resulting imbalance index is equivalent to the Colless or Sackin index, respectively. The label “argminS(T)” indicates that the binary minimizing trees coincide with those of the Sackin index, although the imbalance index itself may not be equivalent to the Sackin index (as it can lead to different rankings of non-extremal trees).

To illustrate the different options for binary minimizing trees, Figure 5 shows examples of Tnmb, Tngfb, argminS(T), and, for completeness, also argminC(T) for n=12 leaves.

Calculating the metaconcepts in R

Here, we provide R code to calculate the three metaconcepts using the R packages ape Paradis and Schliep (2019) and treebalance Fischer et al. (2023).graphic file with name 11538_2025_1509_Figa_HTML.jpg

General result regarding minimizing trees and sequences

To investigate the extremal trees associated with these metaconcepts, we frequently use the following lemma. Note that the first part of this lemma is a generalization of (Fischer and Liebscher (2021), Theorem 2).

Lemma 3.4

Let Seq be a sequence of length l, sorted in ascending order, which can be determined for every tree TT, where TTn. Denote the i-th entry of Seq(T) by Seq(T)i. Let f:RR be a function, and define the functional ΦfSeq:TR by

ΦfSeq(T):=i=1lfSeq(T)i.

Then, we have:

    1. If a tree TT minimizes the functional ΦfSeq on T for all strictly increasing functions f, then for all T~T, we have
      Seq(T)iSeq(T~)ifor alli1,,l.
    2. Conversely, if a tree TT satisfies for all T~T and all i1,,l
      Seq(T)iSeq(T~)i,
      then T minimizes the functional ΦfSeq on T for all (not necessarily strictly) increasing functions.
  1. If T uniquely minimizes the functional for some increasing function f, then we have for all T~T\T
    Seq(T)i<Seq(T~)ifor at least onei1,,l.
    Conversely, if
    Seq(T)iSeq(T~)ifor alli1,,l
    and
    Seq(T)i<Seq(T~)ifor at least onei1,,l
    for all T~T\T, then T (uniquely) minimizes the functional for all (strictly) increasing functions f.

Both statements also hold in the maximization case, where “minimizing” is replaced by “maximizing”, and all inequalities are reversed.

Proof

    1. We prove this assertion by contradiction. Let T minimize the functional ΦfSeq for all strictly increasing functions f. Assume that there exists a tree T~T such that Seq(T)i>Seq(T~)i for at least one i1,,l. For the rest of the proof let imin be the smallest i with this property. The strategy of the proof is now to construct a function f~ that is strictly increasing and yields Φf~Seq(T)>Φf~Seq(T~), leading to a contradiction.
      Note that the sequence Seq is sorted in ascending order. Thus, T has more entries in its sequence Seq(T) whose value is at least Seq(T)imin than T~ has in its sequence Seq(T~). Let mT be the number of entries in Seq(T) with Seq(T)jSeq(T)imin, i.e., mT=l-imin+1. Moreover, let mT~ be the number of entries in Seq(T~) with Seq(T~)jSeq(T)imin. Then we have mT~<mT and thus mT~-mT-1.
      The idea of the construction of f~ now is to take the identity function id and add a penalty term x to values greater or equal to Seq(T)imin. Let D:=ΦidSeq(T~)-ΦidSeq(T) be the difference of the functional applied to T and T~ when using the identity function id, which is strictly increasing, thus implying D0.
      Now, let xR>D. We then define the strictly increasing function f~ as follows:
      f~(s):=s,ifs<Seq(T)imins+x,ifsSeq(T)imin.
      This yields
      Φf~Seq(T~)-Φf~Seq(T)=ΦidSeq(T~)+mT~·x-ΦidSeq(T)+mT·x=D+mT~-mT-1·x>D<0.
      This contradicts the assumption that T minimizes the functional for all strictly increasing functions and thus completes the proof for this part.
    2. Now, let TT and suppose that for all T~T and for all i{1,,l}, we have Seq(T)iSeq(T~)i. For any increasing function f, it follows that applying f to each entry preserves the order, i.e., f(Seq(T)i)f(Seq(T~)i) for all i1,,l. Summing over all indices, we obtain
      ΦfSeq(T)=i=1lf(Seq(T)i)i=1lf(Seq(T~)i)=ΦfSeq(T~).
      Thus, T minimizes ΦfSeq for all increasing functions f, which completes the proof of this part.
  1. First, assume that T uniquely minimizes the functional ΦfSeq for some increasing function f. We want to show that
    Seq(T)i<Seq(T~)ifor at least onei1,,land for allT~T\{T}.
    Assume that this is not the case, i.e., assume there exists T^T\{T} such that
    Seq(T)iSeq(T^)ifor alli1,,l.
    Then, while T might minimize the functional, T cannot minimize the functional uniquely, because, by the first part of the lemma, ΦfSeq(T)ΦfSeq(T^). This contradicts the assumption and completes the proof for this part.
    For the second assertion, let TT and assume that
    Seq(T)iSeq(T~)ifor alli1,,l
    and
    Seq(T)i<Seq(T~)ifor at least onei1,,l
    for all T~T\{T}. Then, applying any (strictly) increasing function f, we obtain
    f(Seq(T)i)f(Seq(T~)i)for alli1,,l,
    with strict inequality for at least one index precisely if f is strictly increasing. Summing over all indices gives
    ΦfSeq(T)=i=1lf(Seq(T)i)i=1lf(Seq(T~)i)=ΦfSeq(T~),
    which shows that T (uniquely) minimizes the functional for all (strictly) increasing functions f. Note that the minimum is unique precisely if the inequality is strict, which is the case when f is strictly increasing. This completes the proof for minimization.

The proof for the respective maximization statements follows analogously by reversing all inequalities. Thus, the entire proof is complete.

Note that if T=BTn and SeqB,N,Δ, then the sequences have the same length for all considered trees. Moreover, the functional ΦfSeq corresponds to the respective metaconcept.

Having established this useful lemma, we can now begin our analysis of the three classes of metaconcepts. We start with the balance value metaconcept.

Balance Value Metaconcept ΦfB

In the following, we prove one of our main results, namely that the BVM is a binary imbalance index for all strictly increasing functions f. This generalizes existing results in the literature, which were previously proven only for specific functions f, such as the Colless index, the corrected Colless index, and the quadratic Colless index. Additionally, we demonstrate that all imbalance indices induced by strictly increasing and affine functions f are equivalent to the Colless index. Furthermore, we show that the mb-tree (uniquely) minimizes the BVM if f is strictly increasing and (locally strictly) convex. In a second step, we compute the minimum and maximum values of the BVM.

We first investigate the relationship between the BVM ΦfB and the Colless index, the corrected Colless index, and the quadratic Colless index by focusing on the specific properties of the function f that induces each of these indices.

Remark 3.5

Let f be an affine function, i.e., f(b)=m·b+a, and TBTn. Recalling that a binary tree with n leaves has n-1 inner vertices, we have

ΦfBT=bBTf(b)=bBT(m·b+a)=m·bBTb+(n-1)·a=cf. Table2m·CT+(n-1)·a.

It follows immediately that the BVM with strictly increasing and affine f (i.e., m>0) is equivalent to the Colless index on BTn.

Furthermore, by Remark 3.2, the corrected Colless index is equivalent to the Colless index. Additionally, we note that the functions inducing the Colless index (i.e., the identity function) and the quadratic Colless index (i.e., fQC(b)=b2) are both strictly increasing. The function inducing the Colless index is affine, whereas the function inducing the quadratic Colless index is strictly convex for b0.

With this in mind, we now turn our attention to the extremal trees of the BVM.

3.2.1.1 Extremal Trees

We begin by analyzing the trees that maximize, respectively minimize, the BVM for (strictly) increasing functions f.

Theorem 3.6

Let f be a (strictly) increasing function.

  1. The caterpillar Tncat is the (unique) tree maximizing the balance value metaconcept ΦfB on BTn.

  2. If n=2h with h0, then the fully balanced tree Thfb is the (unique) tree minimizing ΦfB on BTn.

In particular, the balance value metaconcept is a binary imbalance index for all strictly increasing functions f.

Proof

Let f be a (strictly) increasing function.

  1. We will show that the caterpillar (uniquely) maximizes ΦfB on BTn. Specifically, we first show that for all trees TBTn, we have B(T)iBTncati for all i1,,n-1. For n3, there is nothing to show, as there exists only one binary tree with n leaves. Let n4 be the smallest number of leaves for which there exists a tree TBTn such that B(T)i>BTncati for at least one i1,,n-1. By assumption, the statement of the theorem holds for Tn-1cat. Let Tn-1 be a tree from which T can be obtained by attaching a cherry to one of its leaves. Note that Tncat can be obtained in the same way from Tn-1cat. Thus, for both trees, we have B(T)1=BTncat1=0, corresponding to the parent of the attached cherry in each respective tree.

    Note that BTncat=(0,1,,n-2) and, consequently, B(Tn-1cat)=(0,1,,n-3). Moreover, attaching a cherry to Tn-1 to obtain T increases each balance value by at most one. Therefore,
    BTncati=BTn-1cati-1+1B(Tn-1)i-1+1B(T)ifori=2,,n-1,
    contradicting the assumption that BTncati<B(T)i for at least one i1,,n-1.

    Next, we show that B(T)i<BTncati for at least one i. Since TTncat, this follows directly from the fact that T has at least two cherries. Hence, B(T)2=0<1=BTncat2.

    Now, together with Lemma 3.4, this implies that the caterpillar (uniquely) maximizes ΦfB on BTn for any (strictly) increasing function f.

  2. Now, we show that the fb-tree of height h (uniquely) minimizes ΦfB on BTn with n=2h for any (strictly) increasing function f. Recall that we have B(T)=(0,,0) if and only if T=Thfb. In particular, for all i1,2,,n-1, we have
    BThfbi=0B(T)i,
    and if TThfb, then
    BThfbi=0<B(T)i
    for at least one i1,2,,n-1. It now follows from Lemma 3.4 that the fb-tree (uniquely) minimizes ΦfB in this case, which completes the proof.

By both parts of the proof, the BVM is a binary imbalance index for all strictly increasing functions. This completes the proof.

Hence, by Theorem 3.6, we have identified a family of binary imbalance indices, some of which are already known. As shown in Table 2 and Remark 3.5, this family includes the Colless index, the equivalent corrected Colless index, and the quadratic Colless index.

Next, we consider the minimizing trees of the BVM with affine functions f.

Corollary 3.7

Let f be a strictly increasing affine function, i.e., f(b)=m·b+a with m,aR and m>0. Then, for all n, the trees that minimize the balance value metaconcept ΦfB are the same as those that minimize the Colless index. In particular, both the gfb-tree and the mb-tree achieve this minimum.

Proof

The proof follows directly from the equivalence of the BVM to the Colless index, as stated in Remark 3.5, given that m>0.

We remark that all binary trees with n leaves minimizing the Colless index are completely characterized (Remark 2.5), implying that we also have a full characterization of the trees minimizing the BVM ΦfB for strictly increasing affine functions f.

Next, we prove that the mb-tree minimizes the BVM for all strictly increasing and convex functions f.

Theorem 3.8

Let f be a strictly increasing and convex function. Then, the mb-tree Tnmb minimizes the balance value metaconcept ΦfB on BTn for all n. Moreover, the mb-tree uniquely minimizes ΦfB on BTn if additionally f(1)-f(0)<f(2)-f(1), i.e., if f is additionally locally strictly convex.

To prove this theorem we need three more lemmas. Recalling that cn denotes the minimum value of the Colless index on BTn, we first show that cn2 for all n4 that are not powers of two.

Lemma 3.9

Let nN4 be such that n2h for all hN. Then, we have cn2.

Proof

Let n be as stated in the lemma. Let T be a tree minimizing the Colless index, i.e., C(T)=cn. In particular, this implies n5. Seeking a contradiction, assume cn1. Note that cn=0 would imply n=2h as only Thfb can obtain cn=0 (Coronado et al. (2020, Corollary 1)). Thus, we necessarily have cn=1. This, however, means that we have precisely one vertex u in T with balance value 1, so its children, say v and w, induce subtrees of sizes nv and nw with nw=nv+1. This implies that precisely one of the values nv and nw, say nv, is odd, and thus nu=nv+nw is odd, too. Now if nv>1, then v would be an inner vertex with bv1 (as Inline graphic), a contradiction to cn=1. So we must have nv=1. However, as bu=1, this implies nw=2. So nu=2+1=3. Thus, u cannot be the root of the tree as n5. So u must have a parent a of balance value 0 (as u is the only vertex with balance value 1). Thus, a has two children vertices, both of which induce a subtree of size 3 – and thus, as 3 is odd, a vertex of balance value 1. This contradiction completes the proof.

Note that we already know for (strictly) increasing f that Tnmb (uniquely) minimizes the BVM ΦfB on BTn if n=2h for some hN. This is due to the fact that in this case, Tnmb coincides with Thfb (see Theorem 3.6). The following two lemmas addresses the case in which n is not a power of two and show that under certain conditions, Tnmb is then still the (unique) minimizer of ΦfB.

Lemma 3.10

Let f be strictly increasing. Let nN such that n2h for all hN. If we have for all sequences b1,,bk with k=CTnmb, b1++bkk, and bi>1 for some i{1,,k} that k·f(1)f(b1)++f(bk), then Tnmb minimizes the balance value metaconcept ΦfB on BTn. Moreover, the minimizer is unique precisely if the inequality is strict.

Proof

Let f and n be as stated in the lemma. Note that this implies that the smallest value of n we need to consider is n=3 (as n=1=20 and n=2=21). In this case, however, there is only one possible binary tree, so there is nothing to show. Now, let nN4 be such that n2h for all hN. In particular, we can assume n5. Let k=CTnmb, which satisfies kC(T) for all TBTn by Proposition 2.4. We also know by Lemma 3.9 that k2. Furthermore, the balance value sequence of the mb-tree consists of k entries of 1 and n-1-k entries of 0, since all n-1 inner vertices are balanced. Consequently,

ΦfBTnmb=k·f(1)+(n-1-k)·f(0).

Now let TBTn\Tnmb and let b1,,bl1 be the l1 entries of B(T) that are positive. Then, we have:

ΦfB(T)=i=1lf(bi)+(n-1-l)·f(0).

This leads to

ΦfB(T)-ΦfBTnmb=i=1lf(bi)+(n-1-l)·f(0)-(k·f(1)+(n-1-k)·f(0))=i=1lf(bi)-(l-k)·f(0)-k·f(1) 1

Next, we distinguish two cases. Note that b1++bl=C(T)k and at least one bi>1 (otherwise all inner vertices of T would be balanced, contradicting TTnmb).

  • First, let lk. In this case, we have l-k0. Then,
    Eq.(1)fstr. incr.i=1lf(bi)-(l-k)·f(1)-k·f(1)=i=1lf(bi)-l·f(1)>0,
    where the last inequality follows from the fact that b1,,bl1 with at least one bi>1 and the fact that f is strictly increasing.
  • Second, let l<k. Then, define bi:=0 for i=l+1,,k. In this case, we can conclude that
    Eq.(1)=i=1kf(bi)-(k-l)·f(0)-(l-k)·f(0)-k·f(1)=i=1kf(bi)-k·f(1)0,
    where the last inequality follows from the properties of f assumed in this lemma and is strict precisely if the inequality assumed by the lemma is also strict.

Thus, in both cases, we have ΦfB(T)-ΦfBTnmb0 and thus, for strictly increasing f satisfying k·f(1)f(b1)++f(bk), the mb-tree is more balanced than T, and it is even strictly more balanced if the inequality is strict. This completes the proof.

Finally, we use the previous lemma to show that a certain family of functions f satisfies the inequality in Lemma 3.10 and thus yields the mb-tree as (unique) minimizer of the BVM ΦfB with these functions f, too.

Lemma 3.11

Let f be a non-negative and strictly increasing function. Let nN such that n2h for all hN. If we have b·f(1)f(b) for all bN2, then Tnmb minimizes the balance value metaconcept ΦfB on BTn, and it is even the unique minimizer if the inequality is strict.

Proof

If n3, there is only one binary tree and thus there is nothing to show. Now, let n4. Since we have n2h for all hN, we have n5. For this case, we show that the condition of Lemma 3.10 holds for f. Let k=CTnmb and B=(b1,,bk) be any sequence with b1,,bkN such that b1++bkk and bi>1 for at least one i{1,,k}. Note that by Lemma 3.9, we have k2. Our goal is to show that k·f(1)f(b1)++f(bk) (and strict inequality in case of unique minimization).

Let f be as stated in the lemma, in particular, assume b·f(1)f(b) for all bN2. Moreover, let b1,,bl2>1 be the l21 entries of B that are strictly greater than 1. Additionally, let l00 denote the number of entries of B that are equal to 0, and let l10 be the number of entries equal to 1. By assumption, b1++bkk, and therefore, summing the non-zero values of B, we must have

b1++bl2+l1k.

Thus, we can derive the following inequality:

i=1kf(bi)=i=1l2f(bi)+l0·f(0)0+l1·f(1)i=1l2f(bi)+l1·f(1)i=1l2bi·f(1)+l1·f(1)k·f(1).

Note that here, the second-to-last inequality is strict depending on the assumption of the lemma. Therefore, by Lemma 3.10, the mb-tree minimizes the BVM for all non-negative and strictly increasing functions f satisfying b·f(1)f(b) and is even the unique minimizer in case of strict inequality. This completes the proof.

Now, we are in a position to prove Theorem 3.8 using Lemma 3.11.

Proof of Theorem 3.8

In order to prove the theorem, first note that if n=2h for some hN, then Tnmb=Thfb and hence, by Theorem 3.6, the mb-tree is the unique minimizer of the BVM ΦfB for all strictly increasing functions f, and in particular for the function chosen in this theorem.

Hence, we can assume n2h for all hN and start by proving that if f is strictly increasing, convex and satisfies f(1)-f(0)<f(2)-f(1), then the mb-tree uniquely minimizes the BVM. For the proof, we want to apply Lemma 3.11. Therefore, we need to show that we can assume f to be non-negative. We even show that we can assume f(0)=0. We can obtain this assumption from the equivalence of the BVM with function f to the BVM with function f^(x):=f(x)-f(0). Note that f^ retains the properties of being strictly increasing, convex, and satisfying f^(1)-f^(0)<f^(2)-f^(1). This implies that f and f^ have the same extremal properties, which is why we can without loss of generality assume ff^ in the following; in particular, we may assume f(0)=0.

Using f(0)=0, we observe that

f(2)-f(1)>f(1)-f(0)=f(1).

This means that the increment from f(1) to f(2) is greater than f(1). By the convexity of f, it follows (recursively) that the increments from f(x-1) to f(x) for all x2 do not decrease, i.e.,

f(x)-f(x-1)f(2)-f(1)>f(1).

Thus, we obtain for all b2

f(b)=f(1)+i=2bf(i)-f(i-1)>f(1)>b·f(1).

Now, all requirements of Lemma 3.11 are satisfied and we can conclude that the mb-tree uniquely minimizes the BVM for such functions f.

For proving that the mb-tree (not necessarily uniquely) minimizes the BVM if f only satisfies f(1)-f(0)f(2)-f(1) (which is satisfied for convex functions), then the strict inequalities of the proof above may no longer be strict. However, applying Lemma 3.11 again, we conclude that the mb-tree still minimizes the BVM. This completes the proof.

Next, based on the results of this subsection, we determine the extremal values of the balance value metaconcept.

3.2.1.2 Extremal Values

Building on results from the last section, we now determine the minimum and maximum values of the BVM. We first state the maximum value for all n and the minimum value for n=2h if f is an increasing function.

Proposition 3.12

Let f be an increasing function. Then, the maximum value of the balance value metaconcept ΦfB on BTn is given by i=0n-2f(i). Furthermore, if n=2h, the minimum value of ΦfB on BTn is (n-1)·f(0).

Proof

By Theorem 3.6, the caterpillar attains the maximum value, while the fb-tree attains the minimum value. The formulas for the maximum and minimum values now directly follow from the facts that B(Tncat)=(0,,n-2) and BThfb=(0,,0) as well as |B(T)|=n-1 for all binary trees with n leaves.

Finally, we determine the minimum value of the BVM for all n when f is not only strictly increasing but also either affine or convex. Recall that cn (given in Proposition 2.8) denotes the minimum value of the Colless index on BTn.

Proposition 3.13

For any n1, let cn be the minimum value of the Colless index on BTn.

  1. If f is strictly increasing and convex, the minimum value of the balance value metaconcept ΦfB on BTn for all n is
    cn·f(1)+(n-1-cn)·f(0).
  2. If f is strictly increasing and affine, i.e., f(x)=m·x+a, the minimum value of ΦfB on BTn for all n is
    m·cn+(n-1)·a.
Proof
  1. Let f be a strictly increasing and convex function. From Proposition 2.4 and Theorem 3.8, we know that the mb-tree minimizes both the Colless index and the BVM. In particular, CTnmb=cn.

    By definition, the balance value sequence of the mb-tree consists only of ones and zeros, it follows that there are exactly cn ones and n-1-cn zeros. Consequently, the minimum value of the BVM is given by
    ΦfBTnmb=cn·f(1)+(n-1-cn)·f(0).
  2. Now, let f be a strictly increasing and affine function. The correctness of the formula for the minimum value of ΦfB follows directly from the equivalence between the BVM and the Colless index, as stated in Remark 3.5.

This completes the proof.

Next, we analyze the metaconcepts generalizing the Sackin index. We begin with the clade size metaconcept.

Clade Size Metaconcept ΦfN

In the following, we prove that the clade size metaconcept (CSM) is an imbalance index on BTn (or Tn) for all strictly increasing (and 2-positive) functions f that are either affine (i.e., f(x)=m·x+a with m>0 (and a0)) or strictly convex. Recall that by Proposition 2.10, the CSM is an imbalance index on BTn if f is strictly increasing and strictly concave. Here, we will show that the CSM is an imbalance index on Tn if f is additionally 2-positive. In the next step, we calculate the minimum and maximum values of the CSM.

We first highlight the relationship between the CSM ΦfN and several established tree imbalance indices related to clade sizes, such as the Sackin index, the average leaf depth, the s^-shape statistic, and the total cophenetic index. We focus on the specific properties of the functions f that induce each of these indices.

Remark 3.14

Let f be an affine function, i.e., f(nv)=m·nv+a, and let TTn. Then, we have

ΦfN(T)=nvN(T)f(nv)
=nvN(T)(m·nv+a)=m·nvN(T)nv+|N(T)|·a=m·S(T)+|N(T)|·a.

It follows immediately that if m>0, the CSM is equivalent to the Sackin index on BTn, since in this case, |N(T)|=n-1 for each tree TBTn and therefore, |N(T)|·a is a constant. Additionally, the CSM is equivalent to the Sackin index on Tn if m>0 and a=0.

However, if m>0 and a>0, the CSM is not equivalent to the Sackin index on Tn. For example, consider the trees T1 and T2 shown in Figure 6. The Sackin indices are S(T1)=23 and S(T2)=26, meaning S(T1)<S(T2). However, for the function f(nv)=nv+2, we find that ΦfN(T1)=35 and ΦfN(T2)=34, so ΦfN(T1)>ΦfN(T2). This confirms that the CSM and the Sackin index are not equivalent on Tn.

Fig. 6.

Fig. 6

T1 and T2 are ranked differently by the Sackin index and the CSM with f(nv)=nv+2. The inner vertices are labeled with their clade size nv followed by f(nv)=nv+2 in brackets

By Remark 3.2, we know that the average leaf depth is equivalent to the Sackin index. Additionally, the total cophenetic index is induced by the CSM, because on Tn it can also be expressed as

Φ(T)=vV(T)˚\{ρ}nv2=vV˚(T)nv2-n2=nvN(T)nv2-n2|V˚(T)|=ΦΦN(T),

where fΦnv,n,|V˚(T)|=nv2-n2|V˚(T)|. Hence, on Tn the total cophenetic index is induced by the third-order CSM, whereas on BTn it is induced by the second-order CSM, because then V˚(T)=n-1.

Thus, by Remark 3.2, on BTn the total cophenetic index is equivalent to the first-order CSM with the function fΦ~(nv)=nv2. Note that Knüver et al. (2024) introduced a function Φ to measure network balance. When restricted to trees, this function coincides with ΦΦ~N (for further details, see Knüver et al. (2024), page 95.

Finally, note that the functions f that induce the Sackin index (i.e., the identity function), the s^-shape statistic (i.e., fs^(nv)=log(nv-1)), and fΦ~ are all strictly increasing. Moreover, the function for the Sackin index is affine, while the function for the s^-shape statistic is strictly concave but not 2-positive, since fs^(2)=0. In contrast, fΦ~ is strictly convex.

With this in mind, we now turn our attention to the extremal trees of the CSM.

3.2.2.1 Extremal Trees

We begin our analysis of the extremal trees of the CSM by determining its maximum on BTn and Tn. By Proposition 2.11, we already know that the caterpillar uniquely attains the maximum for all strictly increasing functions f on BTn. However, we now extend this result.

Proposition 3.15

The caterpillar Tncat (uniquely) maximizes the clade size metaconcept ΦfN on BTn, provided that f is a (strictly) increasing function. Moreover, the caterpillar (uniquely) maximizes the clade size metaconcept on Tn, if f is (strictly) increasing and 2-positive, i.e., f(x)>0 for x2.

Proof

For the proof, we first show that for all trees TTn, the inequality

Nd(T)iNdTncatifor alli{1,2,,|V˚(T)|}

holds, where Nd denotes the clade size sequences in descending order. The main results then follow directly from this statement.

We begin by considering the case where T is not a binary tree, i.e., TTn\BTn. Our goal is to transform T into a binary tree while ensuring that the clade sizes of its vertices do not decrease. Since T is not binary, it must contain an inner vertex with at least three children. Let v be such a vertex, and denote its children by v1,,vk with k3.

We construct a new tree T from T as follows: Delete the edges (v,vi) for all 2ik, introduce a new vertex w, and add the edges (vw) and (w,vi) for all 2ik. For an illustration, see Figure 7. Note that repeating this process eventually yields a binary tree.

Fig. 7.

Fig. 7

Trees T and T as described in the first part of the proof of Proposition 3.15

Next, we compare the clade size sequences of T and T. Since all original vertices of T remain in T with their clade sizes unchanged, the only difference is the introduction of the new vertex w, which contributes an additional value to the clade size sequence of T. In particular, the transformation does not decrease any clade sizes.

Thus, it suffices to establish the claim for binary trees. However, this follows directly from Proposition 2.11 and Lemma 3.4. Hence, we have Nd(T)iNdTncati for all i{1,2,,|V˚(T)|} and for all TTn. In addition, by the same two results, we get N(T)i<NTncati for at least one i{1,2,,n-1} and for all TBTn\{Tncat}. Thus, again by Lemma 3.4, the caterpillar (uniquely) maximizes ΦfN on BTn if f is (strictly) increasing.

To complete the proof, we now show that for any TTn\BTn with TTncat, we have ΦfN(Tncat)>ΦfN(T) whenever f is increasing and 2-positive. Let m=|V˚(T)|<n-1. Then, we compute

ΦfNTncat=i=1mfNdTncati+i=m+1n-1fNdTncati>0,sincef 2-pos.>i=1mfNdTncatifincr.i=1mfNdTi=ΦfNT.

Thus, the caterpillar (uniquely) maximizes ΦfN on Tn whenever f is (strictly) increasing and 2-positive, thereby completing the proof.

Next, we use this result to conclude that the CSM is an imbalance index for a certain family of functions f.

Corollary 3.16

Let f be a 2-positive function, i.e., f(x)>0 if x2, that is also strictly increasing and strictly concave. Under these conditions, the clade size metaconcept ΦfN is an imbalance index.

Proof

The proof is a direct consequence of Proposition 2.10 and Proposition 3.15.

By Proposition 2.10, we already know that the gfb-tree uniquely minimizes the CSM on BTn if f is strictly increasing and strictly concave. Next, we will show that the mb-tree uniquely minimizes the CSM on BTn if f is strictly increasing and strictly convex.

Theorem 3.17

Let f be a strictly increasing and strictly convex function. Then, Tnmb uniquely minimizes the clade size metaconcept ΦfN on BTn.

Before we can prove this statement, we need two helpful lemmas. In proving these lemmas, we will rely on the locality property of the CSM, which states that if two trees differ only in a rooted subtree, then the difference in their CSM values is entirely determined by these subtrees. To maintain the flow of the manuscript, we postpone the formal statement and proof of this property to Proposition 3.34. Furthermore, we note that the proofs of these lemmas and the main theorem proceed analogously to the proofs presented by Mir et al. (2013) for the total cophenetic index. We include them here for completeness.

Lemma 3.18

Let TBTn4 and suppose that T contains a subtree Tz rooted at an inner vertex z with nT(z)4. Suppose that a and b are the children of z and suppose that they are both inner vertices, inducing subtrees Ta=(T1,T2) and Tb=(T3,T4). Moreover, let ni denote the number of leaves of Ti with i{1,,4} and assume n1n2, n3n4, and n1>n3. If T minimizes the clade size metaconcept ΦfN for strictly convex f, then n4n2.

Proof adapted from Mir et al. (2013), Lemma 11

Let T minimize ΦfN, and assume that f is strictly convex. For the sake of contradiction, suppose that n2>n4. Construct Tz from Tz by swapping the positions of T2 and T4, so that in Tz, vertex a has pending subtrees T1 and T4, while vertex b has pending subtrees T3 and T2. Let T be the tree obtained from T by replacing Tz in T with Tz.

By assumption, we have n1+n2>n1+n4 and n3+n2>n3+n4. Defining λ:=n2-n4>0, we can write n1+n4=n1+n2-λ and n3+n2=n3+n4+λ. Then,

ΦfN(T)-ΦfN(T)=Prop.3.34ΦfN(Tz)-ΦfN(Tz)=f(n1+n2)+f(n3+n4)-f(n1+n4)+f(n3+n2)=f(n1+n2)-f(n1+n2-λ)-(f(n3+n4+λ)-f(n3+n4))=i=1λf(n1+n2+1-i)-f(n1+n2-i)-i=1λf(n3+n4+λ+1-i)-f(n3+n4+λ-i)>fstr. convex0.

The last inequality holds because the assumption n1>n3 yields n1+n2>n3+n2=n3+n4+λ. Together with f being strictly convex, the i-th increment in the first sum is strictly greater than the i-th increment in the second sum. This contradicts the minimality of T and thus completes the proof.

Lemma 3.19

Let TBTn3 and suppose that T contains a subtree Tz rooted at an inner vertex z with nT(z)3 and such that the children of z consist of an inner vertex a and a leaf x of T. Further, let Ta=(T1,T2) with n1n2. If T minimizes the clade size metaconcept ΦfN for a strictly increasing function f, then n1=n2=1.

Proof adapted from Mir et al. (2013), Lemma 12

Let T minimize ΦfN, and assume that f is strictly increasing. For the sake of contradiction, suppose that n1>1. Let Tz be the tree obtained from Tz by switching the positions of the leaf x and the subtree T1, meaning that the pending subtrees of a in Tz are T2 and the leaf x. Moreover, let T be the tree obtained from T by replacing the subtree Tz with Tz. Then,

ΦfN(T)-ΦfN(T)=Prop.3.34ΦfN(Tz)-ΦfN(Tz)=f(n1+n2)-f(n2+1)>fstr. increasing0.

This contradicts the minimization of T and completes the proof.

Now, we are in a position to prove Theorem 3.17.

Proof of Theorem 3.17, adapted from Mir et al. (2013), Theorem 13

First, note that for n=1,2,3, there is only one binary tree, and thus there is nothing to show. Now, assume TBTn\{Tnmb} minimizes ΦfN. Let T have an inner vertex z that is not balanced, but whose children are balanced inner vertices or leaves. Let a and b be the children of z with nanb+2. In particular, a is an inner vertex. If b is a leaf, then by Lemma 3.19, we have na=2. This contradicts the assumption that z is not balanced. Hence, both a and b must be inner vertices. Moreover, due to the choice of z, a and b are balanced. Now, we can express the structure of Tz as in Lemma 3.18, i.e., Ta=(T1,T2) and Tb=(T3,T4) with n1n2 and n3n4. Exploiting the balance of a and b, we deduce that n2{n1-1,n1} and n4{n3-1,n3}. Further, since n1+n2=nanb+2=n3+n4+2, we can conclude that 2n12n3+1, and thus n1>n3. Therefore, it follows that n1>n3n4 and, by Lemma 3.18, we have n4n2. Hence, we know that n2=n1-1, and thus n2=n3=n4. Finally, for the balance value of z, we compute bT(z)=(n1+n2)-(n3+n4)=2n2+1-2n2=1. This contradicts the assumption that z is not balanced and completes the proof. Hence, Tnmb uniquely minimizes the CSM ΦfN on BTn if f is strictly increasing and strictly convex.

Using Theorem 3.17, we can identify another family of functions f that induces (binary) imbalance indices. Note that, by Remark 3.14, the total cophenetic index is equivalent to one of these functions.

Corollary 3.20

Let f be strictly increasing and strictly convex. Then, the clade size metaconcept ΦfN is an imbalance index on BTn. Moreover, if f is also 2-positive, i.e., f(x)>0 if x2, then the clade size metaconcept ΦfN is an imbalance index on Tn.

Proof

The proof is a direct consequence of Proposition 3.15 and Theorem 3.17.

In Remark 3.14, we observed that the CSM with an affine function f(nv)=m·nv+a is equivalent to the Sackin index on BTn if m>0, and on Tn if m>0 and a=0. In both cases, it follows directly that the CSM is a (binary) imbalance index. Further, we show in the following that the CSM is an imbalance index on Tn if m>0 and a0 (rather than only if a=0).

Proposition 3.21

Let f be an affine function, i.e., f(nv)=m·nv+a with m,aR. Then, the clade size metaconcept ΦfN is an imbalance index on Tn if m>0 and a0. Moreover, the clade size metaconcept ΦfN is an imbalance index on BTn if m>0.

Furthermore, for all n, the minimizing trees on BTn of the clade size metaconcept, if m>0, coincide with those of the Sackin index. Specifically, these are either Tnfb or trees that employ precisely two leaf depths, namely hn-1 and hn, where Inline graphic. In particular, both the gfb-tree and the mb-tree minimize the clade size metaconcept for all n.

Proof

The part regarding BTn and the minimizing trees on BTn for n2hn follows directly from Remark 3.14, Lemma 2.6, and Remark 2.7, since in this case the CSM is equivalent to the Sackin index.

To prove the rest of this proposition, we must show that the caterpillar is the unique tree maximizing the CSM on Tn, and the fully balanced tree is the unique tree minimizing the CSM on BTn if n=2hn. The latter again follows from the equivalence to the Sackin index. Finally, we show that the caterpillar is the unique tree maximizing the CSM on Tn. In Remark 3.14, we saw that the CSM is an affine function of the Sackin index if f(nv)=m·nv+a with m,aR for all TTn, i.e.,

ΦfN(T)=m·S(T)+|N(T)|·a.

Note that |N(T)|=|V˚(T)|. Now, assume m>0 and a0. We can exploit the fact that the Sackin index is an imbalance index. Let TTn be an arbitrary tree with TTncat. Then,

ΦfNTncat=m·STncat+NTncat=n-1·a>m·S(T)+N(T)n-1·a=ΦfN(T).

Here, the strict inequality follows from the fact that the Sackin index is an imbalance index, and that m>0 and a0. Thus, the caterpillar maximizes the CSM on Tn if m>0 and a0. This completes the proof.

So far, we have only considered the minimization of the CSM on BTn. In the next proposition, we extend our analysis to arbitrary trees.

Proposition 3.22

Let Tnstar be the star tree on n leaves, and let f be a 2-positive, i.e., f(x)>0 if x2, (though not necessarily increasing) function. Then, the star tree is the unique tree that minimizes the clade size metaconcept ΦfN on Tn.

Proof

Let Tnstar be the star tree on n leaves, and let f be a 2-positive function. For n2, there is only one tree, so there is nothing to show. Now, let n3. Consider a tree TTn\Tnstar distinct from the star tree. Then, |V˚(T)|2. Moreover, every tree has at least one inner vertex with clade size n, namely its root. Thus,

ΦfNTnstar=f(n)<f2-pos.f(n)+i=1|V˚(T)|-1f(N(T)i)=ΦfN(T).

This completes the proof.

Having identified the trees that minimize and maximize the CSM, we can now calculate its minimum and maximum values.

3.2.2.2 Extremal Values

We begin by considering the maximum value of the CSM if f is increasing (and 2-positive).

Proposition 3.23

The maximum value of the clade size metaconcept ΦfN on BTn (on Tn) is i=2nf(i), if f is an increasing (and 2-positive, i.e., f(x)>0 if x2) function.

Proof

By Proposition 3.15, it suffices to show that ΦfNTncat=i=2nf(i). However, this follows directly from the fact that NTncat=(2,,n). This completes the proof.

Next, we consider the minimum values of the CSM. Note that Cleary et al. (2025) have stated a minimum value for their function πc, which is highly related to the CSM on BTn with function f(nv)=log(nv+c) with c>-2. Taking the logarithm of the minimum value of πc (Cleary et al. (2025) Corollary 4.13) yields the minimum value for the CSM on BTn for those functions f. Next, we extend this result to a broader range of functions f.

Proposition 3.24

Let Inline graphic for all iN. Then, the minimum value of the clade size metaconcept ΦfN on BTn for any strictly increasing and strictly concave function f is i=2ngfbn(i)·f(i), where gfbn(i) is as specified in Theorem 2.12.

Proof

First, recall that gfbn(i) is the number of subtrees of Tngfb of size i. Now, for strictly increasing and strictly concave f, the statement is a direct consequence of Proposition 2.10.

Next, we consider the minimum value of the CSM for strictly increasing and strictly convex f.

Proposition 3.25

The minimum value of the clade size metaconcept ΦfN for any strictly increasing and strictly convex function f is i=2nmbn(i)·f(i) with mbn(i) as given in Lemma 3.26.

Before we can prove this statement, we give the number of subtrees of a given size for the mb-tree. The proof of the following lemma can be found in Appendix B. Note that the proof of Proposition 3.25 is merely a direct consequence of Theorem 3.17 and the following Lemma 3.26.

Lemma 3.26

Let n,iN such that in. Let Inline graphic and Inline graphic. For all l{0,,hn}, let rln=n-2l·n2l. Moreover, let mbn(i) denote the number of pending subtrees with i leaves of Tnmb. Then, we have:

Inline graphic

In the next proposition, we consider the maximum and minimum value of the CSM if f is a strictly increasing affine function (with non-negative intercept).

Proposition 3.27

Let f be an affine function, i.e., f(nv)=m·nv+a with m,aR. Then, we have:

  1. The maximum value of the clade size metaconcept ΦfN is
    m·n·(n+1)2-1+(n-1)·a
    1. on Tn if m>0 and a0 and
    2. on BTn if m>0.
  2. Let Inline graphic. Then, the minimum value on BTn is m·-2hn+n·(h+1)+(n-1)·a if m>0, which equals m·hn·2hn+(n-1)·a if n=2hn.

Proof

First, consider the maximum value on Tn for m>0 and a0. By Proposition 3.21, we only need to show that the caterpillar attains the stated maximum value.

By Remark 3.14, we know that for all TTn, the CSM satisfies ΦfN(T)=m·S(T)+|N(T)|·a. Together with Lemma 2.9, we then have

ΦfNTncat=m·STncat+|NTncat|·a=m·n·(n+1)2-1+(n-1)·a.

This completes 1 (a).

The remainder of the proof follows directly from the equivalence of the CSM and the Sackin index, as stated in Remark 3.14, along with either Lemma 2.9 for the maximum value or Lemma 2.6 for the minimum value, thereby completing the proof.

Now, we state the minimum value on Tn for 2-positive f.

Lemma 3.28

Let f be a 2-positive function, i.e., f(x)>0 if x2. Then, if n=1, the minimum value of the clade size metaconcept ΦfN on Tn is 0. Otherwise, if n2, the minimum value of the clade size metaconcept ΦfN on Tn is f(n).

Proof

Let f be a 2-positive function. If n=1, then there exists only one tree with no inner vertex. Hence, the CSM is the empty sum, which is 0. Now, let n2. By Proposition 3.22, it suffices to show that ΦfNTnstar=f(n). This holds because the only inner vertex of Tnstar is the root, which has a clade size of n. This completes the proof.

Finally, we analyze the last metaconcept, the leaf depth metaconcept. As shown in Table 1, it also is a generalization of the Sackin index.

Leaf Depth Metaconcept ΦfΔ

In the following, we will show that the leaf depth metaconcept (LDM) is a (binary) imbalance index for strictly increasing affine functions, as well as for strictly increasing and convex functions. In a second step, we will calculate the minimum and maximum values of the LDM.

Furthermore, we have already observed in Table 1 that the Sackin index and the average leaf depth are induced by the first or second-order LDM, respectively. Now, we show that the LDM is an affine function of the Sackin index for all affine functions f (i.e., f(x)=m·x+a). Additionally, it is equivalent to the Sackin index on Tn if f is strictly increasing and affine, i.e., when m>0.

Remark 3.29

Let f be an affine function, i.e., f(δ)=m·δ+a, and let TTn. Then, we have

ΦfΔ(T)=δΔ(T)f(δ)=δΔ(T)(m·δ+a)=m·δΔ(T)δ+n·a=m·S(T)+n·a

This establishes the equivalence of the LDM with strictly increasing and affine f, i.e., m>0, to the Sackin index on Tn.

With this in mind, we now focus on the extremal trees for the leaf depth metaconcept (LDM).

3.2.3.1 Extremal Trees

In this section, we analyze the maximizing and minimizing trees of the LDM. Through this analysis, we identify two families of functions for which the LDM is a (binary) imbalance index: strictly increasing and convex functions, as well as strictly increasing affine functions.

Proposition 3.30

Let f be a strictly increasing and convex function. Then, the leaf depth metaconcept ΦfΔ is a (binary) imbalance index. Further, for all n, the minimizing trees on BTn of the leaf depth metaconcept coincide with those minimizing the Sackin index. These trees are either Tnfb or those that employ precisely two leaf depths, namely hn-1 and hn, where Inline graphic. In particular, the gfb-tree and the mb-tree minimize the leaf depth metaconcept for all n.

Proof

Let f be strictly increasing and convex. To prove that the LDM is a (binary) imbalance index, we need to show that the caterpillar uniquely maximizes the metaconcept on Tn, and the fb-tree uniquely minimizes it on BTn for n=2hn. For n2, there is nothing to show. Now, let n3.

First, we proceed as we did in the first part of the proof of Proposition 3.15. Specifically, we can turn a tree TTn\BTn step by step into a binary tree. For the exact procedure and notation, refer to Figure 7. By comparing the leaf depths of T and T, we observe the following: all leaves that are descendants of v1 have the same depth in both T and T. For all leaves x that are descendants of vi with i2, it holds that δT(x)+1=δT(x). Based on these two observations and the fact that f is strictly increasing, we conclude that ΦfΔ(T)<ΦfΔ(T). Thus, it remains to show that the caterpillar is the unique maximizer of the LDM on BTn. Therefore, we consider a second procedure for constructing a tree from another tree, which can be viewed as the relocation of a cherry. In this case, both trees involved are binary.

Let TBTn be a binary tree with an inner vertex u such that u is the parent of the cherry formed by the two leaves x and y, and there exists a third leaf z with δ(u)<δ(z). Let T be the tree obtained from T by making z the new parent of the cherry formed by x and y. For an illustration, see Figure 8. As a result, z becomes an inner vertex in T, and u becomes a leaf in T. Note that u and z, respectively, have the same depth in T and T. Further, we have

δ(u)=δT(x)-1andδT(x)+1δT(x)=δ(z)+1. 2

Note that the caterpillar can be constructed in this manner from any other binary tree.

Fig. 8.

Fig. 8

An example illustrating the two ways of relocating a cherry, as used in the proof of Proposition 3.30

Now, we show that ΦfΔ(T)<ΦfΔ(T). By construction, we only need to consider the vertices xyz, and u, as all other leaves remain unchanged in depth between T and T. Hence, exploiting (2), we have

ΦfΔ(T)-ΦfΔ(T)=f(δT(x))-f(δT(u))+f(δT(y))-f(δT(x))+f(δT(z))-f(δT(y))=f(δT(x))-f(δT(x)-1)+f(δT(x))-f(δT(x))+f(δ(z))-f(δT(x))fstr. incr.f(δT(x))-f(δT(x)-1)+f(δT(x))-f(δT(x)+1)0,f convex+f(δ(z))-f(δ(z)+1)<fstr. incr.0.

Thus, ΦfΔ(T)<ΦfΔ(T). Since the caterpillar can be obtained from any other binary tree through repeated applications of this transformation, this completes the proof of this part.

Next, we show that the fb-tree is the unique minimizer of ΦfΔ on BTn for n=2hn. To do so, we consider the reverse operation of the cherry relocation described earlier. Let TBTn be a binary tree with an inner vertex z such that z is the parent of the cherry formed by the two leaves x and y, and suppose there exists a third leaf u with δT(z)>δT(u). Now, let T be the tree obtained from T by making u the new parent of the cherry formed by x and y. For an example, see Figure 8.

In this transformation, u becomes an inner vertex in T, while z is converted into a leaf. As in the previous case, only the vertices xyz, and u are affected, and the relationships given by (2) still hold. Consequently, applying the same calculation as before, we obtain ΦfΔ(T)<ΦfΔ(T). Since the fb-tree can be constructed step by step using this cherry relocation procedure, this establishes the property that it uniquely minimizes ΦfΔ, thus completing the second part of the proof.

It remains to show that if n2hn, the minimizing trees on BTn of the LDM also coincide with those of the Sackin index. By Lemma 2.6, the Sackin minimizing trees are precisely the trees that employ exactly two leaf depths. Now, observe that all such trees share the same leaf depth sequence, meaning they are assigned the same value by the LDM. Furthermore, as demonstrated earlier, these trees can be systematically constructed using the second cherry relocation operation introduced in this proof. Taken together, these observations complete the proof.

Now, we analyze the behavior of the LDM when f is a strictly increasing affine function.

Proposition 3.31

Let f be an affine function, i.e., f(δ)=m·δ+a with m,aR. If m>0, then the leaf depth metaconcept ΦfΔ is a (binary) imbalance index. Further, for all n, the minimizing trees on BTn of the leaf depth metaconcept if m>0, coincide with those that minimize the Sackin index. Specifically, these are either Tnfb or trees that employ precisely two leaf depths, namely hn-1 and hn, where Inline graphic. In particular, both the gfb-tree and the mb-tree minimize the leaf depth metaconcept for all n.

Proof

The fact that ΦfΔ is a (binary) imbalance index follows directly from its equivalence to the Sackin index, as stated in Remark 3.29. For the minimizing trees, in addition to the equivalence to the Sackin index, we use Lemma 2.6 and Remark 2.7 to establish the claim.

In the next Proposition, we establish that the star tree (uniquely) minimizes the LDM on Tn if f is (strictly) increasing.

Proposition 3.32

Let Tnstar be the star tree on n leaves, and let f be a (strictly) increasing function. Then, Tnstar is the (unique) tree minimizing the leaf depth metaconcept ΦfΔ on Tn.

Proof

Let Tnstar be the star tree on n leaves, and let f be a (strictly) increasing function. For n2, there is only one tree, so there is nothing to show. Now, let n3, and let TTn\Tnstar be another tree on n leaves. Since T must have at least one leaf of depth strictly greater than 1, we have

ΔTnstari=1Δ(T)ifor alli=1,,n

and

ΔTnstari=1<Δ(T)ifor at least onei=1,,n.

Since f is increasing, it follows that

ΦfΔTnstar=n·f(1)i=1nfΔ(T)i=ΦfΔ(T),

where the inequality is strict if f is strictly increasing. This completes the proof.

Having proven these three results, we are now in a position to calculate the corresponding maximum and minimum values of the LDM.

3.2.3.2 Extremal Values

Proposition 3.33
  1. Let f be an affine function, i.e., f(δ)=m·δ+a with m,aR and m>0. The maximum value of the leaf depth metaconcept ΦfΔ on both Tn and BTn is
    m·n·(n+1)2-1+n·a.
    Moreover, let Inline graphic. Then, the minimum value on BTn is
    m·-2hn+n·(hn+1)+n·a
    if m>0, which simplifies to m·hn·2hn+n·a if n=2hn.
  2. Let n2, and let f be a strictly increasing and convex function. The maximum value of the leaf depth metaconcept on both Tn and BTn is
    f(n-1)+i=1n-1f(i).
    Moreover, let n=2hn-1+p such that 1p2hn-1. Then, the minimum value of the leaf depth metaconcept on BTn is
    2hn-1-p·f(hn-1)+2·p·f(hn).
  3. Let f be any increasing function. Then, the minimum value of the leaf depth metaconcept on Tn is n·f(1).

Proof
  1. Let f be an affine function, i.e., f(δ)=m·δ+a with m,aR and m>0. The correctness of the maximum value on both Tn and BTn, as well as of the minimum value on BTn, follows directly from the equivalence to the Sackin index, as stated in Remark 3.29, and Lemma 2.9 for the maximum value, respectively Lemma 2.6 for the minimum value.

  2. Let f be strictly increasing and convex, and let n2.

    First, for the maximum value, by Proposition 3.30, we need to prove that ΦfΔ(Tncat)=f(n-1)+i=1n-1f(i). Note that the two leaves in the unique cherry in Tncat have depth n-1, and for every smaller depth, there is exactly one leaf of this depth. Hence, the stated maximum value is correct.

    Second, for the minimum value, again by Proposition 3.30, we must show that the gfb-tree attains the stated minimum value. Let n=2hn-1+p, where 1p2hn-1. By Remark 2.7, we know that the gfb-tree can be constructed from the fb-tree of height hn-1 by attaching p cherries from left to right to its leaves of depth hn-1. The fb-tree of height hn-1 has 2hn-1 leaves of depth hn-1 and 0 cherries of depth hn. For each attached cherry, one leaf of depth hn-1 is replaced by two leaves of depth hn. Therefore, after attaching p cherries, there are 2hn-1-p leaves of depth hn-1 and 2p leaves of depth hn. Thus, the minimum value is (2hn-1-p)·f(hn-1)+2·p·f(hn). This completes this part of the proof.

  3. Let f be any increasing function. By Proposition 3.32, we need to show that ΦfΔTnstar=n·f(1). This holds true because all n leaves of the star tree have depth 1.

This completes the proof.

So far, we have thoroughly analyzed the three classes of metaconcepts, focusing on the trees that minimize and maximize them, as well as their minimum and maximum values. In the next section, we shift our attention to two additional properties a metaconcept can have: locality and recursiveness.

Locality and Recursiveness

In this section, we analyze the locality and prove the recursiveness of the metaconcepts, focusing again on the first-order metaconcepts. Unlike previous results, the conclusions of this section can generally not be extended to higher-order metaconcepts, not even to those that are equivalent to a first-order metaconcept. For example, Fischer et al. (2023), Proposition 12.2 and Proposition 13 proved that the Colless index is local, but the equivalent corrected Colless index is not. Similarly, the Sackin index is local, whereas the equivalent average leaf depth is not (Fischer et al. (2023), Proposition 5.4 and Proposition 6.3). Further, the recursions for a first-order metaconcept do not account for additional values and, thus, do not apply to higher-order metaconcepts. However, it is worth noting that the corrected Colless index and the average leaf depth are recursive (Fischer et al. (2023), Proposition 13.2 and Proposition 6.2).

We begin with the locality. Based on the induced imbalance indices, one might conjecture that the BVM, the CSM, and the LDM are local, since all known imbalance indices induced by the corresponding first-order metaconcepts are local. These include the Colless index (Fischer et al. (2023), Proposition 12.2), the quadratic Colless index (Fischer et al. (2023), Proposition 15.3), the Sackin index (Fischer et al. (2023), Proposition 5.4), and the s^-shape statistic (Fischer et al. (2023), Proposition 9.3). In the next proposition, we will show that this conjecture is true for all functions in the case of the BVM and CSM, and for affine functions in the case of the LDM.

Proposition 3.34
  1. The balance value metaconcept ΦfB and the clade size metaconcept ΦfN are local for all (not necessarily increasing) functions f.

  2. The leaf depth metaconcept ΦfΔ is local if and only if f is affine, i.e., f(δ)=m·δ+a with m,aR.

Before proving this statement, we need a lemma that provides an equivalent condition for a function to be affine.

Lemma 3.35

Let f:RR be a function. Then, f is affine, i.e., f(x)=m·x+a with m,aR, if and only if f(x+z)-f(y+z)=f(x)-f(y) for all x,y,zR.

The proof of this lemma can be found in Appendix B.

Now, we are in a position to prove the proposition above.

Proof of Proposition 3.34
  1. First, we consider the BVM and the CSM. The proof follows the same reasoning as Fischer et al. (2023) (proof of Proposition 12.2 and Proposition 5.4), where the locality of the Colless index and the Sackin index was established. The only difference is that, in our case, the summands are not merely the balance values or clade sizes but rather their evaluations under the function f. However, this modification does not affect the overall argument, thereby proving that both the BVM and the CSM are local.

  2. Now, we establish that the LDM is local if and only if f is an affine function, i.e., f(δ)=m·δ+a with m,aR.

    Consider a tree TTn obtained from a tree TTn by replacing a subtree Tv of T with another subtree Tv, where both subtrees have the same number of leaves and are rooted in v with depth δT(v)=δT(v). Observe that the leaf sets outside these subtrees remain unchanged, i.e., VL(T)\VL(Tv)=VL(T)\VL(Tv), and for all xVL(T)\VL(Tv), we have δT(x)=δT(x), since modifying Tv does not change the distance from the root to those leaves. Additionally, for leaves within these subtrees, we have δT(x)=δT(v)+δTv(x) for xVL(Tv) and δT(x)=δT(v)+δTv(x) for xVL(Tv). Now, we compute the difference
    ΦfΔ(T)-ΦfΔ(T)=xVL(Tv)f(δT(x))+xVL(T)\VL(Tv)f(δT(x))-xVL(Tv)f(δT(x))-xVL(T)\VL(Tv)f(δT(x))=xVL(Tv)f(δT(v)+δTv(x))+xVL(T)\VL(Tv)f(δT(x))-xVL(Tv)f(δT(v)=δT(v)+δTv(x))-xVL(T)\VL(Tv)f(δT(x))=Lem.3.35xVL(Tv)f(δTv(x))-xVL(Tv)f(δTv(x))=ΦfΔ(Tv)-ΦfΔ(Tv)
    Thus, the LDM is local if and only if f is affine, which completes the proof.

A direct consequence of the locality of the metaconcepts is that every subtree of a tree minimizing (respectively, maximizing) a metaconcept, is itself a minimizing (respectively, maximizing) tree for the metaconcept.

Next, we establish the recursiveness of our metaconcepts.

Proposition 3.36

The balance value metaconcept ΦfB is a binary recursive tree shape statistic for all functions f. Similarly, the clade size metaconcept ΦfN is a recursive tree shape statistic for all functions f. In contrast, the leaf depth metaconcept ΦfΔ is a recursive tree shape statistic if f is an affine function with an intercept of a=0.

Let TbBTn be a binary tree with standard decomposition Tb=T1b,T2b such that the maximal pending subtree Tib has nib leaves. Similarly, let TTn be an arbitrary tree with standard decomposition T=(T1,,Tk), where each maximal pending subtree Ti has ni leaves.

Let f be an arbitrary function, and let fm be an affine function with slope mR and intercept a=0. If n=1, we have ΦfBTb=0, ΦfN(T)=0, ΦmΔ(T)=fm(0)=0. Moreover, for n2, we have:

  • ΦfBTb=ΦfBT1b+ΦfBT2b+fn1b-n2b,

  • ΦfN(T)=i=1kΦfN(Ti)+f(n1++nk),

  • ΦmΔ(T)=i=1kΦmΔ(Ti)+(n1++nk)·m.

Proof

Let T=(T1,,Tk), Tb=T1b,T2b, ni, nib, f, and fm as described.

For n2, we can calculate the BVM as follows

ΦfBTb=bBTbf(b)=Rem.3.1bBT1bf(b)+bBT2bf(b)+fn1b-n2b=ΦfBT1b+ΦfBT2b+fn1b-n2b.

Thus, it is a binary recursive tree shape statistic of length x=2, where the recursions r1 and r2 are:

  • BVM: λ1=ΦfBT1cat=0 and
    r1r1T1b,r2T1b,r1T2b,r2T2b=ΦfBT1b+ΦfBT2b+fn1b-n2b
  • leaf number: λ2=1 and r2r1T1b,r2T1b,r1T2b,r2T2b=n1b+n2b

Hence, we have λR2 and ri:R2×R2R. Moreover, all recursions ri are symmetric. This completes the proof for the BVM.

For n2, we can calculate the CSM as follows

ΦfN(T)=nvN(T)f(nv)=Rem.3.1nvN(T1)f(nv)++nvN(Tk)f(nv)+f(n)=i=1kΦfN(Ti)+f(n1++nk).

Thus, it is a recursive tree shape statistic of length x=2, where the recursions r1 and r2 are:

  • CSM: λ1=ΦfNT1cat=0 and
    r1(r1T1,r2T1,,(r1Tk,r2Tk))=ΦfNT1++ΦfNTk+fn1++nk,
  • leaf number: λ2=1 and
    r2r1T1,r2T1,,r1Tk,r2Tk=n1++nk.

Hence, we have λR2 and ri:R2××R2ktimesR. Moreover, all recursions ri are symmetric. This completes the proof for the CSM.

For n2, we can calculate the LDM as follows

ΦmΔT=δΔTfmδ=Rem.3.1δΔT1fmδ+1++δΔTkfmδ+1=δΔT1fmδ+m++δΔTkfmδ+m=i=1kΦmΔTi+n1++nk·m.

Thus, it is a recursive tree shape statistic of length x=2, where the recursions r1 and r2 are:

  • LDM: λ1=ΦmΔT1cat=fm0=0 and
    r1((r1T1,r2T1),,r1Tk,r2Tk)=i=1kΦmΔTi+n1++nk·m,
  • leaf number: λ2=1 and
    r2r1T1,r2T1,,r1Tk,r2Tk=n1++nk.

Hence, we have λR2 and ri:R2××R2ktimesR. Moreover, all recursions ri are symmetric. This completes the proof for the LDM.

Discussion

While tree balance is typically quantified using a single index function, Cleary et al. (2025) recently introduced a functional instead of a function to measure tree balance for rooted trees. This functional is based on the clade size sequence of a tree and depends on another function f. In this manuscript, we have generalized this concept to a broader framework, which we call the imbalance index metaconcept of order ω. This metaconcept allows for any tree shape sequence as its underlying sequence. Exploiting this property, we introduced two additional subclasses alongside the clade size metaconcept (CSM): the balance value metaconcept (BVM) and the leaf depth metaconcept (LDM). We thoroughly analyzed these three metaconcepts with respect to their underlying function f. As a result, we identified many families of functions f for which these metaconcepts yield (binary) imbalance indices, leading to a range of new imbalance indices. To help users identify a suitable imbalance index obtained from a metaconcept, we provided four decision trees. Additionally, we included R code for computing the metaconcepts and, consequently, the resulting imbalance indices. Furthermore, we analyzed the trees that maximize the metaconcepts for all leaf numbers, as well as the minimizing trees. Finally, we determined their minimum and maximum values.

Recall that the Sackin index and the Colless index are induced by the first-order CSM and BVM, respectively, when f is the identity function, i.e., a strictly increasing and affine function. Consequently, both are minimized by several trees in BTn, including the gfb-tree and the mb-tree. We proved that when the identity function is approximated from above by a strictly increasing and strictly convex function, the mb-tree becomes the unique minimizer of both the CSM and the BVM. Conversely, when the identity function is approximated from below by a strictly increasing and strictly concave function, the gfb-tree is the unique minimizer of the CSM. Moreover, this result extends not only to the identity function but to all strictly increasing and affine functions. This is remarkable, because minor changes in the function f lead to major changes in the set of minimizing trees. We elaborate on this observation in Figure 9.

Fig. 9.

Fig. 9

This figure shows (solid line) the strictly increasing and affine identity function f(x)=x, on which the Colless index and the Sackin index are based. In this case, the BVM ΦfB and the CSM ΦfN have several minima, including the gfb-tree and the mb-tree. It also shows (dashed line) a strictly increasing and strictly concave approximation of the identity from below, namely f(x)=x-1x. Note that all functions of the type f(x)=x-1xa for a1 are such approximations, and all of them will lead to the CSM ΦfN having the gfb-tree as its unique minimum. The figure also shows (dotted line) a strictly increasing and strictly convex approximation of the identity from above, namely f(x)=x+1(x+1). Note that all functions of the type f(x)=x+1(x+1)a for a1 are such approximations, and all of them will lead to the BVM ΦfB and the CSM ΦfN having the mb-tree as their unique minimum. Also note that we used x+1 in the last case, because using x in the function would not lead to a strictly increasing f for all passed values greater equal zero

Further, since many known imbalance indices also depend on one of the three sequences underlying the metaconcepts, we can classify these indices by determining which metaconcept they satisfy. We found that seven known imbalance indices fall into one of the three metaconcept classes, which can be further divided into several subclasses depending on the choice of f. Consequently, some of our results for the metaconcepts recover results from the literature in a more concise way. For example, Theorem 3.6 unifies six separate proofs from the literature into a single result. Additionally, one subclass, the LDM with strictly increasing and convex f, is totally new to the literature in the sense that none of the existing imbalance indices is induced by it.

A promising direction for future research is to further compare the introduced metaconcepts, for example, by analyzing the resolution of the imbalance indices they induce. It stands to reason that the LDM is the least resolved metaconcept, as for trees with eight leaves, there exist four pairs of trees that share the same Δ (leaf depth sequence), as well as a quartet of trees with identical Δ values, none of which share the same N (clade size sequence) or B (balance value sequence).

Another avenue for future work is to modify the imbalance index metaconcept itself. For instance, one could use sequences based on pairs of vertices (as in the original definition of the total cophenetic index), or consider sequences based on the edges of a tree rather than the vertices.

Acknowledgements

The authors wish to thank Volkmar Liebscher for various discussions and helpful insights. Parts of this material are based upon work supported by the National Science Foundation under Grant No. DMS-1929284 while MF and KW were in residence at the Institute for Computational and Experimental Research in Mathematics in Providence, RI, during the Theory, Methods, and Applications of Quantitative Phylogenomics semester program.

Additional figures

All inner vertices v of all trees in Figure 10-14 are labeled by a pair (bv,nv) containing its balance value bv and its clade size nv. If the leaves are labeled, then they are labeled by their leaf depth.

Proofs of additional lemmas

Proof of Lemma 3.26

The case i=1 is trivial. Thus, we subsequently assume ni2. Before we consider these cases more in-depth, we start with some general observations. Throughout the proof, we subdivide the vertices in Tnmb into layers, where the root is the only vertex in layer 0, the root’s children are the only vertices in layer 1 and so forth. Then the root of a subtree Ti of Tnmb with i leaves can only be located in layers hn-hi-1 or hn-hi. This is because all leaves must either be contained in layer hn or hn-1 as Tnmb has height hn (cf. Remark 2.7), and as all of its pending subtrees are also maximally balanced trees (cf. Section 2.1), which implies that they have height hi.

The high-level idea of the proof now is as follows:

  • We show that each layer l contains up to 2l vertices, and each layer l, possibly except for layer hn in case n<2hn, contains precisely 2l vertices each.

  • We show that each vertex in layer l is the root of a subtree of size either n2l or Inline graphic. Thus, no other subtree sizes are possible.

  • For i>1, we show that only subtrees of size i=2 can be contained in more than one layer. All other subtree sizes can occur in only one layer (i.e., for i3, if there are subtrees of size i, they are all rooted in the same layer).

  • We then count in each layer the subtrees of sizes n2l and Inline graphic, respectively.

We start with proving that layer l in Tnmb contains up to 2l vertices, all of which induce subtrees with either n2l or Inline graphic many leaves. We prove this by induction on l. For l=0, we have only the root, so indeed we have 20=1 vertices in this layer, and this vertex is the root of the entire tree, so of n=n20 leaves. This completes the base case. For the inductive step, let us assume we know that the statement holds up to l and now consider layer l+1. It is clear that as layer l has at most 2l vertices by induction, layer l+1 can have at most 2l+1 vertices, because each vertex in layer l has at most two children in layer l+1. Moreover, each vertex v in layer l+1 is the child of a vertex u in layer l, which by induction is the root of a tree Tu with either n2l or Inline graphic many leaves. Thus, we know by the definition of Tnmb that this only leaves four options for v: If Tu has n2l many leaves, then the tree Tv induced by v can only have either n2l2 or Inline graphic leaves, and if Tu has Inline graphic many leaves, then the tree Tv induced by v can only have either Inline graphic or Inline graphic leaves. However, using the well-known identities abc=abc and Inline graphic (which hold for all real numbers ab as well as positive integers c), we note that n2l2=n2l+1 and Inline graphic as desired. Similarly, for the remaining two possible values, before using the same identities, we additionally have to convert the inner floor and ceiling functions first by using the identities Inline graphic and Inline graphic (which hold for all real numbers a and positive numbers b). This gives us Inline graphic as well as Inline graphic. Therefore, in all cases, we have that the number nv of leaves of Tv is either n2l+1 or Inline graphic, which completes the induction.

Moreover, note that in Tnmb, all layers l except possibly for layer hn are “full” in the sense that they contain precisely 2l many vertices. This must be true as otherwise there would be two leaves with a depth difference of more than 1, which is not possible in Tnmb (cf. Remark 2.7). Also note that layer hn only contains leaves, i.e., it can only induce subtrees of size 1.

Before we continue, we now consider the possible layers l of Tnmb in which a subtree of size i can be rooted. For i=2, there must be at least one such subtree (as n2), and this is necessarily rooted in layer hn-1 (as the parent of the cherry with maximal depth induces such a subtree). However, depending on n, Tnmb can also contain cherry parents on layer hn-2 (as an example, consider T5mb depicted as tree T in Figure 8). But there cannot be such a cherry parent on layer hn-3 or lower, because this would induce leaves on layer hn-2 or lower, which would inevitably lead to two leaves of a depth difference of more than one in Tnmb (between a leaf of maximum depth hn and the newly found leaf on layer hn-2 or smaller), which cannot happen in Tnmb (cf. Remark 2.7).

Now, consider i3. We show that in this case, the layer l with lhn-2 of Tnmb in which all subtrees of size i can potentially be contained is uniquely determined. In particular, it is log2ni or Inline graphic, depending on i. This is because if a subtree of size i3 is rooted in layer l, we already know that i=n2l or Inline graphic. In the first case, we have in2l<i+1, and thus 2l·in<2l(i+1), which directly implies llog2ni, and thus, as l is an integer, llog2ni. Moreover, from 2l·in<2l(i+1) we also derive log2ni+1<l. Now assume l<log2ni. Then, as l is an integer, we have log2ni+1<l<l+1log2nilog2ni, which implies log2ni-log2ni+1>1. However, this is a contradiction as log2ni-log2ni+1=log2i+1i<1 for all i3. Thus, we must have l=log2ni in the first case as desired.

For the second case, analogously to the first case, it can be easily seen that we have Inline graphic as desired.

Next, we complete our analysis of the case i3 by counting the subtrees of Tnmb of each such size. As we have seen that for i3 all subtrees of such a size must fulfill i=n2l with l=log2ni or Inline graphic with Inline graphic, it immediately follows that if i does not meet this requirement, we have mbn(i)=0, thus proving the fifth statement of the theorem.

Now consider any layer lhn-2 (note that larger layers cannot contain the root of a subtree of size 3 as Tnmb has height hn). We have already seen that this layer contains precisely 2l many vertices, each of which induces a subtree of size i=n2l or Inline graphic. We now distinguish two cases. First, assume Inline graphic. Then, all 2l many subtrees rooted in layer l are of this size. This partially proves the seventh statement of the theorem, because in this case we have rln=0, which can be seen as follows:

nn2l=nn2l=2lrln=n-2l·n2l=n-nn2l·n2l=n-n=0.

If, however, Inline graphic, informally speaking, the only way to divide the n leaves between the 2l subtrees of sizes n2l and Inline graphic is to fill up all of the subtrees with n2l many leaves and then add 1 leaf each to some of them, until the rln=n-2l·n2l leaves are used up. This leads to rln subtrees of size Inline graphic and to 2l-rln subtrees of size n2l, which completes the proof of the sixth and seventh cases.

It remains to consider the case i=2. As subtrees of size 2 are, besides the ones of size 1, the only ones that can occur on more than one layer, we have to distinguish three cases. Note that mbn(2)=0 is not possible as n2, so there must be at least one cherry. However, there are three cases: All cherries are rooted in layer hn-1, and this layer may or may not also contain leaves, or both layers hn-1 and hn-2 contain cherries. In case all cherries are rooted in layer hn-1 and this layer does not contain any leaves, obviously all 2hn-1 vertices in this layer are parents of a cherry, so mbn(i)=2hn-1. This, however, implies that layer 2hn contains 2hn many leaves, i.e., n=2hn. This proves the second statement of the theorem.

So from now on, we consider the case n<2hn and i=2 to prove the third and fourth statement of the theorem. We have already seen that then not all vertices in layer hn-1 can be roots of cherries, so layer hn-1 must contain some leaves. It remains to distinguish the case in which all cherry parents of Tnmb are contained in layer hn-1 from the case in which some of the cherry parents are in layer hn-2.

As we have seen, in both cases we have that in layer hn-1, we have both cherry parents and leaves, so we must have 1=n2hn-1 and Inline graphic. Now, if cherry parents are also contained in layer hn-2, they must there be the vertices inducing smaller subtrees, because all parents of the cherries in layer hn-1 are also contained in layer hn-2 and they induce larger subtrees. Thus, we must also have i=2=n2hn-2. So in summary, if vertices inducing subtrees of size i=2 are contained in both layers hn-1 and hn-2, we must have Inline graphic. This corresponds to the condition of the fourth statement of the theorem. We now count the number of cherry parents in each of the two layers by the same arguments as in the case i3: All 2hn-1 vertices in layer hn-1 are roots of subtrees of size at least 1. The remaining rhn-1n=n-2hn-1·n2hn-1 subtrees contain two leaves. This number needs to be added to the 2-leaf subtrees induced by layer hn-2. In this layer, we know that all 2hn-2 vertices induce trees of size at least 2, but rhn-2n of them induce three leaves. So layer hn-2 comes with 2hn-2-rhn-2n many subtrees of size 2. Thus, layers hn-1 and hn-2 in this case together induce 2hn-2-rhn-2n+rhn-1n many subtrees of size 2, which proves the fourth statement of the theorem.

So only the third statement of the theorem remains. We are still in the case where n<2hn, and we now have that only layer hn-1 contains cherry parents. As we have already seen, the other vertices in layer hn-1 must be leaves, so we have Inline graphic. As layer hn-1 contains precisely 2hn-1 vertices, by the same arguments as used above, we can conclude that the only way to distribute n leaves to these vertices is to consider 2hn-1 trees of size 1, i.e., single leaves, and to turn rhn-1n=n-2hn-1·n2hn-1 many of these leaves into cherry parents. This leads to rhn-1n many cherries, which completes the third statement of the theorem and thus the entire proof.

Proof of Lemma 3.35

  • ” Let f:RR be affine and x,y,zR. Then,
    f(x+z)-f(y+z)=m·(x+z)+a-(m·(y+z)+a)=m·x+a-(m·y+a)=f(x)-f(y).
  • ” Let f:RR be a function such that f(x+z)-f(y+z)=f(x)-f(y) for all x,y,zR. Then, for x=0 and z=1, we have
    f(0+1)-f(y+1)=f(0)-f(y)for allyRf(y+1)-f(y)=f(1)-f(0)for allyR.
    Thus, the slope between y and y+1 is constant, namely
    m=f(y+1)-f(y)(y+1)-y=f(1)-f(0)1=f(1)-f(0).
    Since yR was arbitrary, this implies that f is affine, which completes the proof.

Author Contributions

All authors contributed equally.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Data Availability

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

Declarations

Conflicts of Interest

The authors herewith certify that they have no affiliations with or involvement in any organization or entity with any financial (such as honoraria; educational grants; participation in speakers’ bureaus; membership, employment, consultancies, stock ownership, or other equity interest; and expert testimony or patent-licensing arrangements) or non-financial (such as personal or professional relationships, affiliations, knowledge or beliefs) interest in the subject matter discussed in this manuscript.

Footnotes

This work was supported by various research grants.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Mareike Fischer, Tom Niklas Hamann and Kristina Wicke contributed equally to this work.

References

  1. Andersson A (1993) Balanced search trees made simple. In: Dehne F, Sack J-R, Santoro N, Whitesides S (eds) Algorithms and Data Structures. Springer, Berlin, Heidelberg, pp 60–71. 10.1007/3-540-57155-8_236 [Google Scholar]
  2. Knuth DE (1998) The Art of Computer Programming, vol 3, 2nd edn. Sorting and Searching. Addison Wesley Longman Publishing Co., Inc, USA
  3. Fischer M, Herbst L, Kersting S, Kühn L, Wicke K (2023) Tree Balance Indices: A Comprehensive Survey. Springer, Cham. 10.1007/978-3-031-39800-1 [Google Scholar]
  4. Fischer M, Liebscher V (2021) On the balance of unrooted trees. Journal of Graph Algorithms and Applications 25(1):133–150. 10.7155/jgaa.00553 [Google Scholar]
  5. Cleary, S., Fischer, M., John, K.S.: The gfb tree and tree imbalance indices. arXiv:2502.12854 (2025) 10.48550/arXiv.2502.12854
  6. Coronado TM, Fischer M, Herbst L, Rosselló F, Wicke K (2020) On the minimum value of the Colless index and the bifurcating trees that achieve it. J Math Biol 80(7):1993–2054. 10.1007/s00285-020-01488-9 [DOI] [PubMed] [Google Scholar]
  7. Fill JA (1996) On the distribution of binary search trees under the random permutation model. Random Structures and Algorithms 8(1):1–25. 10.1002/(sici)1098-2418(199601)8:1<1::aid-rsa1>3.0.co;2-1 [Google Scholar]
  8. Paradis E, Schliep K (2019) ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35:526–528. 10.1093/bioinformatics/bty633 [DOI] [PubMed] [Google Scholar]
  9. Mir A, Rosselló F, Rotger L (2013) A new balance index for phylogenetic trees. Math Biosci 241(1):125–136. 10.1016/j.mbs.2012.10.005 [DOI] [PubMed] [Google Scholar]
  10. Sackin MJ (1972) “Good’’ and “bad’’ phenograms. Syst Biol 21(2):225–226. 10.1093/sysbio/21.2.225 [Google Scholar]
  11. Shao K-T, Sokal RR (1990) Tree balance. Syst Zool 39(3):266. 10.2307/2992186 [Google Scholar]
  12. Blum MGB, François O (2005) On statistical tests of phylogenetic tree imbalance: The Sackin and other indices revisited. Math Biosci 195(2):141–153. 10.1016/j.mbs.2005.03.003 [DOI] [PubMed] [Google Scholar]
  13. Fischer M (2021) Extremal values of the Sackin tree balance index. Ann Comb 25(2):515–541. 10.1007/s00026-021-00539-2 [Google Scholar]
  14. Kirkpatrick M, Slatkin M (1993) Searching for evolutionary patterns in the shape of a phylogenetic tree. Evolution 47(4):1171–1181. 10.1111/j.1558-5646.1993.tb02144.x [DOI] [PubMed] [Google Scholar]
  15. Blum MG, François O (2006) Which random processes describe the tree of life? A large-scale study of phylogenetic tree imbalance. Syst Biol 55(4):685–691. 10.1080/10635150600889625 [DOI] [PubMed] [Google Scholar]
  16. Colless D (1982) Review of “Phylogenetics: The theory and practice of phylogenetic systematics’’. Syst Zool 31(1):100–104. 10.2307/2413420 [Google Scholar]
  17. Heard SB (1992) Patterns in tree balance among cladistic, phenetic, and randomly generated phylogenetic trees. Evolution 46(6):1818–1826. 10.1111/j.1558-5646.1992.tb01171.x [DOI] [PubMed] [Google Scholar]
  18. Bartoszek K, Coronado TM, Mir A, Rosselló F (2021) Squaring within the Colless index yields a better balance index. Math Biosci 331:108503. 10.1016/j.mbs.2020.108503 [DOI] [PubMed] [Google Scholar]
  19. Knüver L, Fischer M, Hellmuth M, Wicke K (2024) The weighted total cophenetic index: A novel balance index for phylogenetic networks. Discret Appl Math 359:89–142. 10.1016/j.dam.2024.07.037 [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data sharing is not applicable to this article as no new data were created or analyzed in this study.


Articles from Bulletin of Mathematical Biology are provided here courtesy of Springer

RESOURCES