Abstract
Rooted binary galled trees generalize rooted binary trees to allow a restricted class of cycles, known as galls. We build upon the Wedderburn-Etherington enumeration of rooted binary unlabeled trees with n leaves to enumerate rooted binary unlabeled galled trees with n leaves, also enumerating rooted binary unlabeled galled trees with n leaves and g galls, . The enumerations rely on a recursive decomposition that considers subtrees descended from the nodes of a gall, adopting a restriction on galls that amounts to considering only the rooted binary normal unlabeled galled trees in our enumeration. We write an implicit expression for the generating function encoding the numbers of trees for all n. We show that the number of rooted binary unlabeled galled trees grows with , exceeding the growth of the number of rooted binary unlabeled trees without galls. However, the growth of the number of galled trees with only one gall has the same exponential order 2.4833 as the number with no galls, exceeding it only in the subexponential term, compared to . For a fixed number of leaves n, the number of galls g that produces the largest number of rooted binary unlabeled galled trees lies intermediate between the minimum of and the maximum of . We discuss implications in mathematical phylogenetics.
Keywords: Galled trees, Phylogenetics, Unlabeled trees
Introduction
Evolutionary histories of genes, populations, and species are often described by phylogenetic trees that seek to represent their descent relationships. Owing in part to the centrality of phylogenetic trees in evolutionary biology, mathematical studies have characterized numerous classes of phylogenetic trees, investigating their combinatorial properties (Semple and Steel 2003; Felsenstein 2004; Gascuel 2005; Steel 2016; Warnow 2018).
The use of tree structures—typically treated as binary—is often appropriate for representing standard phenomena of evolutionary descent, by which biological entities sequentially bifurcate, in a manner in which diverged entities do not merge back together. Processes such as genetic admixture, horizontal gene transfer, and hybridization, however, produce evolutionary relationships that are not tree-like. These processes involve the merging of separate lineages that had previously descended from shared ancestors. With increasing interest in merging mechanisms during evolutionary descent, much recent attention in mathematical phylogenetics has been devoted to phylogenetic networks (Huson et al. 2010; Gusfield 2014; Kong et al. 2022), in which graphs describing relationships of biological entities permit certain types of cycles.
Among the simplest phylogenetic networks are the galled trees, named for the growths that can appear in plant tissues to produce distinctively shaped structures (Gusfield et al. 2003, 2004b). First introduced in studies of ancestral recombination graphs (Wang et al. 2001; Gusfield et al. 2003, 2004a, b; Gusfield 2005, 2014; Song 2006), galled trees allow diverged lineages to merge forward in time, but only in circumscribed ways. Each merging event creates a gall corresponding to a cycle in the associated network.
From a standpoint that considers galled trees as mathematical objects separately from the processes that could produce them biologically, the defining feature of a galled tree is that cycles in a graph structure are disjoint, so that in a galled tree, a vertex or edge is contained in at most one cycle (Semple and Steel 2006). With this graph-theoretic sense for the meaning of galled trees, the enumerative combinatorics of galled trees has been investigated, both for unrooted and for rooted binary galled trees, focusing on galled trees that are leaf-labeled (Semple and Steel 2006; Bouvel et al. 2020; Cardona and Zhang 2020).
Chang et al. (2018) and Mathur and Rosenberg (2023) have posed the problem of enumerating rooted binary galled trees in which the leaves are not labeled. In a study focused on introducing encodings for galled trees, Chang et al. (2018) argued that the number of rooted binary unlabeled galled trees with n leaves is bounded above by a sequence with a certain generating function. In an enumerative study of labeled histories for rooted binary leaf-labeled galled trees, Mathur and Rosenberg (2023) enumerated a class of rooted binary unlabeled galled trees for n from 1 to 6, obtaining 1, 1, 2, 6, 20, 72. These values are indeed bounded above by the corresponding upper bounds of Chang et al. (2018)—1, 1, 4, 28, 245, 2402 for to 6—though Chang et al. (2018) used a more expansive definition of rooted binary unlabeled galled trees. They are also bounded above by the enumeration in Theorem 8 of Cardona and Zhang (2020) of the corresponding set of rooted binary labeled galled trees, which gives 1, 1, 6, 69, 960, 24,750 for to 6.
How many rooted binary unlabeled galled trees possess a given number of leaves n and a given number of galls g? Here, we perform this general enumeration, counting the rooted binary unlabeled galled trees of Mathur and Rosenberg (2023) for leaves and galls. We first recursively enumerate all such rooted binary unlabeled galled trees with a specified number of leaves n, considering all possible numbers of galls. We then refine this enumeration by subdividing it according to specified numbers of leaves n and galls g, considering all possible values of g for a fixed n.
Background
Definitions
We follow Mathur and Rosenberg (2023) in describing key concepts, assuming that all networks and trees are binary (and henceforth dropping the term binary). A rooted phylogenetic network is a directed acyclic graph with four properties: (i) there exists a unique root node with in-degree 0 and out-degree 2; (ii) all leaf nodes have in-degree 1 and out-degree 0; (iii) all non-leaf, non-root nodes have in-degree 2 and out-degree 1 or in-degree 1 and out-degree 2; and (iv) all edges are directed away from the root. Nodes with in-degree 2 and out-degree 1 are termed reticulation nodes, and nodes with in-degree 1 and out-degree 2 are tree nodes.
A rooted galled tree is a rooted (binary) phylogenetic network in which two properties hold (Fig. 1). First, (i) each reticulation node has a unique ancestor node r such that exactly two non-overlapping paths of edges exist from r to ; if the direction of edges is ignored, then the two paths connecting r and form a cycle , known as a gall. Following Mathur and Rosenberg (2023), the ancestor node r must be separated from by at least two edges. This requirement that cycles contain at least four nodes is required by the perspective of Mathur and Rosenberg (2023) that views galled trees as evolving temporally by a biological process such as hybridization. It is equivalent to the requirement that a galled tree be a normal network and is not imposed in a more expansive galled tree definition that permits 3-node galls (Kong et al. 2022).
Fig. 1.

Rooted galled trees. A A rooted galled tree. In our definition of rooted galled trees, this example is the smallest network that possesses a gall. The gall is a root gall; node 1 is r, the top node; node 2 is the left hybridizing side node; node 4 is the right hybridizing side node; finally, node 3 is , the hybrid node. We depict the hybridizing side nodes and the hybrid node in a horizontal line, representing simultaneity of these nodes in the embedding of the rooted galled tree in time, proceeding from the top to the bottom of the diagram. B A network that does not satisfy our definition of a rooted galled tree, but that does qualify according to some definitions. This network is missing a hybrid node; each gall in our definition possesses at least four nodes. C A more complex rooted galled tree by our definition. D A more complex network that is not a rooted galled tree by our definition, because the red triangle lacks a hybrid node
The second criterion is: (ii) the set of nodes in the gall , associated with reticulation node , and the set of nodes in the gall , associated with reticulation node , are disjoint.
We term the ancestor node r of a gall with reticulation node and cycle the top node. Other nodes in a gall, excluding the top node and reticulation node, are called side nodes. We term the reticulation node a hybrid node, and the two immediate parents of a hybrid node hybridizing side nodes, or just hybridizing nodes. The side nodes to the left and right of the hybrid node are the left side nodes and right side nodes, respectively; the distinction between “left” and “right” is only for convenience, and a gall is invariant with respect to exchange of its left and right side nodes. If the root node of a rooted galled tree is part of a gall, then we call this gall the root gall. The root node is always a top node if it is part of a gall.
Although a rooted galled tree is only strictly a tree if it contains no galls, it is convenient to continue to refer to galled trees as trees; similarly, we allow “subtrees” to possess galls. All networks and trees that we consider are rooted, and we henceforth drop the term rooted. Mathur and Rosenberg (2023) focused on labeled galled trees, in which each leaf is associated with a distinct leaf label; here we consider unlabeled galled trees, and we often drop the term unlabeled. Unlike Mathur and Rosenberg (2023), we have no need to assign a temporal embedding to nodes, with the exception that ancestor nodes can be no more recent than their descendants; the galled trees that we consider are understood to be unordered.
Compositions
We will have occasion to consider the sums of ordered b-tuples of positive integers that equal a positive integer a: the compositions C(a, b) of a into b parts, . The set C(a, b) has cardinality . This result is obtained by noting that a list of a copies of the number 1 has “breakpoints” between consecutive 1’s, and, summing 1’s between neighboring breakpoints, the compositions into b parts are produced by the distinct sets of among the breakpoint locations. For , we define , with .
We distinguish between palindromic and non-palindromic compositions. Palindromic compositions are unchanged when the order of the parts is reversed; non-palindromic compositions do change when the order is reversed. For example, in C(9, 5), (1, 2, 3, 2, 1) is a palindromic composition; (1, 2, 2, 3, 1) is non-palindromic. We denote the palindromic compositions of a into b parts by and the non-palindromic compositions of a into b parts by , such that and .
The numbers of palindromic and non-palindromic compositions, and , appear in Table 1. Palindromic compositions are counted by counting the ways to place breakpoints on the “left” half of a list of 1’s; breakpoints are then palindromically placed on the “right” half. Distinct cases exist depending on the parity of a and b. For non-palindromic compositions, we obtain from .
Table 1.
The number of palindromic compositions of a into b parts, , and the corresponding number of non-palindromic compositions
| Parity of a | Parity of b | ||
|---|---|---|---|
| Even | Even | ||
| Even | Odd | ||
| Odd | Even | 0 | |
| Odd | Odd |
Unlabeled Trees with n Leaves
Our approach to enumerating unlabeled galled trees extends the Wedderburn-Etherington enumeration of unlabeled trees with no galls. For unlabeled trees with no galls, the root of a tree with leaves possesses two immediate subtrees. Assume without loss of generality that the number of leaves in the “left” subtree is greater than or equal to the number of leaves in the “right” subtree. If is odd, then , the number of unlabeled trees with n leaves, is obtained by considering the possible numbers of leaves k for the right subtree, for each k pairing all unlabeled trees with k leaves for the right subtree with all unlabeled trees with leaves for the left subtree. If n is even, then the enumeration is similar for ; if , however, then we have ways choosing two distinct subtrees for the left and right subtrees and ways of choosing two copies of the same subtree.
The recursion for is [e.g. Harding (1971), Felsenstein (2004)]:
| 1 |
With , the generating function for the satisfies ( Comtet 1974, p. 55):
| 2 |
The number of trees with no galls has exponential growth with , for constants and ( Harding 1971; Landau 1977; Flajolet and Sedgewick 2009, p. 65).
The Maximum Number of Galls for Galled Trees with n Leaves
For a fixed number of leaves n, the number of galls that a galled tree can possess is constrained as a function of n. Because a gall contains at least three descendant subtrees—those descended from the two hybridizing nodes and the hybrid node—a minimum of leaves is required before a tree can possess a gall. Each successive addition of a gall then replaces one subtree with a minimum of three subtrees—those descended from the two hybridizing nodes and the hybrid node of the new gall—so that each gall adds at least two leaves. It follows that a galled tree with n leaves can have at most galls (Mathur and Rosenberg 2023).
Unlabeled Galled Trees with n Leaves
We are now ready to enumerate unlabeled galled trees. We denote by the number of unlabeled galled trees with n leaves. Trees with or leaves have no galls: and . To recursively evaluate for leaves, we sum counts from two cases: (1) the root is not the top node of a gall; (2) the root is the top node of a gall. We count galled trees in the former case in , with , and we count galled trees in the latter case in , with . The goal is to evaluate
| 3 |
Root is not a Top Node of a Gall
If the root is not the top node of a gall, then an unlabeled galled tree possesses two immediate subtrees of the root, each of which is itself an unlabeled galled tree (Fig. 2A). The number of unlabeled galled trees then follows a recursion analogous to Eq. 1:
| 4 |
It is convenient to express in a form that considers compositions of n into 2 parts. For odd ,
| 5 |
For even n,
| 6 |
Fig. 2.

Recursive enumeration of rooted galled trees. Triangles indicate unspecified subtrees with at least one leaf. A The root is not a top node of a gall (Eq. 4). B, C The root is a top node of a gall with an even number of subtrees (Eq. 9). The two trees show the two cases with : (B) and (C). D The root is a top node of a gall with an odd number of subtrees and (Eq. 10). In this case, and . E The root is a top node of a gall with an odd number of subtrees, , and the composition of n leaves descended from the root gall into parts representing k subtrees is non-palindromic (Eq. 11). In this case, and . Different outline colors for triangles indicate different numbers of leaves in associated subtrees. F, G The root is a top node of a gall with an odd number of subtrees, , and the composition of n leaves into k parts is palindromic (Eq. 12). In both trees, and ; trees with distinct (F) and identical (G) lists of galled subtrees for left and right subtrees are depicted. Different outline colors for triangles indicate different numbers of leaves in subtrees, and different patterns in the same color indicate different topologies with equally many leaves (Color figure online)
Root is a Top Node of a Gall
If the root is a top node of a gall, then the recursion is more complex. We first count subtrees of the root gall, equal to the count of all side nodes plus the hybrid node. Suppose the root gall contains k subtrees. We have the constraint , as a gall has at least 3 subtrees (of the left hybridizing node, hybrid node, and right hybridizing node), and the root gall can have as many as n subtrees, each ancestral to a single leaf.
Without loss of generality, we can assume that the number of right side nodes in the root gall, r, is less than or equal to the number of left side nodes ; owing to the existence of the hybrid node, . We divide the case further based on the parity of k, writing
| 7 |
where and count unlabeled trees with n leaves in which the root is a top node and the number of descendant subtrees of the root gall is even and odd, respectively.
Even Number of Subtrees of the Root Gall
We consider each even value of k, for . Given k, because , r ranges from 1 to . Because and k is even, we have the strict inequality .
Consider the k subtrees in an order that proceeds from the most ancestral left side node descending through subsequent left side nodes to the left hybridizing node, then to the hybrid node, then the right hybridizing node, and then through ancestors to the most ancestral right side node (Fig. 2B, C). Once k and r have been specified, we consider all possible ways of placing the n leaves into the k subtrees of the gall: the compositions C(n, k) of n into k parts. For each composition in C(n, k), where is the value of the ith term, the number of galled trees is . We have
| 8 |
The summand , representing the number of distinct lists of 2a subtrees with total number of leaves n, does not depend on r, the number of those subtrees descended from right side nodes. For , a sum from to is empty. We can therefore simplify Eq. 8 to obtain, for all ,
| 9 |
Odd Number of Subtrees of the Root Gall
If k is odd, then we consider for . In this case, with , we have . With and k odd, is possible.
If , then r ranges from 1 to (Fig. 2D). We follow the reasoning of the case of even k and find that this case contributes a number of galled trees equal to
| 10 |
Consider . For non-palindromic compositions of n into k parts representing the k subtrees of the gall, an equivalent unlabeled galled tree is obtained by a mirror-image composition that corresponds to an exchange of left and right subtrees of the root gall (Fig. 2E). We therefore multiply by to account for the fact that each unlabeled galled tree is counted twice, so that the non-palindromic compositions of n contribute a number of galled trees equal to
| 11 |
With , for a palindromic composition of n into k parts representing the k subtrees of the root gall (Fig. 2F, G), we can select two distinct lists of galled subtrees for the a left and the a right subtrees; the number of ways to do so is . Alternatively, we can select the same lists of galled subtrees for the a left and a right subtrees, in ways. Any choices for the left and right subtrees can be combined with choices for the subtree of the hybrid node of the gall. The palindromic compositions produce a number of galled trees equal to
| 12 |
Summing the three cases in Eqs. 10, 11, and 12, we obtain
| 13 |
We can simplify this expression further. For a palindromic composition , by definition of palindromic compositions, for . is the disjoint union of and . We can then write
Equation 13 then becomes
Note that for , a sum from to is empty. Therefore, for all ,
| 14 |
Summary
To summarize the enumeration, the desired number of unlabeled galled trees with n leaves, , can be calculated in Eq. 3 by summing Eqs. 9 and 14 in Eq. 7, and then adding the result to from Eq. 4.
We simplify by writing the sum of and the first term of in one expression. When adding the even-k terms in and the odd-k terms , k now ranges from 3 to n, considering even values of k with and , and odd values of k with and . For , , and for , as well.
Recalling Eqs. 5 and 6, we can now write a simplified expression for the recursion for . First, . If the number of leaves n of an unlabeled galled tree is an odd value , then
| 15 |
If the number of leaves n is even, then an extra term appears:
| 16 |
Example
To illustrate the recursive enumeration of rooted galled trees, we enumerate the rooted galled trees with 5 leaves. The base case in Eqs. 15 and 16 is . To evaluate , we first evaluate , , and .
Trees with Two Leaves
Only one galled tree has two leaves: the 2-leaf tree with no galls (Table 2). Equation 16 recovers this result: (Eq. 4), (Eq. 9), and (Eq. 14), so that .
Table 2.
Rooted unlabeled galled trees with at most 5 leaves
| Number of leaves | Tree number | Number of galls | Galled tree |
|---|---|---|---|
| 1 | 1 | 0 | |
| 2 | 1 | 0 | |
| 3 | 1 | 0 | |
| 3 | 2 | 1 | |
| 4 | 1 | 0 | |
| 4 | 2 | 1 | |
| 4 | 3 | 0 | |
| 4 | 4 | 1 | |
| 4 | 5 | 1 | |
| 4 | 6 | 1 | |
| 5 | 1 | 0 | |
| 5 | 2 | 1 | |
| 5 | 3 | 0 | |
| 5 | 4 | 1 | |
| 5 | 5 | 1 | |
| 5 | 6 | 1 | |
| 5 | 7 | 0 | |
| 5 | 8 | 1 | |
| 5 | 9 | 1 | |
| 5 | 10 | 2 | |
| 5 | 11 | 1 | |
| 5 | 12 | 1 | |
| 5 | 13 | 1 | |
| 5 | 14 | 2 | |
| 5 | 15 | 1 | |
| 5 | 16 | 1 | |
| 5 | 17 | 1 | |
| 5 | 18 | 1 | |
| 5 | 19 | 1 | |
| 5 | 20 | 1 |
Galled trees with different numbers of galls appear in different colors (0, black; 1, orange; 2, purple). For each number of leaves n, we enumerate galled trees in a canonical order. We recursively proceed through trees in which the root is not a top node of a gall, incrementing the number of leaves in the right subtree. Next, for trees in which the root is a top node, we proceed in increasing order of the number of subtrees of the root gall; for fixed numbers of subtrees, we proceed in dictionary order of values; for fixed , we use reverse dictionary order of the compositions of leaves into subtrees of the gall. The canonical order is used in proceeding through subtrees of a fixed size
Trees with Three Leaves
For , there are two galled trees (Table 2). Using Eqs. 4, 9, and 14, we have
Summing , representing the tree with leaves and no galls, , and for the unique tree containing a gall, we have .
Trees with Four Leaves
For , the number of galled trees is 6. We use Eqs. 4, 9, and 14 to obtain
counts trees 1 to 3 in Table 2. counts the unique tree with subtrees of the root gall, tree 6. counts trees with subtrees of the root gall, trees 4 and 5. Summing the three quantities, .
Trees with Five Leaves
We are now ready for the calculation of , which produces galled trees with five leaves (Table 2).
enumerates trees in which the root is not a top node of a gall, trees 1 to 8 for in Table 2. enumerates trees with subtrees of the root gall, trees 15 to 18. enumerates trees for which the root gall has (trees 9 to 14) or subtrees (trees 19 and 20). , , and sum to .
Unlabeled Galled Trees with n Leaves and g Galls
A salient feature of a galled tree is its number of galls. Having enumerated unlabeled galled trees with n leaves, we now proceed to subdivide the calculation according to the number of galls: the number of galled trees with n leaves is a sum over g from 0 to of the number of galled trees with n leaves and g galls.
We denote by the number of galled trees with n leaves and g galls. Because the maximum number of galls with n leaves is (Sect. 2.4), we define for . As the unique galled tree with leaf has no galls, the base case is .
Again we separate two cases: (1) the root is not the top node of a gall, and (2) the root is the top node of a gall. For the former case, we denote the count by , with . For the latter case, we denote the count by , with for all g. We seek to obtain . We use reasoning that parallels the case in which we do not keep track of the number of galls (Sect. 3).
Note that when summing over all possible values of g, for , we have
Root is not a Top Node of a Gall
If the root is not a top node, then a tree with leaves and g galls can be decomposed into two subtrees. We assign one of these trees m leaves, , and h galls, . If n and g are both even, then it is possible for the two subtrees to be identical. Similarly to Eq. 4, we have
| 17 |
Note that in this equation, we can replace with g; for , in the summand is zero, as a tree with m leaves has at most galls.
We write another expression for by considering compositions of n into two parts representing the numbers of leaves in the left and right subtrees. We also decompose g; because entries in a composition are strictly positive, we consider compositions of , noting that each entry of the composition exceeds the associated number of galls by 1.
For (n, g) in which n or g is odd and , similarly to Eq. 5,
| 18 |
For (n, g) both even, as in Eq. 6,
| 19 |
Root is a Top Node of a Gall
In the case of a root gall, we distribute among the subtrees of the root gall galls, as one of the g galls is the root gall. We again distinguish between even and odd numbers of subtrees of the root gall, writing , where gives the number of trees with n nodes and g galls in which the root is a top node and the root gall has an even number of descendant subtrees, and gives the corresponding number of trees with an odd number of descendant subtrees of the root gall. We follow our reasoning of Sects. 3.2.1 and 3.2.2.
Even Number of Subtrees of the Root Gall
Suppose the number of the subtrees of the root gall is even, , . As in Sect. 3.2.1, given k, the number of right side nodes of the root gall, r, ranges from 1 to .
Here, however, we consider all ways of distributing galls across subtrees. Just as n leaves are placed into 2a subtrees by a composition of n into 2a parts, galls are placed into 2a subtrees by a composition of into 2a parts. By decomposing , we allow for the possibility of 0 galls in a subtree; in a composition of , the number of galls in entry is .
For all (n, g) with and , the resulting number of trees is similar to Eq. 9:
| 20 |
Odd Number of Subtrees of the Root Gall
For an odd number of subtrees of the root gall , , as in Sect. 3.2.2, r ranges from 1 to a. Again we consider with (as in Eq. 10) and add the case (as in Eqs. 11 and 12).
If , then similarly to the case of even k, the number of galled trees with k subtrees descended from a root gall of a tree with leaves and galls is
| 21 |
If , then we again distinguish between non-palindromic and palindromic compositions of n leaves into the k subtrees. Non-palindromic compositions do not result in symmetric trees, irrespective of the way the galls are placed across the subtrees. Therefore, considering only the non-palindromic compositions, similarly to Eq. 11, the number of galled trees with leaves and galls is
| 22 |
Finally, for the palindromic compositions of n leaves with k odd and , we distinguish between cases with palindromic and non-palindromic compositions describing the placement of the g galls across k subtrees. For the non-palindromic compositions, similarly to Eq. 22, the number of trees is
| 23 |
If both the composition of n leaves and the composition of galls are palindromic, then, as in our reasoning for Eq. 12, we can choose either two distinct or two identical lists of subtrees for the a left subtrees and the a right subtrees, and the number of trees is
| 24 |
We sum Eqs. 21, 22, 23, and 24 to obtain
| 25 |
Summary
As , we summarize by adding Eqs. 17, 20, and 25. and for . For (n, g) with leaves and galls, if n is odd, g is odd, or both n and g are odd, then
| 26 |
If both n and g are even, then an extra term appears:
| 27 |
Example: 1 Gall
After the galled trees with no galls (Sect. 2.3), the next simplest case for enumeration of galled trees is the galled trees with only one gall. For this case, if there is no root gall, then the one gall must be in exactly one of the two subtrees descended from the root. The other subtree is a tree with no galls. If a root gall is present, then there are no other galls, and all subtrees descended from the root gall are trees with no galls.
For , . For , using the odd case Eq. 26, the first term of Eq. 26 when is
The second term is
The third term is
Summing the three terms, the number of galled trees with leaves and gall is:
| 28 |
Generating Functions
We now derive and analyze generating functions for , the number of galled trees with n leaves, and , the number of galled trees with n leaves and 1 gall. We also show that the exponential growth of proceeds faster with n than the exponential growth of , the number of trees without galls—but that and follow the same exponential growth.
To analyze the generating functions, we will need the values of for small n and for small n and g. Hence, we use our recursions to exhaustively calculate the number of galled trees with n leaves, (Eqs. 15 and 16), and the number of galled trees with n leaves and galls, (Eqs. 26 and 27). Considering n from 1 to 18, the numerical values appear in Table 3.
Table 3.
Numbers of galled trees with specified numbers of leaves and galls
| Number of | Total number | Number of trees with a fixed number of galls () | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| leaves (n) | of trees () | |||||||||
| 1 | 1 | 1 | – | – | – | – | – | – | – | – |
| 2 | 1 | 1 | – | – | – | – | – | – | – | – |
| 3 | 2 | 1 | 1 | – | – | – | – | – | – | – |
| 4 | 6 | 2 | 4 | – | – | – | – | – | – | – |
| 5 | 20 | 3 | 15 | 2 | – | – | – | – | – | – |
| 6 | 72 | 6 | 48 | 18 | – | – | – | – | – | – |
| 7 | 272 | 11 | 148 | 107 | 6 | – | – | – | – | – |
| 8 | 1064 | 23 | 435 | 528 | 78 | – | – | – | – | – |
| 9 | 4271 | 46 | 1250 | 2295 | 661 | 19 | – | – | – | – |
| 10 | 17,497 | 98 | 3512 | 9185 | 4356 | 346 | – | – | – | – |
| 11 | 72,843 | 207 | 9726 | 34,503 | 24,564 | 3776 | 67 | – | – | – |
| 12 | 307,307 | 451 | 26,587 | 123,612 | 123,825 | 31,289 | 1543 | – | – | – |
| 13 | 1,310,792 | 983 | 71,975 | 426,218 | 574,149 | 216,501 | 20,720 | 246 | – | – |
| 14 | 5,643,555 | 2179 | 193,200 | 1,425,011 | 2,493,129 | 1,316,450 | 206,644 | 6942 | – | – |
| 15 | 24,493,270 | 4850 | 515,051 | 4,643,119 | 10,269,351 | 7,254,224 | 1,695,084 | 110,647 | 944 | – |
| 16 | 107,043,258 | 10,905 | 1,364,896 | 14,804,696 | 40,496,606 | 36,980,263 | 12,063,205 | 1,291,278 | 31,409 | – |
| 17 | 470,668,034 | 24,631 | 3,598,794 | 46,336,619 | 153,960,249 | 176,934,884 | 76,980,753 | 12,248,152 | 580,235 | 3717 |
| 18 | 2,080,681,402 | 56,011 | 9,447,028 | 142,720,317 | 567,348,929 | 803,058,979 | 450,309,678 | 99,840,890 | 7,756,699 | 142,871 |
Generating Function for
Define a generating function , We rewrite Eqs. 15 and 16 in a single equation. To do so, we note and define and for non-integer values of n. For , we then have
| 29 |
We write the terms of the generating function with three components:
| 30 |
The first term has the form of twice the generating function for the Wedderburn-Etherington numbers (Eq. 2):
| 31 |
For the second term,
| 32 |
| 33 |
The step in Eq. 32 makes use of . Equation 33 is obtained if and only if the sum in the previous step converges; that is, if and only if .
Finally, for the third term,
| 34 |
| 35 |
Equation 34 holds because , and the last equality holds if and only if .
Labeling the radius of convergence of by and inserting into Eq. 30 the quantities in Eqs. 31, 33 and 35, for ,
| 36 |
Growth of
We now address the asymptotic growth of . In particular, we show that the number of galled trees grows exponentially faster in the number of leaves n than the corresponding number of trees without galls.
First, note that the radius of convergence is a positive constant less than 1. The convergence radius of generating function for the (Eq. 2) is a value , and in particular, (p. 262 Landau 1977). Because for all , for all . Hence, we have ; in the Appendix, we show .
Also note that because , for . Because for and increases monotonically for , for .
To find the asymptotic growth from the generating function for galled trees, , we use the asymptotics of implicit tree-like classes theorem (Meir and Moon 1989a, b; Flajolet and Sedgewick 2009, pp. 467–468). This theorem describes the asymptotic growth of the coefficients of a generating function that is described implicitly, such as in Eq. 36. We write , and we denote .
To use the theorem, we must first show that the function , defined by , belongs to the smooth implicit-function schema. Indeed, the necessary conditions are satisfied:
is analytic in t and w around 0 from Eq. 36 and the positive convergence radius .
.
for .
, which is verified by noting that the term in the right-hand side of Eq. 36 is equal to .
, which follows from , and , which is verified from the series expansion of Eq. 36.
From Eq. 33, there exists a coefficient for and : for example, .
- The last condition, which we show below, is that there are solutions and for the characteristic system:
37 38
According to the theorem, functions belonging to the smooth implicit-function schema converge at the solution to the characteristic system, where they possess a square-root singularity. We conclude that converges at , with , and that , where
| 39 |
It remains to show condition (7). We can write as:
| 40 |
where and . Taking the derivative with respect to w, we have
| 41 |
We do not know the value of that appears in . is monotonically increasing with ; because is less than the radius of convergence , converges at and is a finite constant. As shown above, . To find numerically, we first note that Eq. 41 depends on t only through . Hence, we can traverse values of , numerically solving Eq. 38 for the single variable w in terms of y. Solutions for w must satisfy , as by the monotonicity of .
Next, we see that Eq. 40 contains variables w, y, and t; using the pairs (w, y) obtained in the previous step, we numerically solve Eq. 37 for t in terms of w and y. In the third step, for each triple (t, w, y), we insert the value of t into the generating function , where values are taken from Table 3; we retain triples with small . Note that in this step, we could instead have retained triples with small ; faster convergence of compared to with a fixed number of known values of suggests that a more accurate result is obtained by use of y rather than w.
Finally, the best-fit triple (t, w, y) gives the numerical solution for , or (0.2073, 0.3550, 0.0450). We hence have for the radius of convergence, and . The radius is indeed lower than . Taking additional digits, the exponential order of the sequence is approximately , greater than that of the sequence for trees without galls ().
To calculate the asymptotic approximation to , we evaluate the constant . We have:
We numerically evaluate the derivative from the first 25 terms by . Inserting for t and for w, we have , , , by Eq. 39, and
| 42 |
Generating Function for
We next find the generating function of , , writing . We define , and recall that and that Eq. 28 applies for . We then have for
| 43 |
We can now write
| 44 |
As in the derivation of , we calculate the three parts separately.
First, because for ,
| 45 |
For the second term, the derivation is identical to that of Eq. 33:
| 46 |
Analogously to Eq. 33, Eq. 46 relies on a summation that can be completed if and only if , that is, for (Landau 1977, Eqs. 4 and 5).
Finally, for the third term, following the derivation of Eq. 35,
| 47 |
where the last equality holds if and only if . Because , for , and by the monotonicity of for , for .
Summarizing Eqs. 44, 45, 46, and 47, for ,
Solving for , we have
| 48 |
Growth of
We now show that the asymptotic growth of the number of galled trees with one gall follows the asymptotic exponential growth of the number of trees with no galls. We also find the asymptotic approximation of .
First, . From the form of Eq. 48, converges if and only if . It is shown in Eqs. 4 and 5 of Landau (1977) that for , with . We conclude that has the same radius of convergence as . To find the asymptotic behavior of , we notice that
| 49 |
As , with and (Flajolet and Sedgewick 2009, pp. 476–477), . Hence, the first of the two terms in Eq. 49 is the leading term as , producing
| 50 |
At this point, we seek to use transfer theorems to transfer the asymptotic equivalence for to an asymptotic equivalence for its coefficients. To do so, we note that satisfies the technical criterion that it is -analytic at —that is, it is analytic in a domain of particular shape around the singularity at . The computation of from maintains the property that is -analytic with a singularity at .
We can therefore use a transfer formula [Corollary VI.1, page 392 and Theorem VI.4, page 393 in Flajolet and Sedgewick (2009)], according to which, if f(t) is -analytic with a singularity at b, and as with t in , and , then the coefficients of f satisfy . Using Eq. 50, we apply the transfer formula to with in the role of b and for a, noting :
| 51 |
and have the same exponential growth. Whereas has subexponential term , however, has larger subexponential term .
Bivariate Generating Function for
We now find the bivariate generating function . First, note that for each . For , and for . From the recursion for (Eqs. 26, 27), we get
| 52 |
where if at least one of is not in .
We can solve to find an expression for in a manner similar to the solution for . For the second and third terms, we have and ; in these terms, the gth gall is the root gall. Therefore,
| 53 |
| 54 |
| 55 |
In summary, inserting Eqs. 53, 54, and 55 into Eq. 52,
| 56 |
The Distribution of the Number of Galled Trees with a Fixed Number of Leaves
The bivariate generating function provides a basis for studying the distribution of the number of galls across galled trees with n leaves. The approach follows a theorem concerning asymptotic distributions in Theorem 2.23 of Drmota (2009) and Proposition IX.17 on p. 682 of (Flajolet and Sedgewick 2009). We use the form of the theorem quoted in Theorem 2 of Bouvel et al. (2020), who considered labeled galled trees. We conclude that for a fixed number of leaves, the number of galled trees as a function of the number of galls g is asymptotically normally distributed with mean and variance linear in n.
Following Bouvel et al. (2020), we consider a power series C(z, x) in two variables that is defined implicitly as the solution of , where F satisfies certain conditions. We suppose is a sequence of random variables such that . Then is asymptotically normally distributed with a mean and variance that are linear multiples of n calculated from F.
In our scenario, t, u, and play the roles of z, x, and C. is implicitly defined as a function of t, u, and itself. With , gives the random number of galls in a randomly selected galled tree with n leaves. Fixing the number of leaves n in , this random variable satisfies
| 57 |
To conclude that random variable —the random number of galls in a tree with n leaves—is normally distributed, it remains only to verify the conditions of the theorem.
Translating from the notation of Bouvel et al. (2020) and writing with so that , we must show all of the following:
is analytic in t, u, w around 0.
.
for .
All coefficients are real and nonnegative.
- Nonnegative solutions exist for the following pair of equations:
- The solutions satisfy:
Condition 1 holds because is a quotient of analytic functions in t, u, w with denominator greater than 0 near (0, 0, 0). Condition 2 is met because (because for all ) and so . Therefore,
For condition 3, , which is not equal to 0 for . Condition 4 holds trivially from the definition of . For conditions 5 and 6, we first show . First,
Next, we have
We then have . We have already shown conditions 5 and 6 in our analysis of function .
With all the conditions demonstrated, we conclude that the random number of galls in a tree with n leaves is normally distributed.
Numerical Results
The numerical results for the number of galled trees with n leaves and the number of galled trees with galls suggest a number of simple observations (Table 3). First, for , we recover the Wedderburn-Etherington numbers obtained from Eq. 1. For to 6, we obtain the values of and computed by Mathur and Rosenberg (2023). Finally, as g is bounded above by , pairs of consecutive values of n, an odd then an even integer, have the same number of values of g for which the number of galled trees is nonzero, namely .
Considering a fixed number of leaves , we comment informally on the number of galled trees across different values of g. The number of trees with at least one gall is larger than the number without galls. As g increases for fixed n, the number of trees increases to a maximum, then declines. For values of n for which the maximal number of galls is even (), the largest number of trees occurs when the number of galls is , half of this maximum. When is odd (), the largest number of trees occurs at or .
Figure 3 plots the number of trees for fixed n as a function of g, considering four consecutive values of n that represent the four cases possible for the parity of n and . The plots are somewhat symmetric; for and 17, a neighboring value of g produces a number of trees close to the maximum, and for and 18, the peak stands out more clearly. The patterns accord with the asymptotic normal distribution demonstrated for the number of galls as n increases (Sect. 5.6).
Fig. 3.
Number of galled trees as a function of the number of galls g, for fixed numbers of leaves n. A . B . C . D . Values are computed from Eqs. 26 and 27
Figure 4 examines the growth of on a logarithmic scale for different fixed values of g. The number of trees with no galls has exponential growth , for constants and (Sect. 2.3). With one gall, exceeds with growth for , but with the same exponential growth (Sect. 5.4). With specified numbers of galls , we see that growth of for fixed g appears to also follow an exponential trend.
Fig. 4.

Number of galled trees as a function of the number of leaves n, for fixed numbers of galls . Values are computed from Eqs. 26 and 27. The y-axis appears on a logarithmic scale
Discussion
Building on the Wedderburn-Etherington recursion for enumerating rooted binary unlabeled trees with n leaves (Eq. 1), we have introduced a recursion to enumerate rooted binary unlabeled (normal) galled trees with n leaves. The recursion follows the spirit of the Wedderburn-Etherington formula in its recursive descent from the tree root—but with additional terms for cases in which the root of the tree is also the top node of a gall (Eqs. 15 and 16). Continuing with a similar recursive strategy, we have also obtained a recursive formula for the number of galled trees with a fixed number of leaves n and a fixed number of galls g (Eqs. 26 and 27). We have derived generating functions for the number of galled trees (Eq. 36) and for the number of galled trees with 1 gall (Eq. 48), analyzing their asymptotic behavior.
Our numerical calculations find that for small n, for a fixed number of galls g, the increase of the number of galled trees with n appears faster for larger values of the fixed number of galls g (Table 3, Fig. 4). Because for , for higher values of g, values of at small n do not reflect the asymptotic trend. Nevertheless, for , the initial apparent rapid growth of visible with increasing n moderates, in accord with the finding that the exponential order of the increase is the same as for the case of no galls (Sect. 5.4). A similar moderation in growth with increasing n is just observable for , which could hint at a similar exponential growth; we can conjecture that each with fixed g has the same exponential growth. Note that Fuchs et al. (2019, Theorem 5.1; 2022, Theorem 1 and Corollary 2) showed that the exponential growth of labeled tree-child networks and normal networks with a fixed number of reticulation vertices (corresponding to a fixed number of galls in our case) is the same for any such number; only the subexponential growth differs.
On the other hand, when g is not restricted, we have shown in Sect. 5.2 that the convergence radius of the generating function satisfies , so that grows with . We also observed that the number of galled trees grows numerically faster with n than does the number of trees with no galls (Table 3).
For a fixed number of leaves n, the number of trees with a fixed number of galls increases to a maximum when the number of galls is at or near half the maximum number of galls , then decreases. This pattern accords with the normal distribution we expect as n increases (Sect. 5.6). It is explained by the fact that many ways often exist to add a gall to a tree with a small number of galls without changing the number of lineages n (Fig. 5A). As the number of galls grows, fewer places are available in the tree to add more galls (Fig. 5B), and the number of possible trees declines. Informally, for a tree with n leaves, when we have a maximum of potential galls from which to choose, the binomial , describing the number of possible subsets containing g galls, is highest for g near .
Fig. 5.
The addition of a gall to a galled tree. A One of the galled trees with 5 leaves and 0 galls. B The ten galled trees with 5 leaves and 1 gall produced by adding a gall to the tree in (A). The gall is indicated in blue. C The sole galled tree with 5 leaves and 2 galls produced by adding a gall to the tree highlighted in (B). The new gall is indicated in orange. A tree with 5 leaves has at most 2 galls (Color figure online)
Galled trees provide a class of networks for use with biological processes such as admixture of populations, horizontal gene transfer, hybridization, and the recombination processes for which galled trees were originally introduced (Wang et al. 2001; Gusfield et al. 2004a). Other definitions of galled trees have previously been considered in enumerative problems (Semple and Steel 2006; Chang et al. 2018; Bouvel et al. 2020; Cardona and Zhang 2020); our definition, which requires galled trees to be “normal” by imposing a minimum of four nodes per gall, is designed for scenarios in which two lineages merge to form a new third lineage, but continue to have other descendants that are not descended from this merging event. Such scenarios are suited to phenomena such as admixture and hybridization, in which the merging process of two groups to form a third group has this feature: it does not cause the disappearance of the original two groups, which are free to produce additional descendants through processes that do not involve admixture and hybridization.
In related work, Cardona and Zhang (2020) enumerated rooted binary labeled normal galled trees. Their Theorem 8 finds that the number of such trees with n leaves is
| 58 |
where C is the set of vectors of nonnegative integers satisfying . This enumeration accords with our enumeration of the corresponding unlabeled normal galled trees. For and 2, Eq. 58 produces 1 rooted binary labeled normal galled tree; for , it gives 6 labeled trees—in accord with our count of 2 unlabeled normal galled trees, each of which has 3 possible labelings. For , Eq. 58 gives 69 labeled trees; the 6 unlabeled normal galled trees in Table 2 have 12, 12, 3, 12, 6, and 24 labelings, respectively, summing to 69.
The enumeration of galled trees can assist in studies involving mixture processes in the same way that the Wedderburn-Etherington enumeration assists in evolutionary biology more generally, by describing the contents of a space of biologically relevant trees that must be traversed in a variety of algorithmic, combinatorial, probabilistic, and statistical problems [e.g. Harding (1971), Matsen and Evans (2012), Sievers et al. (2014), Colijn and Plazzotta (2018), Rosenberg (2021)]. The study adds to the growing area of enumerative combinatorics of phylogenetic networks [e.g. Bouvel et al. (2020), Cardona and Zhang (2020), Gunawan et al. (2020), Bienvenu et al. (2022), Fuchs et al. (2022)] and is one of relatively few studies to examine a class of unlabeled networks (Chang et al. 2018; Mathur and Rosenberg 2023). Further work can investigate the properties of for fixed .
Acknowledgements
We are grateful to M. Fuchs, B. Gittenberger, and a reviewer; comments from the reviewer and discussions with M. Fuchs and B. Gittenberger have assisted with the asymptotic analyses of the generating functions. We acknowledge NSF grant BCS-2116322 and the Scholarship for Outstanding Postdoctoral Fellows in Data Science given by the council for Higher Education of Israel.
Appendix
This appendix shows that the radius of convergence for the generating function is positive. The approach is to bound from above by a quantity whose generating function is known to have a positive radius of convergence. For clarity in comparing with Bouvel et al. (2020) and Cardona and Zhang (2020), we retain the terms “rooted,” “labeled” and “unlabeled,” and “normal” in this appendix.
Our number of rooted unlabeled normal galled trees is bounded above by , the corresponding number of rooted labeled normal galled trees in Theorem 8 of Cardona and Zhang (2020) (Eq. 58). is in turn bounded above by the number of rooted labeled galled trees tabulated without imposing the normality requirement, a quantity of Bouvel et al. (2020) that we call . Section 5 of Bouvel et al. (2020) showed that the exponential generating function has positive radius of convergence .
To prove that , we note that the number of rooted labeled normal galled trees with n leaves is . Here, the sum proceeds over the rooted unlabeled normal galled trees, as each rooted labeled normal galled tree is obtained by placing a labeling on one of the rooted unlabeled normal galled trees. is the number of labelings of rooted unlabeled normal galled tree .
Next, consider the concept of a symmetric node of a rooted unlabeled galled tree, an internal node with two identical rooted unlabeled subtrees. The top node of a gall can be symmetric, but side nodes of a gall cannot, as one subtree of a side node contains the reticulation node and the other does not. A reticulation node also cannot be a symmetric node, as it has only one subtree.
For a rooted unlabeled normal galled tree, the number of distinct labelings is , where is the number of symmetric nodes of . To see why this result holds, consider a planar representation of , and examine all n! labelings of the n nodes. For each such labeling, for each symmetric node, a rotation of around the node generates a distinct labeling for the same labeled tree—so that each rooted labeled normal galled tree is obtained from of the n! labelings.
The number of symmetric nodes is bounded above by the maximal number of internal nodes that are not side nodes or hybridization nodes. This number is , the number of internal nodes of a rooted tree with no galls; note that each gall adds two internal nodes to the tree, but neither of the “extra” nodes can be symmetric, as they include a side node and a reticulation node.
It is convenient to use n rather than for the upper bound on the number of symmetric nodes. Then
so that
| 59 |
The generating function has positive radius of convergence , so that converges for . Multiplying Eq. 59 by and summing over all n, we have for . Hence, the smaller must have positive radius of convergence . In particular, .
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Bienvenu F, Lambert A, Steel M. Combinatorial and stochastic properties of ranked tree-child networks. Random Struct Algorithms. 2022;60:653–689. doi: 10.1002/rsa.21048. [DOI] [Google Scholar]
- Bouvel M, Gambette P, Mansouri M. Counting phylogenetic networks of level 1 and 2. J Math Biol. 2020;81:1357–1395. doi: 10.1007/s00285-020-01543-5. [DOI] [PubMed] [Google Scholar]
- Cardona G, Zhang L. Counting and enumerating tree-child networks and their subclasses. J Comput Syst Sci. 2020;114:84–104. doi: 10.1016/j.jcss.2020.06.001. [DOI] [Google Scholar]
- Chang K-Y, Hon W-K, Thankachan SV (2018) Compact encoding for galled-trees and its applications. In: 2018 Data Compression Conference, Snowbird, UT, pp 297–306
- Colijn C, Plazzotta G. A metric on phylogenetic tree shapes. Syst Biol. 2018;67:113–126. doi: 10.1093/sysbio/syx046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Comtet L. Advanced combinatorics. Boston: Reidel; 1974. [Google Scholar]
- Drmota M. Random trees. Vienna: Springer; 2009. [Google Scholar]
- Felsenstein J. Inferring phylogenies. Sunderland, MA: Sinauer; 2004. [Google Scholar]
- Flajolet P, Sedgewick R. Analytic combinatorics. Cambridge: Cambridge University Press; 2009. [Google Scholar]
- Fuchs M, Gittenberger B, Mansouri M. Counting phylogenetic networks with few reticulation vertices: tree-child and normal networks. Australas J Comb. 2019;73:385–423. [Google Scholar]
- Fuchs M, Huang E-Y, Yu G-R. Counting phylogenetic networks with few reticulation vertices: a second approach. Discr Appl Math. 2022;320:140–149. doi: 10.1016/j.dam.2022.03.026. [DOI] [Google Scholar]
- Gascuel O. Mathematics of evolution and phylogeny. Oxford: Oxford University Press; 2005. [Google Scholar]
- Gunawan AD, Rathin J, Zhang L. Counting and enumerating galled networks. Discrete Appl Math. 2020;283:644–654. doi: 10.1016/j.dam.2020.03.005. [DOI] [Google Scholar]
- Gusfield D. Optimal, efficient reconstruction of root-unknown phylogenetic networks with constrained and structured recombination. J Comput Syst Sci. 2005;70:381–398. doi: 10.1016/j.jcss.2004.12.009. [DOI] [Google Scholar]
- Gusfield D. ReCombinatorics. Cambridge: MIT Press; 2014. [Google Scholar]
- Gusfield D, Eddhu S, Langley C (2003) Efficient reconstruction of phylogenetic networks with constrained recombination. In: Computational Systems Bioinformatics, CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference, Stanford, CA, pp 363–374 [PubMed]
- Gusfield D, Eddhu S, Langley C (2004a) The fine structure of galls in phylogenetic networks. INFORMS J Comput 16:459–469
- Gusfield D, Eddhu S, Langley C (2004b) Optimal, efficient reconstruction of phylogenetic networks with constrained recombination. J Bioinform Comput Biol 2:173–213 [DOI] [PubMed]
- Harding EF. The probabilities of rooted tree-shapes generated by random bifurcation. Adv Appl Prob. 1971;3:44–77. doi: 10.2307/1426329. [DOI] [Google Scholar]
- Huson DH, Rupp R, Scornavacca C. Phylogenetic networks: concepts, algorithms and applications. Cambridge: Cambridge University Press; 2010. [Google Scholar]
- Kong S, Pons JC, Kubatko L, Wicke K. Classes of explicit phylogenetic networks and their biological and mathematical significance. J Math Biol. 2022;84:47. doi: 10.1007/s00285-022-01746-y. [DOI] [PubMed] [Google Scholar]
- Landau BV. An asymptotic expansion for the Wedderburn–Etherington sequence. Mathematika. 1977;24:262–265. doi: 10.1112/S0025579300009177. [DOI] [Google Scholar]
- Mathur S, Rosenberg NA. All galls are divided into three or more parts: recursive enumeration of labeled histories for galled trees. Algorithms Mol Biol. 2023;18:1. doi: 10.1186/s13015-023-00224-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matsen FA, Evans SN. Ubiquity of synonymity: almost all large binary trees are not uniquely identified by their spectra or their immanantal polynomials. Algorithms Mol Biol. 2012;7:14. doi: 10.1186/1748-7188-7-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meir A, Moon JW. On an asymptotic method in enumeration. J Comb Theory Ser A. 1989;51:77–89. doi: 10.1016/0097-3165(89)90078-2. [DOI] [Google Scholar]
- Meir A, Moon JW. Erratum: on an asymptotic method in enumeration. J Comb Theory Ser A. 1989;52:163. doi: 10.1016/0097-3165(89)90071-X. [DOI] [Google Scholar]
- Rosenberg NA. On the Colijn–Plazzotta numbering scheme for unlabaled binary rooted trees. Discrete Appl Math. 2021;291:88–98. doi: 10.1016/j.dam.2020.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Semple C, Steel M. Phylogentics. Oxford: Oxford University Press; 2003. [Google Scholar]
- Semple C, Steel M. Unicyclic networks: compatibility and enumeration. IEEE/ACM Trans Comput Biol Bioinform. 2006;3:84–91. doi: 10.1109/TCBB.2006.14. [DOI] [PubMed] [Google Scholar]
- Sievers F, Hughes GM, Higgins DG. Systematic exploration of guide-tree topology effects for small protein alignments. BMC Bioinform. 2014;15:338. doi: 10.1186/1471-2105-15-338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Song YS. A concise necessary and sufficient condition for the existence of a galled-tree. IEEE/ACM Trans Comput Biol Bioinform. 2006;3:186–191. doi: 10.1109/TCBB.2006.15. [DOI] [PubMed] [Google Scholar]
- Steel M. Phylogeny: discrete and random processes in evolution. Philadelphia: Society for Industrial and Applied Mathematics; 2016. [Google Scholar]
- Wang L, Zhang K, Zhang L. Perfect phylogenetic networks with recombination. J Comput Biol. 2001;8:69–78. doi: 10.1089/106652701300099119. [DOI] [PubMed] [Google Scholar]
- Warnow T. Computational phylogenetics. Cambridge: Cambridge University Press; 2018. [Google Scholar]


