Abstract
Recently there has been considerable interest in the problem of finding a phylogenetic network with a minimum number of reticulation vertices which displays a given set of phylogenetic trees, that is, a network with minimum hybrid number. Such networks are useful for representing the evolution of species whose genomes have undergone processes such as lateral gene transfer and recombination that cannot be represented appropriately by a phylogenetic tree. Even so, as was recently pointed out in the literature, insisting that a network displays the set of trees can be an overly restrictive assumption when modeling certain evolutionary phenomena such as incomplete lineage sorting. In this paper, we thus consider the less restrictive notion of rigidly displaying which we introduce and study here. More specifically, we characterize when two trees can be rigidly displayed by a certain type of phylogenetic network called a temporal tree-child network in terms of fork-picking sequences. These are sequences of special subconfigurations of the two trees related to the well-studied cherry-picking sequences. We also show that, in case it exists, the rigid hybrid number for two phylogenetic trees is given by a minimum weight fork-picking sequence for the trees. Finally, we consider the relationship between the rigid hybrid number and three closely related numbers; the weak, beaded, and temporal hybrid numbers. In particular, we show that these numbers can all be different even for a fixed pair of trees, and also present an infinite family of pairs of trees which demonstrates that the difference between the rigid hybrid number and the temporal-hybrid number for two phylogenetic trees on the same set of n leaves can grow at least linearly with n.
Keywords: Phylogenetic network, Hybrid number, Cherry-picking sequence, Fork-picking sequence, Weakly displaying, Rigidly displaying, Temporal tree-child network, Beaded tree
Introduction
Recently there has been great interest in using phylogenetic networks to model processes such as lateral gene transfer and recombination (see e.g. Bapteste et al. 2013). Such networks come in various forms (see e.g. Huson et al. 2010), and here we shall consider explicit networks which are used to provide direct representations of evolutionary histories in the form of a leaf-labelled directed graph. Formally speaking, a phylogenetic network (on a species set X) is a connected directed acyclic graph, with a single root and leaf-set X in which every internal vertex has either indegree one and outdegree two, or indegree two and outdegree one, except for the root which has outdegree two. We call the number of vertices in a phylogenetic network with indegree two the network’s hybrid number, so that a phylogenetic tree is a network with hybrid number zero. We shall mainly focus on temporal tree-child networks, i.e. phylogenetic networks in which each non-leaf vertex has a child whose indegree is one, whose vertices can be labelled with time stamps that move strictly forward on treelike parts of the network, and so that vertices with indegree two have parents with the same time stamp [also known as tree-child, time-consistent networks (Cardona et al. 2009a)].
Any phylogenetic network on some set X displays a set of phylogenetic trees on X, where a phylogenetic tree is displayed by a network if there is a subgraph of the network that is isomorphic to a subdivision of the tree (van Iersel et al. 2010). Biologically speaking, we usually think of the trees displayed by a network as being gene trees, and the fact that a network is required to represent them simultaneously is the consequence of incongruence between the gene trees that can arise from processes such as lateral gene transfer (Zhu et al. 2016). As gene trees are now commonly inferred from genomic data (e.g. by considering either a gene or a genomic locus of interest) it is therefore natural to try and devise ways to construct phylogenetic networks through the process of looking for some network which displays a given set of gene trees (see e.g. Nakhleh 2010, Section 2). For a given set of phylogenetic trees, this leads to the concept of the (temporal) hybrid number, which is the minimum hybrid number taken over all (temporal tree-child) networks that display each of the trees in the set (Baroni et al. 2005b; Humphries et al. 2013a). While the hybrid number exists for any set of phylogenetic trees, it is worth noting that the temporal hybrid number does not always exist, i.e. there are sets of trees that cannot be displayed simultaneously in a temporal tree-child network.
Several results have been presented in the literature concerning displaying phylogenetic trees and hybrid numbers, mainly for pairs of trees. These include structural information on how the hybrid number is related to a so-called maximum acyclic agreement forest for two phylogenetic trees (Baroni et al. 2005a), characterizations for when collections of trees are displayed by special types of networks (Humphries et al. 2013a; Linz and Semple 2019) and related algorithms/complexity results (Bordewich and Semple 2007a, b; Döcker et al. 2019; Humphries et al. 2013b; Huson and Linz 2016). However, all of these results rely on the fact that the networks display the set of trees in question. Recently it has been observed that this is a serious issue when modelling a phenomenon called incomplete lineage sorting where the aim is to model gene tree incongruence arising due to population effects. This is because the set of displayed trees is then no longer able to fully capture the way in which the genes actually evolve along the network, making it difficult to recover the underlying network (Zhu et al. 2016; Zhu and Degnan 2016; Degnan 2016).
A possible solution to this problem is to relax the displaying condition. Roughly speaking, a phylogenetic tree is weakly displayed by a network (Huber et al. 2016) if it can be embedded in the network in such a way that the tree follows along the directed paths in the network (see Sect. 3 for the definition). In biological terms, as nicely explained in van Iersel et al. (2018), “this means that different lineages of the gene tree may “travel down” the same branch of the network, as long as any branching node in the tree coincides with a branching node in the network”. In this paper, we focus on the special situation where two phylogenetic trees and are weakly displayed by a temporal tree-child network under the assumption that there exist simultaneous embeddings of both trees that do not permit more than three branches of and to come together at a reticulation vertex. In this case we shall say that and are rigidly displayed by the network (see Fig. 1 for an example). As relatively little is known about the problem of constructing phylogenetic networks that weakly display a collection of phylogenetic trees, we believe that our results on rigidly displaying provide some useful new insights into approaching this challenging problem. Ultimately this should hopefully also lead to improved approaches to modeling phenomena such as incomplete lineage sorting.
Fig. 1.

Two phylogenetic trees on the set that are rigidly displayed by the phylogenetic network on the right. Note that the tree in the top left is not displayed by the network. The trees and network are adapted from (Zhu et al. 2016, Fig. 2, Fig. 3) and represent a hypothetical evolutionary scenario tracing the evolution of a genomic region within four species
We now summarize the rest of the paper, including statements of our main results. After presenting some definitions in Sect. 2, in Sect. 3 we present the definition of weakly displaying, and we prove some basic facts concerning this concept that are useful later on. In addition, in Theorem 1 we characterize when two phylogenetic trees are displayed by a temporal tree-child network and when they are weakly displayed such a network.
In Sects. 4 and 5, we introduce the concepts of rigidly displaying and fork-operations, respectively, and prove some results concerning these concepts which we use later on. A fork-operation can be thought of as a generalization of picking off a pair of cherries from two phylogenetic trees as defined in Humphries et al. (2013a). In particular, in a key result, Proposition 1, we show that in case a temporal tree-child network rigidly displays two phylogenetic trees, there exists a certain sequence of fork-operations (called a special sequence) which can be applied to these two trees that allows us to apply inductive arguments to prove our main results. Using this, in Theorem 2 we prove that a pair of phylogenetic trees can be rigidly displayed by a temporal tree-child network if and only if there exists a fork-picking sequence for the two trees. Interestingly, we also prove that this is equivalent to the existence of a temporal tree-child network that displays the two trees. As a corollary we show that it is NP-complete to decide whether or not there exists a temporal tree-child network that rigidly displays two phylogenetic trees (Corollary 1).
In Sect. 7, we define the rigid hybrid number of two trees, which is the minimum hybrid number of a phylogenetic network that rigidly displays both trees, taken over all temporal tree-child networks. To capture this number, we consider the weighted fork-picking sequences for a pair of phylogenetic trees, showing in Theorem 3 that, in case it exists, the rigid hybrid number of two phylogenetic trees is equal to the minimum taken over the weights of all possible fork-picking sequences for the two trees. This result can be regarded as analogue of Humphries et al. (2013a, Theorem 2).
In Sect. 8, we consider the relationship between the rigid hybrid number and three closely related hybrid numbers: the weak, beaded, and temporal hybrid numbers (the beaded hybrid number was implicitly defined in van Iersel et al. (2018)). In particular, in Theorem 4, we first show that there is a pair of phylogenetic trees on a set X with |X| arbitrarily large, so that the difference between the temporal and rigid hybrid numbers for these two trees is at least . Then in Theorem 5, we show that there exist two phylogenetic trees whose beaded, weak, rigid, and temporal hybrid numbers are all distinct from one another. We conclude in Sect. 9 by presenting some open problems and discussing some possible future directions of research.
Preliminaries
Let G denote a directed, acyclic graph with a single root, i.e., a vertex with indegree zero. Let V(G) denote the vertex set of G, E(G) the set of (directed) edges of G, and the unique root of G. A vertex in G with indegree one and outdegree zero is called a leaf; an edge of G incident with a leaf of G a pendant edge of G. Furthermore, we denote the set of all leaves of G by L(G).
Suppose . We say that a vertex is above v if there exists a directed path from u to v (note that u could equal v). If u is above v, then we also write or simply if G is clear from the context. Furthermore, we say that v is below u. We call any vertex above v an ancestor of v and any vertex below v a descendant of v. Finally, we say that two distinct edges and of G are comparable if v is above or is above u. Otherwise we say that e and are incomparable.
Let X be a finite set of size at least 2. Following e.g. Huber et al. (2016, p.1764) a rooted, directed acyclic graph is called a phylogenetic network (on X) if the following properties are satisfied:
the root of has indegree zero and outdegree two,
X is the set of leaves of , and
each remaining vertex of has either indegree one and outdegree two, or indegree two and outdegree one.
Unless stated otherwise, phylogenetic networks do not contain parallel edges.
Now let be a phylogenetic network. We refer to a vertex of with indegree two and outdegree one as a reticulation vertex, and to a vertex with indegree one and outdegree zero or two as a tree vertex. The set of reticulation vertices of is denoted by . We put . Moreover, we call a directed path P from a vertex v to a leaf x in a phylogenetic network a tree-path if each vertex on P, except possibly v, is a tree vertex.
A phylogenetic tree (on X) is a phylogenetic network on X that does not have any reticulation vertices. We say that two phylogenetic trees and on X are isomorphic, denoted by , if there exists a bijection that induces a graph isomorphism between and that is the identity on X. If is a phylogenetic tree on X, and , then the last common ancestor of Y, denote by , is the unique vertex v of that is an ancestor of every element in Y and there is no vertex such that w is a descendant of v and w is an ancestor of every element in Y. For any elements , we sometimes also write rather than . Now let . We denote by the minimal subtree of that connects all vertices in and by the tree obtained from by suppressing all vertices that have indegree one and outdegree one. We refer to as the restriction of to . Note that, if is a subset of X such that , then the root of equals .
Suppose is a phylogenetic network on X. Following Steel (2016), we say that is temporal (Moret et al. 2004) if there exists a map such that, for all , we have whenever q is a reticulation vertex and , otherwise. In that case, we call t a temporal labelling of . Unless of relevance to the discussion, we always omit the temporal labelling when depicting a temporal network. We say that is tree-child (Cardona et al. 2009b) if, for each non-leaf vertex at least one of the children of v is a tree vertex. Note that a tree-child network was called a phylogenetic network in Humphries et al. (2013a, p.1883) and that a temporal tree-child network (in our sense) was called a (binary) time-consistent tree-child network, or TCTC-network in Cardona et al. (2009a).
Weakly displaying two trees in a network
To define and understand the notion of rigidly displaying, it is useful to first consider the more general notion of weakly displaying. As well as recalling the definition of weakly displaying, we derive some of its basic properties which will be useful later, and explain how the concept of rigidly displaying is related to the stronger notion of displaying (see Theorem 1).
Let be a phylogenetic tree on X and let be a phylogenetic network on X. Furthermore, let be a map that maps each vertex of to a vertex of and each edge of to a directed path from the image of u under to the image of v under . To distinguish the mapping of a vertex v from that of an edge e in , we use to denote the vertex in that v is mapped to under and to denote the directed path in that e is mapped to under . We call a display map for in if the following properties hold:
-
(i)
for each , ,
-
(ii)
for all , is a tree vertex or the root of ,
-
(iii)
for each edge e of , contains at least one edge of , and
-
(iv)
for any two distinct edges and of , the first edge of is different from the first edge of in .
Note that the definition of a display map is equivalent to being a locally separated reconciliation as defined in Huber et al. (2016, Section 7).
Now, let be a display map for in , and let P be a directed path of . It follows immediately from the definition of , that the edge set
induces a directed path in . We will freely use this fact throughout the remainder of the paper.
Following Huber et al. (2016), we say that is weakly displayed by if there exists a (not necessarily unique) display map for in . To reduce notation, we will sometimes not explicitly refer to the display map. Note that if displays as defined in the introduction then also weakly displays . However, the converse is not necessarily true. For example, referring to Fig. 2, is weakly displayed by for the indicated display map but is not displayed by .
Fig. 2.
Two phylogenetic trees and on that are weakly displayed by the network on X via the indicated display maps and for and , respectively, for which holds for all . However, and are not both displayed by (the tree is not displayed)
The notion of weakly displayed was introduced in Huber et al. (2016) in terms of a construction that allows the unfolding of a phylogenetic network on X into a so-called multi-labelled tree on X (Huber and Moulton 2006). Such trees are similar to phylogenetic trees in that they have no vertices with in- and outdegree one and the root has indegree zero. However the requirement that the leaf-set is X is relaxed to the requirement that an element of X can label more than one leaf (which is not allowed in the case of phylogenetic trees).
Now, suppose that is a display map for in . For any edge , we denote by the set of all vertices in that lie on except for . Let , and let be an edge of . If , we say that ends at w and if , but we say that passes through w. In addition, we define
i.e., counts the number of edges in such that their image under either ends or passes through w so that in particular . Finally, if is an additional phylogenetic tree on X that is weakly displayed by via a display map , then we put
(see e.g. Fig. 2). To reduce notation we sometimes drop the subscript in as indicated if and are clear from the context.
We now prove two lemmas about display maps which will be useful later.
Lemma 1
Let be a temporal tree-child network on X and let be a phylogenetic tree on X. Then the following holds.
-
(i)
displays if and only if there exists a display map for in such that, for all , we have ,
-
(ii)
if is a display map for in as specified in Assertion (i) then .
Proof
(i) Assume first that displays . Then since a phylogenetic network on X that displays a phylogenetic tree on X clearly also weakly displays that tree, it is straight-forward to see that there exists a display map for in such that, for all , we have .
Conversely, assume that there exists a display map for in such that, for all , we have . Then, for each , there exists at most one edge such that passes through v. Note that since v is not a tree-vertex of , there cannot exist an edge in such that ends in v. Since is tree-child it follows that the subgraph of with vertex set and edge set is a rooted tree with leaf set X. Furthermore, the outdegree of the root of must be two as is temporal and, therefore, cannot contain a shortcut, that is, if there exists a directed path from a vertex u to a vertex v in which contains at least three vertices then there cannot also be the edge (u, v) in . Since is isomorphic to a subdivision of it follows that displays .
(ii) Assume for contradiction that is not the root of . Since is temporal tree-child, the two children u and v of must be two distinct tree vertices. Moreover, there must exist a tree-path in from u to some leaf x and a tree-path from v to some leaf y. Note that since and cannot intersect, we must have . Since is a display map for in that satisfies the properties of Assertion (i) and x and y are also leaves of it follows that is mapped to an ancestor of x and y in under as must display . Extending the paths and to tree-paths starting at implies that this ancestor must be . Thus, .
To state the second lemma we require some further definitions. We call a subgraph of a pendant subnetwork of if there exists a tree vertex v in such that when deleting the incoming edge of v the network decomposes into two connected components where the component that contains v in its vertex set is a phylogenetic network on . For technical reasons, we consider to be a pendant subnetwork of itself. A pendant subtree of is a pendant subnetwork of that is a phylogenetic tree. Note that a pendant subnetwork and therefore also a pendant subtree must have at least two leaves. Also, note that in what follows, if is a phylogenetic network that weakly displays a phylogenetic tree via a display map and is a pendant subtree of , then in order to simplify notation we shall identify with its pre-image under .
Lemma 2
Let be a phylogenetic network on X that weakly displays two distinct phylogenetic trees and on X via display maps and , respectively. Furthermore, let v be a tree vertex of . Then the following hold.
-
(i)
If , then there is a vertex in which is an ancestor of v in .
-
(ii)
If and v has a child that is the root of a pendant subtree in , then the pre-image of under and , respectively is also a pendant subtree of the respective tree and .
Proof
(i) Suppose that v is a tree vertex of such that . Then, without loss of generality, we may assume that is such that . Hence, there are two distinct edges and in such that and .
We show first that e and are incomparable in . Indeed, assume for contradiction that e and are comparable. Without loss of generality, we may assume that w is above in . Then the edges on the directed path P in starting at u and ending at are collectively mapped by to a directed path in with edge set . Note that contains and as (directed) subpaths and that cannot be a subpath of and vice versa as otherwise Property (iii) in the definition of a display map cannot hold. Now, since and it follows that contains a directed cycle; a contradiction. Hence, e and are incomparable.
Let , and let P (resp. ) be the directed path in from q to w (resp. ). Furthermore, let and be the directed paths in with edge sets
respectively. As the first vertex on and is , it follows from Property (iv) in the definition of a display map that the first edge on is different from the first edge on . Moreover, since e is an edge of P and is an edge of , it follows that is a subpath of and is a subpath of . Hence, v is a vertex on and . Consequently, there is some vertex in which lies on and and which is an ancestor of v.
(ii) Note that every leaf x in must be contained in the image under of some directed path in from the root of to x, and similarly for . Since, by assumption, , there can be at most one such path in and , respectively, which has this property for every leaf in . Since the leaf set of , , and , respectively, is X it follows that the pre-image of under and , respectively, must be a pendant subtree of the respective tree and , as otherwise .
In Fig. 2, we presented an example where two phylogenetic trees and are weakly displayed by the depicted phylogenetic network , for all but and are not both displayed by . So, in general, it does not suffice to insist that for all for two phylogenetic trees to be displayed by a phylogenetic network. However, if we insist that the network is temporal tree-child, we now show that this condition actually suffices.
Theorem 1
Suppose that is a temporal tree-child network on X and that and are two phylogenetic trees on X that are weakly displayed by via display maps and , respectively. Then the following statements are equivalent.
-
(i)
displays and .
-
(ii)
for all .
-
(iii)
for all .
Proof
(i) (ii) We show first that must hold for all . Assume for contradiction that there exists some vertex such that . Then one of or must hold. Without loss of generality we may assume that . Then there exists no edge such that either passes through v or ends in v. But then there cannot exist a leaf x of that can be reached from v via a tree-path. Thus, is not tree-child; a contradiction. Since, by assumption, displays both and , Lemma 1 implies that for all . Thus, must hold for all .
(ii) (iii) This is trivial.
(iii) (i) By Lemma 1 it suffices to show that and holds for all . Assume for contradiction that there exists some and some tree in , say , such that . Then . We claim that v is a tree vertex. Assume for contradiction that this is not the case. Then . Since is tree-child there exists a leaf below v that can be reached from v via a tree-path. Hence, any directed path from to x must contain v. Thus, which, in turn, implies that ; a contradiction in view of Assertion (iii). Thus, v is a tree vertex, as claimed.
By Lemma 2(i), it follows that there must exist some vertex that is an ancestor of v. Without loss of generality, we may assume that w is such that no vertex in distinct from w that is above v and below w is contained in . Then there is a unique directed path P from w to v in . Since and, , there is a reticulation vertex contained in P that is not w; a contradiction to the choice of w.
Remark 1
Note that using the same proofs it can be seen that both Lemma 1 and Theorem 1 also hold for the more general class of normal networks. These are phylogenetic networks that in addition to being tree-child do not contain a shortcut as defined in the proof of Lemma 1 (see e.g. McDiarmid et al. 2015, p. 208).
Rigidly displaying two trees in a network
We now introduce the notion of rigidly displaying and present some of its basic properties. In Theorem 1, we showed that if is a temporal tree-child network which weakly displays two phylogenetic trees and via display maps and , then in case for all it follows that actually displays . To define rigidly displaying we relax this latter condition as follows.
We say that a phylogenetic network on X rigidly displays two phylogenetic trees and on X if weakly displays and via display maps and respectively, for all we have and, for each parent of a reticulation vertex , we have . For example, the network pictured in Fig. 1 rigidly displays the two phylogenetic trees depicted in that figure (where and are the obvious display maps).
Note that, in contrast to the definitions of displaying and weakly displaying which refer to a single phylogenetic tree, rigidly displaying always refers to two phylogenetic trees. In addition, by Theorem 1 it follows that if and are two phylogenetic trees on X that are displayed by a temporal tree-child network on X, then also rigidly displays and .
We conclude this section by presenting two technical lemmas concerning rigidly displaying trees in tree-child networks which we will use later.
Lemma 3
Suppose is a tree-child network on X and that rigidly displays two phylogenetic trees and on X via display maps and , respectively. Then for all .
Proof
Put and suppose . Then since rigidly displays and it also weakly displays and . Since and there exists some leaf below v that can be reached from v via a tree-path as is tree-child, it follows in view of Property (ii) in the definition of a display map that and that . Hence, .
For the remainder, assume for contradiction that there exists some such that . Then v must be a tree vertex of as rigidly displays and and . Let be a longest directed path of tree vertices in that ends at v. Note that , for all . Also note that since is not a tree vertex of , we cannot have . Let denote the parent of . Then . Hence, we cannot have . Since rigidly displays and it follows that w must be a tree vertex of . But then the extension of P by w results in a directed path of tree vertices of that ends in v and that is longer than P; a contradiction.
Note that the converse of the last lemma does not hold in general (see e. g. Fig. 3).
Fig. 3.
The two phylogenetic trees on the left are weakly displayed via the indicated display maps by the network depicted on the right. However, they are not rigidly displayed by the network for those maps because and v is the parent of a reticulation vertex
Lemma 4
Suppose that is a tree-child network on X that rigidly displays two phylogenetic trees and on X and that is the underlying display map for in . If is an edge of such that passes through a vertex , then must be a parent of w in .
Proof
Suppose is an edge in such that passes through a vertex . Assume for contradiction that is not a parent of w. Let p be the parent of w in such that p lies on . Then . As is tree-child, there must be a tree-path in starting at p and ending at some leaf . So, as x is a leaf of and , there must be some edge in such that passes through or ends at p. Moreover, considering the leaf x again, there must be an edge in which maps to a path in via the underlying display map for in that either ends at or passes through p. It follows that ; a contradiction as rigidly displays and and p is the parent of a vertex in .
Note that, as the example of the two phylogenetic trees and the network in Fig. 2 shows, the assumption that is tree-child is necessary for Lemma 4 to hold.
Fork operations
In the next section we shall characterize when two phylogenetic trees are rigidly displayed by a temporal tree-child network in terms of sequences of certain operations on these trees. The basis for these sequences are fork-operations which we shall now introduce.
First, we recall that two leaves x and y of a phylogenetic tree with are called a cherry of , denoted by , if x and y have the same parent. Now, by a fork we mean a 2-leaved rooted tree (i. e. a cherry), a 3-leaved rooted tree (a 3-fork) or a 4-leaved fully-balanced rooted tree (a 4-fork). The following basic fact concerning forks is straight-forward to show.
Lemma 5
Suppose is a phylogenetic tree with leaves. If then is a 3-fork and if then must contain a pendant subtree that is either a 3-fork or a 4-fork.
Now, a (type-i) fork-operation , , is an operation that can be applied to a pair of phylogenetic trees and on X for which there exists a leaf , together with a fork in each of and containing x as depicted in the second and third columns of Fig. 4 (for example, for a type-2 operation one tree has a 3-fork ((z, x), y) and the other a cherry with distinct). In the 4th and 5th columns of Fig. 4 the result of applying the operation o(x) to the two trees is pictured (for example, applying a type-2 operation to the 3-fork and the cherry in row 3 results in a phylogenetic tree with cherry and a phylogenetic tree whose cherry has been replaced by y). In particular, when we apply an operation o to some element , we remove the leaf x and its incident edge from both trees, and suppress any resulting vertices of degree 2, also removing the root and both edges incident with it in case . In case the type i of the fork-operation is of importance we write instead of o(x). We present an example of applying a sequence of fork-operations to two phylogenetic trees in Fig. 5.
Fig. 4.

The four fork-operations each applied to leaf x in two phylogenetic trees, as defined in the text
Fig. 5.

For the two pictured phylogenetic trees and on , we depict a fork-picking sequence for and which has weight 1. In that sequence, is the special sequence , and makes up . The forks in and to which a fork-operation is applied are indicated by dotted triangles
Now, given two phylogenetic trees and on the set , , we call a sequence of l fork-operations, , a special sequence for and if is a type-1 operation on and and, in case , the following properties hold:
-
(i)
There exists some such that each , , is a type-2 or a type-3 operation on and and the associated 3- or 4-fork is a pendant subtree of ,
-
(ii)
there exist two distinct elements such that the last-but-one operation is a type-2 operation with fork and cherry and the last operation is the type-1 operation applied to the cherries and , and
-
(iii)
if , then must be below for all for the tree in (i).
To illustrate this definition, consider the two phylogenetic trees and on depicted in Fig. 5. Then is a special sequence for and where, for example, is a fork-operation of type-3 and the tree with cherry is the tree mentioned in the definition of a fork-picking sequence. Note that an application of the last operation in a special sequence always results in phylogenetic trees with at least two leaves.
We conclude this section with an observation concerning special sequences which will be key to proving our main results.
Proposition 1
Suppose that is a temporal tree-child network on X, , that rigidly displays two phylogenetic trees and on X. If no type-0 operation can be applied to and , then there is a special sequence for and . Moreover, the two phylogenetic trees resulting from applying to and can be rigidly displayed by a temporal tree-child network with .
Proof
Note first that as otherwise would be a phylogenetic tree that is isomorphic with both and implying that a type-0 operation can be applied to and ; a contradiction.
Let denote a temporal labelling for and pick some whose value is maximum under t. Let u and w be the parents of v. Since does not contain parallel arcs, u and w are two distinct tree vertices of . Since is tree-child, there must exist some child p of u that is not v and, similarly, there must exist a child q of w that is not v. We claim that p is a leaf of .
To see that this claim holds, assume for contradiction that p is the root of a pendant subgraph of . Note that the choice of v implies that is in fact a pendant subtree of . Moreover, Lemma 3 implies that there are display maps for and in such that . By Lemma 2(ii) it follows that is a pendant subtree of both and . Hence, and have a common cherry and, so, we can apply a type-0 operation to and ; a contradiction. Thus p must be a leaf of . Applying similar arguments to q implies that q must also be a leaf of .
Since is tree-child, the choice of v implies that the child s of v is a leaf of or the root of a pendant subtree of . Assume first that s is a leaf of . Then since and are rigidly displayed by and and do not contain a common cherry, it is straight-forward to see using Lemma 4 that, without loss of generality, and must contain the cherries and , respectively. Hence we can apply a type-1 operation to s. This gives a special sequence of length 1 for and , from which the proposition follows.
So, suppose that s is the root of a pendant subtree of , so that has at least two leaves. Note first that . Indeed, since and are rigidly displayed by and is tree-child, we obtain in view of Lemma 3. If held then and would have a common cherry which implies that a type-0 operation can be applied to and ; a contradiction. We can therefore assume without loss of generality that is a pendant subtree of , and that this tree together with the leaf p also forms a pendant subtree of .
In case has only two leaves x and y, say, then since it follows that contains the 3-fork (p, (x, y)) and contains, without loss of generality, the cherries and . Hence we can apply a type-2 operation to y and then apply a type-1 operation to x (since and must contain the cherries and , respectively). It follows that is a special sequence for and since Property (i) of a special sequence holds for and Property (ii) holds for p and q as defined above. The remainder of the proposition is a straight-forward consequence.
Assume for the remainder of the proof that has at least two leaves. Put , . We claim that there exists a special sequence for and with and p and q as defined above. We prove the claim by induction on k.
Note that we have just shown that the claim holds for the base case . So suppose the claim holds for all k, , and that . Note that as , Lemma 5 implies that contains either a 3-fork or a 4-fork.
Suppose contains a 3-fork where are distinct. Then must be a pendant subtree of . As and have no cherries in common and , it follows that contains the cherry or the cherry . Without loss of generality, we may assume that contains the cherry . Hence, we can apply the type-2 operation o(c) to c. Note that this creates a cherry in which, in view of Property (iv) in the definition of a display map, is not a cherry in . Clearly, the network obtained from by removing c and its incoming edge (suppressing the resulting indegree and outdegree one vertex) is temporal tree-child and rigidly displays and . By induction, we obtain a special sequence for and with and p and q as defined above. We postulate that is a special sequence for and with and p and q as defined above. Since o(c) is a type-2 operation applied to c it suffices to show that Property (iii) of a special sequence is satisfied for c, that is, c is below . But this clearly holds since and any ancestor of b is also an ancestor of c as is a cherry in . Thus, is a special sequence with the stated properties, as required.
Suppose contains a 4-fork where are distinct. Then must again be a pendant subtree of . As and have no cherries in common and , it follows that, as before, we may assume without loss of generality that contains the cherries and . Hence, we can apply the type-3 operation o(c) to c. Note that this creates a 3-fork (d, (a, b)) in and that is not a cherry in . Clearly, the network obtained from by removing c and its incoming edge (suppressing the resulting indegree and outdegree one vertex) is again temporal tree-child and rigidly displays and . By induction, we obtain a special sequence for and with and p and q as defined above. Employing similar arguments as in the previous case implies that is a special sequence for and with and p and q as defined above. This completes the proof of the claim.
To complete the proof, note that as is a pendant subtree of , we can remove and v (plus all its incident edges) from , and suppress the resulting vertices of degree two to obtain a network with . As rigidly displays and , it follows that rigidly displays their restrictions and . Moreover, as is tree-child and , we have that is also tree-child. Since p and q are leaves of and is temporal, it follows that is temporal.
Fork-picking sequences
In this section we characterize when two phylogenetic trees are rigidly displayed by a temporal tree-child network, in terms of a generalization of special sequences which we now introduce. Suppose that and are two phylogenetic trees on where and that is a sequence of fork-operations for and where is on and , for all . Then we call a fork-picking sequence for and if is of the form , some , such that
-
(i)
for all , we have that is a possibly empty (except in case ) sequence of type-0 operations, and
-
(ii)
for all , we have that is a special sequence for and , where and is the first operation in (so that, in particular, ).
To ease readability, we omit all those that are empty when writing down fork-picking sequences. Note that it follows from the definition that any fork-picking sequence can be decomposed in a unique way into the form , and that all of the subsequences are non-empty.
To illustrate this definition, note that is a fork-picking sequence for the two phylogenetic trees depicted in Fig. 5, since it is of the form where is the special sequence for and considered in the previous section, , and is the empty sequence.
We now provide a link between for a temporal tree-child network that rigidly displays two phylogenetic trees and fork-picking sequences for these trees. We define the weight of a fork-picking sequence to be the number of special sequences in (or, equivalently, the number of type-1 operations in since any special sequence contains precisely one type-1 operation).
Proposition 2
Suppose that is a temporal tree-child network on X that rigidly displays two phylogenetic trees and on X. Then there is a fork-picking sequence for and with .
Proof
Clearly, the theorem holds for . So assume that . We prove the theorem by induction on . If , then , , and are all isomorphic to one another. But then we can take a fork-picking sequence for and consisting solely of type-0 operations (i.e. ), and so .
Now, assume that , some , and that the theorem holds for all temporal tree-child networks with .
Apply type-0 operations to and until no more can be applied. If the resulting sequence of operations has length , then it is a fork-picking sequence for and and . Hence, the theorem holds. Otherwise, let and be the phylogenetic trees resulting after applying the operations in , noting that .
Since by construction no type-0 operation can be applied to and , by Proposition 1 it follows that there is a special sequence for and , and that the two phylogenetic trees and resulting from applying can be rigidly displayed by a temporal tree-child network with .
It follows by induction that there is a fork-picking sequence , some , for and such that . Hence, is a fork-picking sequence for and such that .
Now, as defined in Humphries et al. (2013a), we say for that an ordering of X is a cherry-picking sequence for two phylogenetic trees and on X if, for all , is contained in a cherry in both and . In addition, the cherry-count associated to is 1 if the cherries in and containing are different and 0 else.
Note that every cherry-picking sequence for two phylogenetic trees and on X gives rise to a fork-picking sequence for and . Namely, we make a sequence of operations with a type-1 operation applied to if and a type-0 operation applied to if . In addition, any fork-picking sequence for two phylogenetic trees and on X clearly gives rise to the cherry-picking sequence . For example, the cherry-picking sequence with cherry counts (1, 1, 1, 1, 0, 0) arises from the fork-picking sequence given at the beginning of this section for the two phylogenetic trees in Fig. 5. Using these observations we obtain the following result.
Theorem 2
Suppose that and are two phylogenetic trees on X. Then the following statements are equivalent.
-
(i)
and are rigidly displayed by a temporal tree-child network on X.
-
(ii)
and are displayed by a temporal tree-child network on X.
-
(iii)
there is a cherry-picking sequence for and .
-
(iv)
there is a fork-picking sequence for and .
Proof
(ii) (i) If two phylogenetic trees are displayed by a phylogenetic network then they are rigidly displayed by that network.
(iii) (ii) Apply (Humphries et al. 2013a, Theorem 1), which states that two phylogenetic trees are displayed by a temporal tree-child network if and only if there is a cherry-picking sequence for them.
(i) (iv) Apply Proposition 2.
(iv) (iii) Apply the observation stated before the statement of the corollary i. e. , that a fork-picking sequence gives rise to a cherry-picking sequence.
Note that the temporal tree-child networks whose existence is guaranteed in Theorem 2(i) and (ii) need not be the same.
Theorem 2 also sheds light on the following decision problem:
Rigidly Displaying
Input: Two phylogenetic trees and on X.
Output: Does there exist a temporal tree-child network on X that rigidly displays and ?
Indeed, Theorem 2 and the main result in Döcker et al. (2019, Theorem 1) (which states that it is NP-complete to decide whether or not there is a cherry-picking sequence for two phylogenetic trees) immediately imply:
Corollary 1
The decision problem Rigidly Displaying is NP-complete.
The rigid hybrid number of two trees
Suppose that there is some fork-picking sequence for two phylogenetic trees and on X (or equivalently by Theorem 2, and are rigidly displayed by some temporal tree-child network on X). We define
and
We call the rigid (temporal tree-child) hybrid number for and .
In this section, we prove:
Theorem 3
If two phylogenetic trees and on X are rigidly displayed by some temporal tree-child network on X, then .
This theorem is an immediate consequence of Proposition 2 and the following result.
Proposition 3
Suppose and are two phylogenetic trees on X and that is a fork-picking sequence for and . Then there exists a temporal tree-child network on X which rigidly displays and and such that .
Proof
We establish the theorem using induction on .
If then , where consists solely of type-0 operations. Hence and are isomorphic and the required temporal tree-child network is given by .
Now suppose that is a fork-picking sequence for and with weight , some , and that, for any two phylogenetic trees and , the theorem holds for all fork-picking sequences for and with .
As , is of the form . Let Y be the set of elements y in X such that o(y) is not in the sequence . Then, as is not the empty sequence, is a fork-picking sequence for and , with . By induction, it follows that there is a temporal tree-child network with that rigidly displays and . Let denote a temporal labelling for .
We now construct a temporal tree-child network from . We first consider the case that is the empty sequence. We illustrate this case in Fig. 6 in terms of the fork-picking sequence considered in Fig. 5.
Fig. 6.
The construction of the temporal tree-child network for the fork-picking sequence for the phylogenetic trees and in Fig. 5 as described in the proof of Proposition 3. Note that is the empty sequence. (i) Insertion of . (ii) Insertion of . (iii) Insertion of and
Let , , be such that . To ease notation, put and, if , .
Assume first that . Since o(x) is a type-1 operation, there exist such that, without loss of generality, is a cherry in and is a cherry in . Subdivide the pendant edges in incident with p and q by adding new vertices and , respectively. Also, add in the leaf x below a newly added reticulation vertex v which has parents and . Denote the resulting phylogenetic network by . Clearly, since is tree-child we also have that is tree-child. Returning to the fork-picking sequence example considered in Fig. 5, we have for Fig. 6(i) that and that , and are all isomorphic. Also , , , and .
For and the parents of and in , respectively, choose some value with and put . Also, choose some such that so that, together with , we obtain a temporal labelling for . Clearly, rigidly displays and . Putting this completes the proof of the theorem in case since .
Assume now that . We explain how to insert y into . By definition of a special sequence, o(y) is a type-2 operation. Without loss of generality, we may assume that has a fork (p, (x, y)) and that has a cherry with p as in operation . Then we can construct a new temporal tree-child network which rigidly displays and by inserting a pendant edge incident with y via a subdivision vertex s on the pendant edge of incident with x to form a cherry in . Define the temporal labelling of by putting , choosing some value with , and appropriately using . Note that and so the theorem is also proven for if .
Assume next that . Suppose that we have a special sequence. Also, suppose that we have created a temporal tree-child network from by successively inserting, for some all leaves , , below the parent of the cherry in to create a pendant subtree of with leaf set so that rigidly displays the phylogenetic trees and . Without loss of generality, we may assume that is the tree in the definition of a special sequence for and .
We now show how to insert into . Consider operation . Then, by definition of a special sequence, is either a type-2 operation or a type-3 operation, for which the 3-fork and 4-fork, respectively, is a pendant subtree of and, in view of Property (iii) of a special sequence, is below . Put .
If is a type-2 operation, then let denote the 3-fork of where distinct. Then must be a cherry in . Since is below , it follows that is a cherry in . We can therefore first insert into the pendant edge of incident with b and then extend the temporal labelling of so as to obtain a temporal tree-child network that rigidly displays and .
If is a type-3 operation, then let denote the 4-fork in where are pairwise distinct. Then (a, (b, c)) must be a 3-fork in . As is below it follows that this 3-fork must also be a pendant subtree of . We can therefore first insert into the pendant edge of incident with a and then extend the temporal labelling of so as to obtain a temporal network that rigidly displays and . Again for the fork-picking sequence example considered in Fig. 5, we have for Fig. 6(iii) that is a type-2 operation with and and is a type-3 operation with and .
In summary, we can insert all of the elements of Y into in this way until we obtain a temporal tree-child network with which rigidly displays and . It follows by induction that
which completes the proof of the theorem in case is empty.
Now, suppose is not empty. Then we first insert all elements of into as described in the case that is the empty sequence above to obtain a network which rigidly displays and and for which holds. Into we then insert all elements for which is contained in in the (reverse) order specified by , and at each step creating a cherry specified by o(z), to obtain a new temporal tree-child network which rigidly displays and . Clearly, . Since is a fork-picking sequence for and and the theorem holds in this case too.
The relationship between the rigid, temporal, weak and beaded hybrid numbers
As mentioned in the introduction, there are various ways to define hybrid numbers for two phylogenetic trees. In this section, we consider the relationship between the rigid hybrid number of two phylogenetic trees and three closely related hybrid numbers (the temporal, weak, and beaded hybrid numbers), in particular, showing that they can all be different from one another. From a biological viewpoint, the results that we present in this section are important as they show that in principle quite different estimates can arise for the number of reticulation events required to accommodate two phylogenetic trees depending on the model being used to embed the two trees in a phylogenetic network.
The temporal hybrid number
For two phylogenetic trees and on X that can be displayed by some temporal tree-child network, the temporal hybrid number of and (Humphries et al. 2013a) is defined as
Note that in case this number exists, the temporal hybrid number for the two phylogenetic trees is not necessarily equal to their hybrid number Humphries et al. (2013a, Figure 1; also p. 1889).
Now, given two phylogenetic trees and on X, Theorem 2 implies that the temporal hybrid number of and exists if and only if the rigid hybrid number exists. Clearly, if these numbers both exist, then . In this section, we show that the difference can grow at least as a linear function of |X|.
To this end, assume that . Consider two phylogenetic trees and with leaves, as given in Fig. 7. In that figure, and are both fully balanced phylogenetic trees with leaves each and is a fully balanced phylogenetic tree with leaves. We next describe the labeling of and . Let be the set of all pendant subtrees on four leaves in . Then, for each , we bijectively label the leaves of with . Turning to the subtrees and of , let be the set of all pendant subtrees on four leaves in and, similarly, let be the set of all pendant subtrees on four leaves in . Then, for each
bijectively label the eight leaves of and with such that has cherries and and has cherries and . For , the leaf labeling of and is illustrated in Fig. 8.
Fig. 7.
Two phylogenetic trees and on the set as defined in the text. Both are rigidly displayed by the depicted temporal tree-child network on X
Fig. 8.
The two phylogenetic trees and in Fig. 7 for the case
Theorem 4
For the two phylogenetic trees and on , , pictured in Fig. 7, we have .
Proof
First note that , as and are rigidly displayed by the temporal tree-child network pictured in Fig. 7.
We now show that from which the theorem follows. First note that, by Theorem 2, there must exist a temporal tree-child network that displays both and . By Humphries et al. (2013b, Theorem 3.3), it follows that is equal to the number of components in a maximum temporal agreement forest for and minus 1, where such a forest is defined as follows. All phylogenetic trees considered in the definition are “planted" by adding a new root plus an edge to their roots, and trees with one leaf are also allowed. Let . Then a collection of planted trees is a maximum temporal agreement forest for (planted versions of) and if the following three properties hold, where , .
The set is a partition of X.
For all , .
- Denoting for the root of by , there exist injective maps
such that any two trees in and are edge-disjoint rooted subtrees of and , respectively.
We now claim that for every subset of X with , at least one of the sets , , and must be contained in Z. This implies that from which the theorem immediately follows. For simplicity, we prove the claim for . Specifically, we show that, for the subset of X, at least one of , , and is an element of Z. The argument for the remaining cases is similar.
Assume that is not an element of Z. Since Z is a partition of X, one of the following three cases must hold:
or is an element in Z,
is an element in Z, or
and are elements in Z,
where , , and are subsets of . Now, observe that Case (c) contradicts Property (P3); particularly and are not edge-disjoint in because has a cherry . Moreover, Case (b) contradicts Property (P2) because and are non isomorphic, where y is an element in . Hence, Case (a) must hold. Thus, one of , , and is an element of Z.
The weak and beaded hybrid numbers
We now consider the relationship between the rigid, temporal and so-called weak and beaded hybrid numbers. First, given two phylogenetic trees and on X, we define the weak hybrid number of and as
Note that for any two phylogenetic trees and there always exists a phylogenetic network that displays and . Hence, is well-defined. The weak hybrid number has been considered implicitly in The Parental Tree Network Problem (Zhu et al. 2016, Definition 5).
Next, recall that a beaded tree on X is a phylogenetic network on X in which parallel edges are allowed, and in which each reticulation vertex v has a unique parent u such that there are two parallel edges from u to v (van Iersel et al. 2018, Definition 7). In van Iersel et al. (2018, Definition 6) the concept of a weak embedding of a multi-labeled tree into a beaded tree is defined, which is closely related to our definition of a display map when restricted to phylogenetic trees.1 For two phylogenetic trees and on X, we define the beaded hybrid number for and to be
In the following result we summarize the relationship between the various hybrid numbers that we have considered:
Theorem 5
Let and denote two phylogenetic trees on X. Then
Moreover, if the rigid (or equivalently temporal) hybrid number for and exists, then
and in this case there exist trees and such that every inequality is strict.
Proof
To see that the first statement holds note that (van Iersel et al. 2018, Lemma 9) implies that any phylogenetic network on X that weakly displays two phylogenetic trees and on X can be transformed into a beaded tree on X such that there exist weak embeddings of and into for which (so in particular exists for any pair of phylogenetic trees and ). Hence, .
Now, consider the second statement in the theorem. Suppose that exists (or equivalently, that exists). By the first statement in the theorem , clearly also holds, and as remarked in the previous section, . Hence, all of the inequalities in the second statement hold.
We now show that there exist two phylogenetic trees and such that every inequality is strict. Consider the trees and on depicted in Fig. 9. Note that since and are not isomorphic, and there exist weak embeddings for and into the pictured beaded tree , respectively. Moreover, by considering fork-picking sequences it is straight-forward to check that , and by considering cherry picking sequences and using Humphries et al. (2013a, Theorem 2), it is straight-forward to check that . Hence, as and are not isomorphic, . In the Appendix we show that , and so it follows that .
Fig. 9.
Two phylogenetic trees and on whose beaded hybrid number is one ( is a beaded tree into which and can be weakly embedded), and whose weak hybrid number is two (the network weakly displays both trees)
Next consider the two phylogenetic trees and on the set in Fig. 10 in the Appendix. Then using similar arguments to those used to determine the various hybrid numbers for the trees in Fig. 9, it is straight-forward to check that , , and .
Fig. 10.

Two phylogenetic trees and on whose weak hybrid number is (see for a phylogenetic network that weakly displays both trees) and whose rigid hybrid number is (see for a temporal tree-child network which rigidly displays and )
Now, to complete the proof, consider the phylogenetic trees and on the set which are constructed as follows: For (respectively, ) take a new root vertex and join this vertex via two edges to the roots of the trees and (respectively, and ). Then it is straightforward to check that for . It follows that
Discussion
Motivated by the fact that in some evolutionary scenarios the notion of a network displaying two phylogenetic trees might be too restrictive, we have introduced and studied the concept of a network rigidly displaying a pair of phylogenetic trees. We have characterized when two trees can be rigidly displayed by a temporal tree-child network and, provided this is possible, have shown that their rigid hybrid number is given by a minimum weight fork-picking sequence for them. In addition, we have shown that the rigid hybrid number is different from the closely related temporal, weak, and beaded hybrid numbers.
There remain several open problems. First, it is well-known that the hybrid number is closely related to the size of a maximum agreement forest for two phylogenetic trees (Baroni et al. 2005a). It would therefore be of interest to know if there is some analogue of a maximum agreement forest for rigidly displaying two trees. Results in Humphries et al. (2013b), including the one mentioned above, concerning temporal agreement forests for two phylogenetic trees to be displayed by temporal tree-child networks could be useful for studying this question. In addition, it could be interesting to define and study rigid hybrid numbers for three or more phylogenetic trees. For example, we could try to understand r-rigidly displaying, where r is the maximum number of edges that come together at each reticulation vertex (note that in this paper we have investigated the concept of r-rigidly displaying for ). Recently, there has been work on understanding the hybrid number for arbitrary sets of trees (Linz and Semple 2019) which might be relevant.
More generally, several questions remain concerning the notion of weakly displaying. For example, it would be interesting to know how large the difference can potentially be between the hybrid number and the weak hybrid number for a collection of phylogenetic trees. As this appears to be a difficult problem, it might be worth first restricting to the case of understanding the “weak temporal tree-child hybrid number"; how different can this number be from the rigid hybrid number, and can we decide when a set of trees is weakly displayed by a temporal tree-child network? To answer these questions it could be worth first trying to decide whether or not two phylogenetic trees are rigidly displayed by some temporal tree-child network if and only if they are weakly displayed by some temporal tree-child network. In another direction, it would be interesting to understand how the weak- and rigid hybrid numbers behave for other classes of networks besides temporal tree-child networks.
Finally, an important open problem is to develop practical algorithms to compute networks with minimum rigid and/or weak hybrid numbers for two (or more) phylogenetic trees. The complexity of computing the weak and rigid hybrid numbers is unknown (see also Zhu et al. 2016, p.278), but we suspect these will probably be NP-hard in general. We base this suspicion in part on the fact that it is known that computing the minimum hybrid and temporal hybrid number for two trees is NP-hard (Bordewich and Semple 2007b; Humphries et al. 2013a). However, by gaining a better understanding of fork-picking sequences it may be possible to at least develop fixed parameter tractable algorithms to construct optimal networks, an approach that has already proven successful for hybrid and temporal hybrid numbers (Bordewich and Semple 2007a; Humphries et al. 2013a).
Acknowledgements
KTH and VM thank the London Mathematical Society and SL thanks the New Zealand Marsden Fund for their financial support. All authors thank the Biomathematics Research Centre, University of Canterbury, and The Lorentz Center, Leiden, where they discussed parts of this work. Also, all authors thank the two reviewers for their helpful comments and suggestions to improve the paper.
Appendix
In this appendix we prove that for the two phylogenetic trees and on pictured in Fig. 9. To simplify notation, we put and .
Suppose to the contrary that . Then there exists a phylogenetic network that weakly displays and such that . Let v be the unique vertex in . Let be a parent of v. Note that since has no parallel edges, u must exist. Also note that u must be a tree vertex of as v is the sole reticulation vertex of . Finally, note that the other child of u cannot be v as does not contain parallel edges. Denoting that child of u by x, we next claim that x must be a leaf of .
To see the claim, assume for contradiction that x is not a leaf. Let denote the subtree of rooted at x. Let be a leaf of and, thus, of . Then since weakly displays via a map say, and v is the sole reticulation vertex of we obtain . Similarly, as weakly displays via a map say, must hold. Thus, . Since Lemma 2(i) implies that as v is the sole reticulation vertex of , it follows that . Hence, by Lemma 2(ii), is also a pendant subtree of and of ; a contradiction as and are the two trees depicted in Fig. 9. Thus, x is a leaf of , as claimed.
Since every element in X is contained in a cherry of or , we may choose some such that is a cherry in or . Without loss of generality, assume that is that tree. Since the only two cherries of are and we may assume without loss of generality that . Let m denote the parent of x and y in .
Let be the parent of u which must exist as . Then as otherwise the fact that is a cherry of but not of implies that y is below v. But then is not weakly displayed by because is an edge of and is not an edge in ; a contradiction.
We next claim that (w, v) cannot be an edge in . To see this, assume for contradiction that (w, v) is an edge in . Then since is weakly displayed by and x is contained in a cherry of but not of it follows that y must be a leaf of below v. If there existed another leaf of below v then that leaf would have to be leaf 5. Since is a cherry of and is weakly displayed by it follows that that cherry must also be below v; a contradiction as is the phylogenetic tree depicted in Fig. 9. Thus, y is in fact the sole leaf of below v. But then must also be a cherry of ; a contradiction since and are the phylogenetic trees depicted in Fig. 9. Thus, (w, v) cannot be an edge in , as claimed. Hence, the other child of w, call it z, is either a leaf of or the root of a pendant subtree of .
Note first that arguments similar to the case of x imply that z must be a leaf of . Let denote the parent of w. We next distinguish between the cases that and .
If then . To see this, assume for contradiction that . Then is an edge in . Since is a cherry of and the parent m of x and y is not adjacent with it follows that is not weakly displayed by ; a contradiction. Thus, , as required.
We next claim that (p, v) cannot be an edge of . Assume for contradiction that (p, v) is an edge of . Then since is a cherry of and is weakly displayed by , similar arguments as before imply that leaf 5 must be the sole leaf of below v and that the unique directed path from to leaf 2 does not contain w. Since is one of the two phylogenetic trees depicted in Fig. 9 it follows that is not weakly displayed by ; a contradiction. Thus, (p, v) cannot be an edge of either.
Let q denote the other child of p. Then similar arguments as in the case of z imply that q must also be a leaf of . Hence, the phylogenetic tree on with cherry is a pendant subtree of . Since p is a tree vertex of , Lemma 2(ii) implies that is a pendant subtree of and ; a contradiction in view of Fig. 9. Hence, in case .
Assume for the remainder that . Then y is a descendant of v in . Since z is a leaf of , it follows that the phylogenetic tree on with cherry is a pendant subtree of . Since we must have . But then as is a cherry of and is weakly displayed by ; a final contradiction. Hence, and, so, Theorem 5 follows.
Footnotes
It differs from our definition of a display map since it allows edges to be mapped to paths of length 0.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Katharina T. Huber, Email: K.Huber@uea.ac.uk
Simone Linz, Email: s.linz@auckland.ac.nz.
Vincent Moulton, Email: v.moulton@uea.ac.uk.
References
- Bapteste E, van Iersel L, Janke A, Kelchner S, Kelk S, McInerney JO, Morrison DA, Nakhleh L, Steel M, Stougie L, et al. Networks: expanding evolutionary thinking. Trends Genet. 2013;29(8):439–441. doi: 10.1016/j.tig.2013.05.007. [DOI] [PubMed] [Google Scholar]
- Baroni M, Grünewald S, Moulton V, Semple C. Bounding the number of hybridization events for a consistent evolutionary history. J Math Biol. 2005;51(2):171–182. doi: 10.1007/s00285-005-0315-9. [DOI] [PubMed] [Google Scholar]
- Baroni M, Semple C, Steel M. A framework for representing reticulate evolution. Ann Comb. 2005;8(4):391–408. doi: 10.1007/s00026-004-0228-0. [DOI] [Google Scholar]
- Bordewich M, Semple C. Computing the hybridization number of two phylogenetic trees is fixed-parameter tractable. IEEE/ACM Trans Comput Biol Bioinform. 2007;4(3):458–466. doi: 10.1109/tcbb.2007.1019. [DOI] [PubMed] [Google Scholar]
- Bordewich M, Semple C. Computing the minimum number of hybridization events for a consistent evolutionary history. Discrete Appl Math. 2007;155(8):914–928. doi: 10.1016/j.dam.2006.08.008. [DOI] [Google Scholar]
- Cardona G, Llabres M, Rossello F, Valiente G. Metrics for phylogenetic networks ii: nodal and triplets metrics. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 2009;6(3):454–469. doi: 10.1109/TCBB.2008.127. [DOI] [PubMed] [Google Scholar]
- Cardona G, Rossello F, Valiente G. Comparison of tree-child phylogenetic networks. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 2009;6(4):552–569. doi: 10.1109/TCBB.2007.70270. [DOI] [PubMed] [Google Scholar]
- Degnan JH. Modelling hybridization under the network multispecies coalescent. Syst Biol. 2016;67(5):786–799. doi: 10.1093/sysbio/syy040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Döcker J, van Iersel L, Kelk S, Linz S. Deciding the existence of a cherry-picking sequence is hard on two trees. Discrete Appl Math. 2019;260:131–143. doi: 10.1016/j.dam.2019.01.031. [DOI] [Google Scholar]
- Huber KT, Moulton V. Phylogenetic networks from multi-labelled trees. J Math Biol. 2006;52(5):613–632. doi: 10.1007/s00285-005-0365-z. [DOI] [PubMed] [Google Scholar]
- Huber KT, Moulton V, Steel M, Wu T. Folding and unfolding phylogenetic trees and networks. J Math Biol. 2016;73(6–7):1761–1780. doi: 10.1007/s00285-016-0993-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Humphries PJ, Linz S, Semple C. Cherry picking: a characterization of the temporal hybridization number for a set of phylogenies. Bull Math Biol. 2013;75(10):1879–1890. doi: 10.1007/s11538-013-9874-x. [DOI] [PubMed] [Google Scholar]
- Humphries PJ, Linz S, Semple C. On the complexity of computing the temporal hybridization number for two phylogenies. Discrete Appl Math. 2013;161(7–8):871–880. doi: 10.1016/j.dam.2012.11.022. [DOI] [Google Scholar]
- Huson DH, Linz S. Autumn algorithm—computation of hybridization networks for realistic phylogenetic trees. IEEE/ACM Trans Comput Biol Bioinform. 2016;15(2):398–410. doi: 10.1109/TCBB.2016.2537326. [DOI] [PubMed] [Google Scholar]
- Huson DH, Rupp R, Scornavacca C. Phylogenetic networks: concepts, algorithms and applications. Cambridge: Cambridge University Press; 2010. [Google Scholar]
- Linz S, Semple C. Attaching leaves and picking cherries to characterize the hybridization number for a set of phylogenies. Adv Appl Math. 2019;105:102–129. doi: 10.1016/j.aam.2019.01.004. [DOI] [Google Scholar]
- McDiarmid C, Semple C, Welsh D. Counting phylogenetic networks. Ann Comb. 2015;19(1):205–224. doi: 10.1007/s00026-015-0260-2. [DOI] [Google Scholar]
- Moret BM, Nakhleh L, Warnow T, Linder CR, Tholse A, Padolina A, Sun J, Timme R. Phylogenetic networks: modeling, reconstructibility, and accuracy. IEEE/ACM Trans Comput Biol Bioinform. 2004;1(1):13–23. doi: 10.1109/TCBB.2004.10. [DOI] [PubMed] [Google Scholar]
- Nakhleh L (2010) Evolutionary phylogenetic networks: models and issues. In: Problem solving handbook in computational biology and bioinformatics. Springer, pp 125–158
- Steel M. Phylogeny: discrete and random processes in evolution. Philadelphia: SIAM; 2016. [Google Scholar]
- van Iersel L, Semple C, Steel M. Locating a tree in a phylogenetic network. Inf Process Lett. 2010;110(23):1037–1043. doi: 10.1016/j.ipl.2010.07.027. [DOI] [Google Scholar]
- van Iersel L, Janssen R, Jones M, Murakami Y, Zeh N (2018) Polynomial-time algorithms for phylogenetic inference problems. In: International conference on algorithms for computational biology. Springer, pp 37–49 [DOI] [PubMed]
- Zhu S, Degnan JH. Displayed trees do not determine distinguishability under the network multispecies coalescent. Syst Biol. 2016;66(2):283–298. doi: 10.1093/sysbio/syw097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu J, Yu Y, Nakhleh L. In the light of deep coalescence: revisiting trees within networks. BMC Bioinform. 2016;17(14):415. doi: 10.1186/s12859-016-1269-1. [DOI] [PMC free article] [PubMed] [Google Scholar]






