Skip to main content
Springer logoLink to Springer
. 2021 Mar 26;82(5):40. doi: 10.1007/s00285-021-01594-2

The rigid hybrid number for two phylogenetic trees

Katharina T Huber 1, Simone Linz 2, Vincent Moulton 1,
PMCID: PMC7997861  PMID: 33770290

Abstract

Recently there has been considerable interest in the problem of finding a phylogenetic network with a minimum number of reticulation vertices which displays a given set of phylogenetic trees, that is, a network with minimum hybrid number. Such networks are useful for representing the evolution of species whose genomes have undergone processes such as lateral gene transfer and recombination that cannot be represented appropriately by a phylogenetic tree. Even so, as was recently pointed out in the literature, insisting that a network displays the set of trees can be an overly restrictive assumption when modeling certain evolutionary phenomena such as incomplete lineage sorting. In this paper, we thus consider the less restrictive notion of rigidly displaying which we introduce and study here. More specifically, we characterize when two trees can be rigidly displayed by a certain type of phylogenetic network called a temporal tree-child network in terms of fork-picking sequences. These are sequences of special subconfigurations of the two trees related to the well-studied cherry-picking sequences. We also show that, in case it exists, the rigid hybrid number for two phylogenetic trees is given by a minimum weight fork-picking sequence for the trees. Finally, we consider the relationship between the rigid hybrid number and three closely related numbers; the weak, beaded, and temporal hybrid numbers. In particular, we show that these numbers can all be different even for a fixed pair of trees, and also present an infinite family of pairs of trees which demonstrates that the difference between the rigid hybrid number and the temporal-hybrid number for two phylogenetic trees on the same set of n leaves can grow at least linearly with n.

Keywords: Phylogenetic network, Hybrid number, Cherry-picking sequence, Fork-picking sequence, Weakly displaying, Rigidly displaying, Temporal tree-child network, Beaded tree

Introduction

Recently there has been great interest in using phylogenetic networks to model processes such as lateral gene transfer and recombination (see e.g. Bapteste et al. 2013). Such networks come in various forms (see e.g. Huson et al. 2010), and here we shall consider explicit networks which are used to provide direct representations of evolutionary histories in the form of a leaf-labelled directed graph. Formally speaking, a phylogenetic network (on a species set X) is a connected directed acyclic graph, with a single root and leaf-set X in which every internal vertex has either indegree one and outdegree two, or indegree two and outdegree one, except for the root which has outdegree two. We call the number of vertices in a phylogenetic network with indegree two the network’s hybrid number, so that a phylogenetic tree is a network with hybrid number zero. We shall mainly focus on temporal tree-child networks, i.e. phylogenetic networks in which each non-leaf vertex has a child whose indegree is one, whose vertices can be labelled with time stamps that move strictly forward on treelike parts of the network, and so that vertices with indegree two have parents with the same time stamp [also known as tree-child, time-consistent networks (Cardona et al. 2009a)].

Any phylogenetic network on some set X displays a set of phylogenetic trees on X, where a phylogenetic tree is displayed by a network if there is a subgraph of the network that is isomorphic to a subdivision of the tree (van Iersel et al. 2010). Biologically speaking, we usually think of the trees displayed by a network as being gene trees, and the fact that a network is required to represent them simultaneously is the consequence of incongruence between the gene trees that can arise from processes such as lateral gene transfer (Zhu et al. 2016). As gene trees are now commonly inferred from genomic data (e.g. by considering either a gene or a genomic locus of interest) it is therefore natural to try and devise ways to construct phylogenetic networks through the process of looking for some network which displays a given set of gene trees (see e.g. Nakhleh 2010, Section 2). For a given set of phylogenetic trees, this leads to the concept of the (temporal) hybrid number, which is the minimum hybrid number taken over all (temporal tree-child) networks that display each of the trees in the set (Baroni et al. 2005b; Humphries et al. 2013a). While the hybrid number exists for any set of phylogenetic trees, it is worth noting that the temporal hybrid number does not always exist, i.e. there are sets of trees that cannot be displayed simultaneously in a temporal tree-child network.

Several results have been presented in the literature concerning displaying phylogenetic trees and hybrid numbers, mainly for pairs of trees. These include structural information on how the hybrid number is related to a so-called maximum acyclic agreement forest for two phylogenetic trees (Baroni et al. 2005a), characterizations for when collections of trees are displayed by special types of networks (Humphries et al. 2013a; Linz and Semple 2019) and related algorithms/complexity results (Bordewich and Semple 2007a, b; Döcker et al. 2019; Humphries et al. 2013b; Huson and Linz 2016). However, all of these results rely on the fact that the networks display the set of trees in question. Recently it has been observed that this is a serious issue when modelling a phenomenon called incomplete lineage sorting where the aim is to model gene tree incongruence arising due to population effects. This is because the set of displayed trees is then no longer able to fully capture the way in which the genes actually evolve along the network, making it difficult to recover the underlying network (Zhu et al. 2016; Zhu and Degnan 2016; Degnan 2016).

A possible solution to this problem is to relax the displaying condition. Roughly speaking, a phylogenetic tree is weakly displayed by a network (Huber et al. 2016) if it can be embedded in the network in such a way that the tree follows along the directed paths in the network (see Sect. 3 for the definition). In biological terms, as nicely explained in van Iersel et al. (2018), “this means that different lineages of the gene tree may “travel down” the same branch of the network, as long as any branching node in the tree coincides with a branching node in the network”. In this paper, we focus on the special situation where two phylogenetic trees T and T are weakly displayed by a temporal tree-child network under the assumption that there exist simultaneous embeddings of both trees that do not permit more than three branches of T and T to come together at a reticulation vertex. In this case we shall say that T and T are rigidly displayed by the network (see Fig. 1 for an example). As relatively little is known about the problem of constructing phylogenetic networks that weakly display a collection of phylogenetic trees, we believe that our results on rigidly displaying provide some useful new insights into approaching this challenging problem. Ultimately this should hopefully also lead to improved approaches to modeling phenomena such as incomplete lineage sorting.

Fig. 1.

Fig. 1

Two phylogenetic trees on the set {A,B,C,D} that are rigidly displayed by the phylogenetic network on the right. Note that the tree in the top left is not displayed by the network. The trees and network are adapted from (Zhu et al. 2016, Fig. 2, Fig. 3) and represent a hypothetical evolutionary scenario tracing the evolution of a genomic region within four species

We now summarize the rest of the paper, including statements of our main results. After presenting some definitions in Sect. 2, in Sect. 3 we present the definition of weakly displaying, and we prove some basic facts concerning this concept that are useful later on. In addition, in Theorem 1 we characterize when two phylogenetic trees are displayed by a temporal tree-child network and when they are weakly displayed such a network.

In Sects. 4 and 5, we introduce the concepts of rigidly displaying and fork-operations, respectively, and prove some results concerning these concepts which we use later on. A fork-operation can be thought of as a generalization of picking off a pair of cherries from two phylogenetic trees as defined in Humphries et al. (2013a). In particular, in a key result, Proposition 1, we show that in case a temporal tree-child network rigidly displays two phylogenetic trees, there exists a certain sequence of fork-operations (called a special sequence) which can be applied to these two trees that allows us to apply inductive arguments to prove our main results. Using this, in Theorem 2 we prove that a pair of phylogenetic trees can be rigidly displayed by a temporal tree-child network if and only if there exists a fork-picking sequence for the two trees. Interestingly, we also prove that this is equivalent to the existence of a temporal tree-child network that displays the two trees. As a corollary we show that it is NP-complete to decide whether or not there exists a temporal tree-child network that rigidly displays two phylogenetic trees (Corollary 1).

In Sect. 7, we define the rigid hybrid number of two trees, which is the minimum hybrid number of a phylogenetic network that rigidly displays both trees, taken over all temporal tree-child networks. To capture this number, we consider the weighted fork-picking sequences for a pair of phylogenetic trees, showing in Theorem 3 that, in case it exists, the rigid hybrid number of two phylogenetic trees is equal to the minimum taken over the weights of all possible fork-picking sequences for the two trees. This result can be regarded as analogue of Humphries et al. (2013a, Theorem 2).

In Sect. 8, we consider the relationship between the rigid hybrid number and three closely related hybrid numbers: the weak, beaded, and temporal hybrid numbers (the beaded hybrid number was implicitly defined in van Iersel et al. (2018)). In particular, in Theorem 4, we first show that there is a pair of phylogenetic trees on a set X with |X| arbitrarily large, so that the difference between the temporal and rigid hybrid numbers for these two trees is at least |X|4-3. Then in Theorem 5, we show that there exist two phylogenetic trees whose beaded, weak, rigid, and temporal hybrid numbers are all distinct from one another. We conclude in Sect. 9 by presenting some open problems and discussing some possible future directions of research.

Preliminaries

Let G denote a directed, acyclic graph with a single root, i.e., a vertex with indegree zero. Let V(G) denote the vertex set of G, E(G) the set of (directed) edges of G, and ρG the unique root of G. A vertex in G with indegree one and outdegree zero is called a leaf; an edge of G incident with a leaf of G a pendant edge of G. Furthermore, we denote the set of all leaves of G by L(G).

Suppose vV(G). We say that a vertex uV(G) is above v if there exists a directed path from u to v (note that u could equal v). If u is above v, then we also write uGv or simply uv if G is clear from the context. Furthermore, we say that v is below u. We call any vertex above v an ancestor of v and any vertex below v a descendant of v. Finally, we say that two distinct edges e=(u,v) and e=(u,v) of G are comparable if v is above u or v is above u. Otherwise we say that e and e are incomparable.

Let X be a finite set of size at least 2. Following e.g. Huber et al. (2016, p.1764) a rooted, directed acyclic graph N is called a phylogenetic network (on X) if the following properties are satisfied:

  1. the root ρN of N has indegree zero and outdegree two,

  2. X is the set of leaves of N, and

  3. each remaining vertex of N has either indegree one and outdegree two, or indegree two and outdegree one.

Unless stated otherwise, phylogenetic networks do not contain parallel edges.

Now let N be a phylogenetic network. We refer to a vertex of N with indegree two and outdegree one as a reticulation vertex, and to a vertex with indegree one and outdegree zero or two as a tree vertex. The set of reticulation vertices of N is denoted by Ret(N). We put h(N)=|Ret(N)|. Moreover, we call a directed path P from a vertex v to a leaf x in a phylogenetic network a tree-path if each vertex on P, except possibly v, is a tree vertex.

A phylogenetic tree (on X) is a phylogenetic network on X that does not have any reticulation vertices. We say that two phylogenetic trees T and T on X are isomorphic, denoted by TT, if there exists a bijection V(T)V(T) that induces a graph isomorphism between T and T that is the identity on X. If T is a phylogenetic tree on X, and YX, then the last common ancestor of Y, denote by lcaT(Y), is the unique vertex v of T that is an ancestor of every element in Y and there is no vertex wv such that w is a descendant of v and w is an ancestor of every element in Y. For any 2l|X| elements x1,,xlX, we sometimes also write lcaT(x1,,xl) rather than lcaT({x1,,xl}). Now let YV(T). We denote by T(Y) the minimal subtree of T that connects all vertices in Y and by T|Y the tree obtained from T(Y) by suppressing all vertices that have indegree one and outdegree one. We refer to T|Y as the restriction of T to Y. Note that, if Y is a subset of X such that |Y|2, then the root of T(Y) equals lcaT(Y).

Suppose N is a phylogenetic network on X. Following Steel (2016), we say that N is temporal (Moret et al. 2004) if there exists a map t:V(N)R0 such that, for all (p,q)E(N), we have t(p)=t(q) whenever q is a reticulation vertex and t(p)<t(q), otherwise. In that case, we call t a temporal labelling of N. Unless of relevance to the discussion, we always omit the temporal labelling when depicting a temporal network. We say that N is tree-child (Cardona et al. 2009b) if, for each non-leaf vertex vV(N) at least one of the children of v is a tree vertex. Note that a tree-child network was called a phylogenetic network in Humphries et al. (2013a, p.1883) and that a temporal tree-child network (in our sense) was called a (binary) time-consistent tree-child network, or TCTC-network in Cardona et al. (2009a).

Weakly displaying two trees in a network

To define and understand the notion of rigidly displaying, it is useful to first consider the more general notion of weakly displaying. As well as recalling the definition of weakly displaying, we derive some of its basic properties which will be useful later, and explain how the concept of rigidly displaying is related to the stronger notion of displaying (see Theorem 1).

Let T be a phylogenetic tree on X and let N be a phylogenetic network on X. Furthermore, let ψ be a map that maps each vertex of T to a vertex of N and each edge e=(u,v) of T to a directed path from the image of u under ψ to the image of v under ψ. To distinguish the mapping of a vertex v from that of an edge e in T, we use ψ(v) to denote the vertex in N that v is mapped to under ψ and ψ[e] to denote the directed path in N that e is mapped to under ψ. We call ψ a display map for T in N if the following properties hold:

  • (i)

    for each xX, ψ(x)=x,

  • (ii)

    for all vV(T), ψ(v) is a tree vertex or the root of N,

  • (iii)

    for each edge e of T, ψ[e] contains at least one edge of N, and

  • (iv)

    for any two distinct edges e=(u,v) and e=(u,v) of T, the first edge of ψ[e] is different from the first edge of ψ[e] in N.

Note that the definition of a display map ψ is equivalent to ψ being a locally separated reconciliation as defined in Huber et al. (2016, Section 7).

Now, let ψ be a display map for T in N, and let P be a directed path of T. It follows immediately from the definition of ψ, that the edge set

eE(P)E(ψ[e])

induces a directed path in N. We will freely use this fact throughout the remainder of the paper.

Following Huber et al. (2016), we say that T is weakly displayed by N if there exists a (not necessarily unique) display map for T in N. To reduce notation, we will sometimes not explicitly refer to the display map. Note that if N displays T as defined in the introduction then N also weakly displays T. However, the converse is not necessarily true. For example, referring to Fig. 2, T is weakly displayed by N for the indicated display map but T is not displayed by N.

Fig. 2.

Fig. 2

Two phylogenetic trees T and T on X={a,b,c,d} that are weakly displayed by the network N on X via the indicated display maps ψ and ψ for T and T, respectively, for which γψ.ψ(w)2 holds for all wV(N). However, T and T are not both displayed by N (the tree T is not displayed)

The notion of weakly displayed was introduced in Huber et al. (2016) in terms of a construction that allows the unfolding of a phylogenetic network on X into a so-called multi-labelled tree on X (Huber and Moulton 2006). Such trees are similar to phylogenetic trees in that they have no vertices with in- and outdegree one and the root has indegree zero. However the requirement that the leaf-set is X is relaxed to the requirement that an element of X can label more than one leaf (which is not allowed in the case of phylogenetic trees).

Now, suppose that ψ is a display map for T in N. For any edge e=(u,v)E(T), we denote by ψ[e]-ψ(u) the set of all vertices in V(N) that lie on ψ[e] except for ψ(u). Let wV(N), and let e=(u,v) be an edge of T. If ψ(v)=w, we say that ψ[e] ends at w and if wψ[e]-ψ(u), but ψ(v)w we say that ψ[e] passes through w. In addition, we define

γψ(w)=|{e=(u,v)E(T):wψ[e]-ψ(u)}|,

i.e., γψ(w) counts the number of edges in T such that their image under ψ either ends or passes through w so that in particular γψ(ρN)=0. Finally, if T is an additional phylogenetic tree on X that is weakly displayed by N via a display map ψ, then we put

γ(w)=γψ,ψ(w)=γψ(w)+γψ(w)

(see e.g. Fig. 2). To reduce notation we sometimes drop the subscript in γψ,ψ as indicated if ψ and ψ are clear from the context.

We now prove two lemmas about display maps which will be useful later.

Lemma 1

Let N be a temporal tree-child network on X and let T be a phylogenetic tree on X. Then the following holds.

  • (i)

    N displays T if and only if there exists a display map ψ for T in N such that, for all wV(N), we have γψ(w)1,

  • (ii)

    if ψ is a display map for T in N as specified in Assertion (i) then ψ(ρT)=ρN.

Proof

(i) Assume first that N displays T. Then since a phylogenetic network on X that displays a phylogenetic tree on X clearly also weakly displays that tree, it is straight-forward to see that there exists a display map ψ for T in N such that, for all wV(N), we have γψ(w)1.

Conversely, assume that there exists a display map ψ for T in N such that, for all wV(N), we have γψ(w)1. Then, for each vRet(N), there exists at most one edge eE(T) such that ψ[e] passes through v. Note that since v is not a tree-vertex of N, there cannot exist an edge in eE(T) such that ψ[e] ends in v. Since N is tree-child it follows that the subgraph N of N with vertex set V(N) and edge set eE(T)ψ[e] is a rooted tree with leaf set X. Furthermore, the outdegree of the root of N must be two as N is temporal and, therefore, N cannot contain a shortcut, that is, if there exists a directed path from a vertex u to a vertex v in N which contains at least three vertices then there cannot also be the edge (uv) in N. Since N is isomorphic to a subdivision of T it follows that N displays T.

(ii) Assume for contradiction that ψ(ρT) is not the root ρN of N. Since N is temporal tree-child, the two children u and v of ρN must be two distinct tree vertices. Moreover, there must exist a tree-path px in N from u to some leaf x and a tree-path py from v to some leaf y. Note that since px and py cannot intersect, we must have xy. Since ψ is a display map for T in N that satisfies the properties of Assertion (i) and x and y are also leaves of T it follows that lcaT(x,y) is mapped to an ancestor of x and y in N under ψ as N must display T. Extending the paths px and py to tree-paths starting at ρN implies that this ancestor must be ρN. Thus, ψ(ρT)=ρN.

To state the second lemma we require some further definitions. We call a subgraph N of N a pendant subnetwork of N if there exists a tree vertex v in N such that when deleting the incoming edge of v the network decomposes into two connected components where the component that contains v in its vertex set is a phylogenetic network N on L(N)L(N). For technical reasons, we consider N to be a pendant subnetwork of itself. A pendant subtree of N is a pendant subnetwork of N that is a phylogenetic tree. Note that a pendant subnetwork and therefore also a pendant subtree must have at least two leaves. Also, note that in what follows, if N is a phylogenetic network that weakly displays a phylogenetic tree via a display map ψ and T is a pendant subtree of N, then in order to simplify notation we shall identify T with its pre-image under ψ.

Lemma 2

Let N be a phylogenetic network on X that weakly displays two distinct phylogenetic trees T and T on X via display maps ψ and ψ, respectively. Furthermore, let v be a tree vertex of N. Then the following hold.

  • (i)

    If γψ,ψ(v)3, then there is a vertex in Ret(N) which is an ancestor of v in N.

  • (ii)

    If γψ,ψ(v)=2 and v has a child that is the root of a pendant subtree T in N, then the pre-image of T under ψ and ψ, respectively is also a pendant subtree of the respective tree T and T.

Proof

(i) Suppose that v is a tree vertex of N such that γψ,ψ(v)3. Then, without loss of generality, we may assume that T is such that γψ(v)2. Hence, there are two distinct edges e=(u,w) and e=(u,w) in T such that vψ[e]-ψ(u) and vψ[e]-ψ(u).

We show first that e and e are incomparable in T. Indeed, assume for contradiction that e and e are comparable. Without loss of generality, we may assume that w is above u in T. Then the edges on the directed path P in T starting at u and ending at w are collectively mapped by ψ to a directed path Pe,e in N with edge set fE(P)E(ψ[f]). Note that Pe,e contains ψ[e] and ψ[e] as (directed) subpaths and that ψ[e] cannot be a subpath of ψ[e] and vice versa as otherwise Property (iii) in the definition of a display map cannot hold. Now, since vψ[e]-ψ(u) and vψ[e]-ψ(u) it follows that N contains a directed cycle; a contradiction. Hence, e and e are incomparable.

Let q=lcaT(u,u), and let P (resp. P) be the directed path in T from q to w (resp. w). Furthermore, let Pe and Pe be the directed paths in N with edge sets

fE(P)E(ψ[f])andfE(P)E(ψ[f]),

respectively. As the first vertex on Pe and Pe is ψ(q), it follows from Property (iv) in the definition of a display map that the first edge on Pe is different from the first edge on Pe. Moreover, since e is an edge of P and e is an edge of P, it follows that ψ[e] is a subpath of Pe and ψ[e] is a subpath of Pe. Hence, v is a vertex on Pe and Pe. Consequently, there is some vertex in Ret(N) which lies on Pe and Pe and which is an ancestor of v.

(ii) Note that every leaf x in T must be contained in the image under ψ of some directed path in T from the root of T to x, and similarly for T. Since, by assumption, γψ,ψ(v)=2, there can be at most one such path in T and T, respectively, which has this property for every leaf in T. Since the leaf set of N, T, and T, respectively, is X it follows that the pre-image of T under ψ and ψ, respectively, must be a pendant subtree of the respective tree T and T, as otherwise γψ,ψ(v)3.

In Fig. 2, we presented an example where two phylogenetic trees T and T are weakly displayed by the depicted phylogenetic network N, γ(w)2 for all wV(N) but T and T are not both displayed by N. So, in general, it does not suffice to insist that γ(w)2 for all wV(N) for two phylogenetic trees to be displayed by a phylogenetic network. However, if we insist that the network is temporal tree-child, we now show that this condition actually suffices.

Theorem 1

Suppose that N is a temporal tree-child network on X and that T and T are two phylogenetic trees on X that are weakly displayed by N via display maps ψ and ψ, respectively. Then the following statements are equivalent.

  • (i)

    N displays T and T.

  • (ii)

    γψ,ψ(v)=2 for all vV(N)-{ρN}.

  • (iii)

    γψ,ψ(w)=2 for all wRet(N).

Proof

(i) (ii) We show first that γψ,ψ(v)2 must hold for all vV(N)-{ρN}. Assume for contradiction that there exists some vertex vV(N) such that γψ,ψ(v)1. Then one of γψ(v)=0 or γψ(v)=0 must hold. Without loss of generality we may assume that γψ(v)=0. Then there exists no edge eE(T) such that ψ[e] either passes through v or ends in v. But then there cannot exist a leaf x of N that can be reached from v via a tree-path. Thus, N is not tree-child; a contradiction. Since, by assumption, N displays both T and T, Lemma 1 implies that γ(v)2 for all vV(N)-{ρN}. Thus, γ(v)=2 must hold for all vV(N)-{ρN}.

(ii) (iii) This is trivial.

(iii) (i) By Lemma 1 it suffices to show that γψ(v)1 and γψ(v)1 holds for all vV(N). Assume for contradiction that there exists some vV(N) and some tree in {T,T}, say T, such that γψ(v)2. Then vρN. We claim that v is a tree vertex. Assume for contradiction that this is not the case. Then vRet(N). Since N is tree-child there exists a leaf xX below v that can be reached from v via a tree-path. Hence, any directed path from ρN to x must contain v. Thus, γψ(v)1 which, in turn, implies that γψ,ψ(v)=γψ(v)+γψ(v)2+1=3; a contradiction in view of Assertion (iii). Thus, v is a tree vertex, as claimed.

By Lemma 2(i), it follows that there must exist some vertex wRet(N) that is an ancestor of v. Without loss of generality, we may assume that w is such that no vertex in V(N) distinct from w that is above v and below w is contained in Ret(N). Then there is a unique directed path P from w to v in N. Since γψ,ψ(v)3 and, γψ,ψ(w)=2, there is a reticulation vertex contained in P that is not w; a contradiction to the choice of w.

Remark 1

Note that using the same proofs it can be seen that both Lemma 1 and Theorem 1 also hold for the more general class of normal networks. These are phylogenetic networks that in addition to being tree-child do not contain a shortcut as defined in the proof of Lemma 1 (see e.g. McDiarmid et al. 2015, p. 208).

Rigidly displaying two trees in a network

We now introduce the notion of rigidly displaying and present some of its basic properties. In Theorem 1, we showed that if N is a temporal tree-child network which weakly displays two phylogenetic trees T and Tvia display maps ψ and ψ, then in case γψ,ψ(v)=2 for all vV(N)-{ρN} it follows that N actually displays N. To define rigidly displaying we relax this latter condition as follows.

We say that a phylogenetic network N on X rigidly displays two phylogenetic trees T and T on X if N weakly displays T and T via display maps ψ and ψ respectively, for all vRet(N) we have γψ,ψ(v)3 and, for each parent wV(N) of a reticulation vertex vRet(N), we have γψ,ψ(w)2. For example, the network pictured in Fig. 1 rigidly displays the two phylogenetic trees depicted in that figure (where ψ and ψ are the obvious display maps).

Note that, in contrast to the definitions of displaying and weakly displaying which refer to a single phylogenetic tree, rigidly displaying always refers to two phylogenetic trees. In addition, by Theorem 1 it follows that if T and T are two phylogenetic trees on X that are displayed by a temporal tree-child network N on X, then N also rigidly displays T and T.

We conclude this section by presenting two technical lemmas concerning rigidly displaying trees in tree-child networks which we will use later.

Lemma 3

Suppose N is a tree-child network on X and that N rigidly displays two phylogenetic trees T and T on X via display maps ψ and ψ, respectively. Then 2γψ,ψ(v)3 for all vV(N)-{ρN}.

Proof

Put γ=γψ,ψ and suppose vV(N)-{ρN}. Then since N rigidly displays T and T it also weakly displays T and T. Since vρN and there exists some leaf below v that can be reached from v via a tree-path as N is tree-child, it follows in view of Property (ii) in the definition of a display map that γψ(v)1 and that γψ(v)1. Hence, 2γ(v).

For the remainder, assume for contradiction that there exists some vV(N)-{ρN} such that γ(v)4. Then v must be a tree vertex of N as N rigidly displays T and T and γ(ρN)=0. Let P:v1,v2,,vk,v be a longest directed path of tree vertices in N that ends at v. Note that γ(vi)γ(v), for all 1ik. Also note that since ρN is not a tree vertex of N, we cannot have v1=ρN. Let wV(N) denote the parent of v1. Then γ(w)4. Hence, we cannot have w=ρN. Since N rigidly displays T and T it follows that w must be a tree vertex of N. But then the extension of P by w results in a directed path of tree vertices of N that ends in v and that is longer than P; a contradiction.

Note that the converse of the last lemma does not hold in general (see e. g. Fig. 3).

Fig. 3.

Fig. 3

The two phylogenetic trees on the left are weakly displayed via the indicated display maps by the network depicted on the right. However, they are not rigidly displayed by the network for those maps because γ(v)=3 and v is the parent of a reticulation vertex

Lemma 4

Suppose that N is a tree-child network on X that rigidly displays two phylogenetic trees T and T on X and that ψ is the underlying display map for T in N. If e=(u,v) is an edge of T such that ψ[e] passes through a vertex wRet(N), then ψ(u) must be a parent of w in N.

Proof

Suppose e=(u,v) is an edge in T such that ψ[e] passes through a vertex wRet(N). Assume for contradiction that ψ(u) is not a parent of w. Let p be the parent of w in N such that p lies on ψ[e]. Then ψ(u)p. As N is tree-child, there must be a tree-path in N starting at p and ending at some leaf xX. So, as x is a leaf of T and ψ(u)p, there must be some edge ee in T such that ψ[e] passes through or ends at p. Moreover, considering the leaf x again, there must be an edge in T which maps to a path in N via the underlying display map ψ for T in N that either ends at or passes through p. It follows that γψ,ψ(p)3; a contradiction as N rigidly displays T and T and p is the parent of a vertex in Ret(N).

Note that, as the example of the two phylogenetic trees and the network in Fig. 2 shows, the assumption that N is tree-child is necessary for Lemma 4 to hold.

Fork operations

In the next section we shall characterize when two phylogenetic trees are rigidly displayed by a temporal tree-child network in terms of sequences of certain operations on these trees. The basis for these sequences are fork-operations which we shall now introduce.

First, we recall that two leaves x and y of a phylogenetic tree T with xy are called a cherry of T, denoted by {x,y}, if x and y have the same parent. Now, by a fork we mean a 2-leaved rooted tree (i. e. a cherry), a 3-leaved rooted tree (a 3-fork) or a 4-leaved fully-balanced rooted tree (a 4-fork). The following basic fact concerning forks is straight-forward to show.

Lemma 5

Suppose T is a phylogenetic tree with n3 leaves. If n=3 then T is a 3-fork and if n4 then T must contain a pendant subtree that is either a 3-fork or a 4-fork.

Now, a (type-i) fork-operation o=o(x), 0i3, is an operation that can be applied to a pair of phylogenetic trees T and T on X for which there exists a leaf xX, together with a fork in each of T and T containing x as depicted in the second and third columns of Fig. 4 (for example, for a type-2 operation one tree has a 3-fork ((zx), y) and the other a cherry {x,y} with x,y,zX distinct). In the 4th and 5th columns of Fig. 4 the result of applying the operation o(x) to the two trees is pictured (for example, applying a type-2 operation to the 3-fork and the cherry in row 3 results in a phylogenetic tree with cherry {z,y} and a phylogenetic tree whose cherry {x,y} has been replaced by y). In particular, when we apply an operation o to some element xX, we remove the leaf x and its incident edge from both trees, and suppress any resulting vertices of degree 2, also removing the root and both edges incident with it in case |X|=2. In case the type i of the fork-operation is of importance we write oi(x) instead of o(x). We present an example of applying a sequence of fork-operations to two phylogenetic trees in Fig. 5.

Fig. 4.

Fig. 4

The four fork-operations each applied to leaf x in two phylogenetic trees, as defined in the text

Fig. 5.

Fig. 5

For the two pictured phylogenetic trees T and T on X={x1,,x6}, we depict a fork-picking sequence (S1,C2) for T and T which has weight 1. In that sequence, S1 is the special sequence (o3(x5), o2(x3),o2(x2),o1(x4)), and o0(x1) makes up C2. The forks in T and T to which a fork-operation is applied are indicated by dotted triangles

Now, given two phylogenetic trees T and T on the set X={x1,,xn}, n3, we call a sequence (o(x1),o(x2),,o(xl)) of l fork-operations, 1ln-2, a special sequence for T and T if o(xl) is a type-1 operation on T|X-{x1,,xl-1} and T|X-{x1,,xl-1} and, in case l>1, the following properties hold:

  • (i)

    There exists some T{T,T} such that each o(xi), 1il-1, is a type-2 or a type-3 operation on T|X-{x1,,xi-1} and T|X-{x1,,xi-1} and the associated 3- or 4-fork is a pendant subtree of T|X-{x1,,xi-1},

  • (ii)

    there exist two distinct elements p,qX-{x1,x2,,xl} such that the last-but-one operation o(xl-1) is a type-2 operation with fork (p,(xl-1,xl)) and cherry {xl-1,p} and the last operation is the type-1 operation o(xl) applied to the cherries {p,xl} and {q,xl}, and

  • (iii)

    if l>2, then xi must be below lcaT(xl-1,xl) for all 1il-2 for the tree T in (i).

To illustrate this definition, consider the two phylogenetic trees T and T on X={x1,x2,,x6} depicted in Fig. 5. Then (o3(x5),o2(x3),o2(x2),o1(x4)) is a special sequence for T and T where, for example, o3(x5) is a fork-operation of type-3 and the tree with cherry (x4,x5) is the tree T mentioned in the definition of a fork-picking sequence. Note that an application of the last operation in a special sequence always results in phylogenetic trees with at least two leaves.

We conclude this section with an observation concerning special sequences which will be key to proving our main results.

Proposition 1

Suppose that N is a temporal tree-child network on X, |X|3, that rigidly displays two phylogenetic trees T and T on X. If no type-0 operation can be applied to T and T, then there is a special sequence σ for T and T. Moreover, the two phylogenetic trees resulting from applying σ to T and T can be rigidly displayed by a temporal tree-child network N with h(N)=h(N)-10.

Proof

Note first that h(N)>0 as otherwise N would be a phylogenetic tree that is isomorphic with both T and T implying that a type-0 operation can be applied to T and T; a contradiction.

Let t:V(N)R0 denote a temporal labelling for N and pick some vRet(N) whose value is maximum under t. Let u and w be the parents of v. Since N does not contain parallel arcs, u and w are two distinct tree vertices of N. Since N is tree-child, there must exist some child p of u that is not v and, similarly, there must exist a child q of w that is not v. We claim that p is a leaf of N.

To see that this claim holds, assume for contradiction that p is the root of a pendant subgraph N of N. Note that the choice of v implies that N is in fact a pendant subtree of N. Moreover, Lemma 3 implies that there are display maps for T and T in N such that γ(u)=2. By Lemma 2(ii) it follows that N is a pendant subtree of both T and T. Hence, T and T have a common cherry and, so, we can apply a type-0 operation to T and T; a contradiction. Thus p must be a leaf of N. Applying similar arguments to q implies that q must also be a leaf of N.

Since N is tree-child, the choice of v implies that the child s of v is a leaf of N or the root of a pendant subtree of N. Assume first that s is a leaf of N. Then since T and T are rigidly displayed by N and T and T do not contain a common cherry, it is straight-forward to see using Lemma 4 that, without loss of generality, T and T must contain the cherries {p,s} and {q,s}, respectively. Hence we can apply a type-1 operation to s. This gives a special sequence of length 1 for T and T, from which the proposition follows.

So, suppose that s is the root of a pendant subtree T1 of N, so that T1 has at least two leaves. Note first that γ(v)=3. Indeed, since T and T are rigidly displayed by N and N is tree-child, we obtain 2γ(v)3 in view of Lemma 3. If γ(v)=2 held then T and T would have a common cherry which implies that a type-0 operation can be applied to T and T; a contradiction. We can therefore assume without loss of generality that T1 is a pendant subtree of T, and that this tree together with the leaf p also forms a pendant subtree of T.

In case T1 has only two leaves x and y, say, then since γ(v)=3 it follows that T contains the 3-fork (p, (xy)) and T contains, without loss of generality, the cherries {p,y} and {q,x}. Hence we can apply a type-2 operation to y and then apply a type-1 operation to x (since T|X-{y} and T|X-{y} must contain the cherries {p,x} and {q,x}, respectively). It follows that (o2(y),o1(x)) is a special sequence for T and T since Property (i) of a special sequence holds for T=T and Property (ii) holds for p and q as defined above. The remainder of the proposition is a straight-forward consequence.

Assume for the remainder of the proof that T1 has at least two leaves. Put Y=L(T1)={x1,,xk}, k2. We claim that there exists a special sequence σ=(o(x1),,o(xk)) for T and T with T=T and p and q as defined above. We prove the claim by induction on k.

Note that we have just shown that the claim holds for the base case k=2. So suppose the claim holds for all k, k2, and that |Y|=k+1. Note that as k+13, Lemma 5 implies that T1 contains either a 3-fork or a 4-fork.

Suppose T1 contains a 3-fork τ=(a,(b,c)) where a,b,cX are distinct. Then τ must be a pendant subtree of T. As T and T have no cherries in common and γ(v)=3, it follows that T contains the cherry {a,b} or the cherry {a,c}. Without loss of generality, we may assume that T contains the cherry {a,c}. Hence, we can apply the type-2 operation o(c) to c. Note that this creates a cherry {a,b} in T|X-{c} which, in view of Property (iv) in the definition of a display map, is not a cherry in T|X-{c}. Clearly, the network obtained from N by removing c and its incoming edge (suppressing the resulting indegree and outdegree one vertex) is temporal tree-child and rigidly displays T|X-{c} and T|X-{c}. By induction, we obtain a special sequence σc=(o(x1),o(x2),,o(xk)) for T|X-{c} and T|X-{c} with T=T|X-{c} and p and q as defined above. We postulate that (o2(c),σc) is a special sequence for T and T with T=T and p and q as defined above. Since o(c) is a type-2 operation applied to c it suffices to show that Property (iii) of a special sequence is satisfied for c, that is, c is below lcaT(xk,xk-1). But this clearly holds since bL(T1|Y-{c}) and any ancestor of b is also an ancestor of c as {b,c} is a cherry in τ. Thus, (o2(c),σc) is a special sequence with the stated properties, as required.

Suppose T1 contains a 4-fork τ=((a,b),(c,d)) where a,b,c,dX are distinct. Then τ must again be a pendant subtree of T. As T and T have no cherries in common and γ(v)=3, it follows that, as before, we may assume without loss of generality that T contains the cherries {a,c} and {b,d}. Hence, we can apply the type-3 operation o(c) to c. Note that this creates a 3-fork (d, (ab)) in T|X-{c} and that {a,b} is not a cherry in T|X-{c}. Clearly, the network obtained from N by removing c and its incoming edge (suppressing the resulting indegree and outdegree one vertex) is again temporal tree-child and rigidly displays T|X-{c} and T|X-{c}. By induction, we obtain a special sequence σc=(o(x1),o(x2),,o(xk)) for T|X-{c} and T|X-{c} with T=T|X-{c} and p and q as defined above. Employing similar arguments as in the previous case implies that (o3(c),σc) is a special sequence for T and T with T=T and p and q as defined above. This completes the proof of the claim.

To complete the proof, note that as T1 is a pendant subtree of N, we can remove T1 and v (plus all its incident edges) from N, and suppress the resulting vertices of degree two to obtain a network N with h(N)=h(N)-10. As N rigidly displays T and T, it follows that N rigidly displays their restrictions T|X-L(T1) and T|X-L(T1). Moreover, as N is tree-child and vRet(N), we have that N is also tree-child. Since p and q are leaves of N and N is temporal, it follows that N is temporal.

Fork-picking sequences

In this section we characterize when two phylogenetic trees are rigidly displayed by a temporal tree-child network, in terms of a generalization of special sequences which we now introduce. Suppose that T and T are two phylogenetic trees on X={x1,,xn} where n2 and that σ=(o(x1),o(x2),,o(xn-1)) is a sequence of fork-operations for T and T where o(xi) is on T|X-{x1,,xi-1} and T|X-{x1,,xi-1}, for all 1in-1. Then we call σ a fork-picking sequence for T and T if σ is of the form (C1,S1,C2,S2,,Ck,Sk,Ck+1), some k0, such that

  • (i)

    for all 1ik+1, we have that Ci is a possibly empty (except in case i=k+1) sequence of type-0 operations, and

  • (ii)

    for all 1ik, we have that Si is a special sequence for T|Y and T|Y, where Y={xp,xp+1,,xn} and o(xp) is the first operation in Si (so that, in particular, n-p+13).

To ease readability, we omit all those Ci that are empty when writing down fork-picking sequences. Note that it follows from the definition that any fork-picking sequence can be decomposed in a unique way into the form (C1,S1,C2,S2,,Ck,Sk,Ck+1), and that all of the subsequences Si are non-empty.

To illustrate this definition, note that σ=(o3(x5),o2(x3),o2(x2),o1(x4),o0(x1)) is a fork-picking sequence for the two phylogenetic trees depicted in Fig. 5, since it is of the form (C1,S1,C2) where S1=(o3(x5),o2(x3),o2(x2),o1(x4)) is the special sequence for T and T considered in the previous section, C2=(o0(x1)), and C1 is the empty sequence.

We now provide a link between h(N) for a temporal tree-child network N that rigidly displays two phylogenetic trees and fork-picking sequences for these trees. We define the weight w(σ) of a fork-picking sequence σ to be the number of special sequences in σ (or, equivalently, the number of type-1 operations in σ since any special sequence contains precisely one type-1 operation).

Proposition 2

Suppose that N is a temporal tree-child network on X that rigidly displays two phylogenetic trees T and T on X. Then there is a fork-picking sequence σ for T and T with h(N)w(σ).

Proof

Clearly, the theorem holds for |X|=2. So assume that |X|3. We prove the theorem by induction on h(N). If h(N)=0, then N, T, and T are all isomorphic to one another. But then we can take a fork-picking sequence σ for T and T consisting solely of type-0 operations (i.e. σ=(C1)), and so w(N)=0=h(N).

Now, assume that h(N)=k, some k>0, and that the theorem holds for all temporal tree-child networks N with 0h(N)<k.

Apply type-0 operations to T and T until no more can be applied. If the resulting sequence C1 of operations has length |X|-1, then it is a fork-picking sequence for T and T and w(σ)=0<k=h(N). Hence, the theorem holds. Otherwise, let T1 and T1 be the phylogenetic trees resulting after applying the operations in C1, noting that |L(T1)|=|L(T1)|3.

Since by construction no type-0 operation can be applied to T1 and T1, by Proposition 1 it follows that there is a special sequence S1 for T1 and T1, and that the two phylogenetic trees T2 and T2 resulting from applying S1 can be rigidly displayed by a temporal tree-child network N with h(N)=h(N)-10.

It follows by induction that there is a fork-picking sequence σ=(C1,S1,,Ck+1), some k0, for T2 and T2 such that k-1=h(N)w(σ)=k. Hence, σ=(C1,S1,C1,S1,,Ck+1) is a fork-picking sequence for T and T such that h(N)=kw(σ).

Now, as defined in Humphries et al. (2013a), we say for n=|X| that an ordering σ=(x1,x2,,xn-1,xn) of X is a cherry-picking sequence for two phylogenetic trees T and T on X if, for all 1in-1, xi is contained in a cherry in both Ti=T|X-{x1,,xi-1} and Ti=T|X-{x1,,xi-1}. In addition, the cherry-count ci(σ){0,1} associated to xi is 1 if the cherries in Ti and Ti containing xi are different and 0 else.

Note that every cherry-picking sequence σ=(x1,x2,,xn) for two phylogenetic trees T and T on X gives rise to a fork-picking sequence for T and T. Namely, we make a sequence of operations (o(x1),,o(xn-1)) with a type-1 operation applied to xi if ci(σ)=1 and a type-0 operation applied to xi if ci(σ)=0. In addition, any fork-picking sequence (o(x1),,o(xn-1)) for two phylogenetic trees T and T on X clearly gives rise to the cherry-picking sequence (x1,x2,,xn). For example, the cherry-picking sequence (x5,x3,x2,x4,x1,x6) with cherry counts (1, 1, 1, 1, 0, 0) arises from the fork-picking sequence σ given at the beginning of this section for the two phylogenetic trees in Fig. 5. Using these observations we obtain the following result.

Theorem 2

Suppose that T and T are two phylogenetic trees on X. Then the following statements are equivalent.

  • (i)

    T and T are rigidly displayed by a temporal tree-child network on X.

  • (ii)

    T and T are displayed by a temporal tree-child network on X.

  • (iii)

    there is a cherry-picking sequence for T and T.

  • (iv)

    there is a fork-picking sequence for T and T.

Proof

(ii) (i) If two phylogenetic trees are displayed by a phylogenetic network then they are rigidly displayed by that network.

(iii) (ii) Apply (Humphries et al. 2013a, Theorem 1), which states that two phylogenetic trees are displayed by a temporal tree-child network if and only if there is a cherry-picking sequence for them.

(i) (iv) Apply Proposition 2.

(iv) (iii) Apply the observation stated before the statement of the corollary i. e. , that a fork-picking sequence gives rise to a cherry-picking sequence.

Note that the temporal tree-child networks whose existence is guaranteed in Theorem 2(i) and (ii) need not be the same.

Theorem 2 also sheds light on the following decision problem:

Rigidly Displaying

Input: Two phylogenetic trees T and T on X.

Output: Does there exist a temporal tree-child network on X that rigidly displays T and T?

Indeed, Theorem 2 and the main result in Döcker et al. (2019, Theorem 1) (which states that it is NP-complete to decide whether or not there is a cherry-picking sequence for two phylogenetic trees) immediately imply:

Corollary 1

The decision problem Rigidly Displaying is NP-complete.

The rigid hybrid number of two trees

Suppose that there is some fork-picking sequence for two phylogenetic trees T and T on X (or equivalently by Theorem 2, T and T are rigidly displayed by some temporal tree-child network on X). We define

sr(T,T)=min{w(σ):σisafork-pickingsequenceforTandT},

and

hr(T,T)=min{h(N):Nistemporaltree-childandrigidlydisplaysTandT}.

We call hr(T,T) the rigid (temporal tree-child) hybrid number for T and T.

In this section, we prove:

Theorem 3

If two phylogenetic trees T and T on X are rigidly displayed by some temporal tree-child network on X, then sr(T,T)=hr(T,T).

This theorem is an immediate consequence of Proposition 2 and the following result.

Proposition 3

Suppose T and T are two phylogenetic trees on X and that σ is a fork-picking sequence for T and T. Then there exists a temporal tree-child network Nσ on X which rigidly displays T and T and such that w(σ)h(Nσ).

Proof

We establish the theorem using induction on w(σ).

If w(σ)=0 then σ=(C1), where C1 consists solely of type-0 operations. Hence T and T are isomorphic and the required temporal tree-child network Nσ is given by T.

Now suppose that σ is a fork-picking sequence for T and T with weight w(σ)=k, some k>0, and that, for any two phylogenetic trees T1 and T1, the theorem holds for all fork-picking sequences σ for T1 and T1 with w(σ)<k.

As w(σ)=k, σ is of the form (C1,S1,C2,S2,,Sk,Ck+1). Let Y be the set of elements y in X such that o(y) is not in the sequence (C1,S1). Then, as Ck+1 is not the empty sequence, σ=(C2,S2,,Sk,Ck+1) is a fork-picking sequence for T|Y and T|Y, with w(σ)=k-1. By induction, it follows that there is a temporal tree-child network N with h(N)w(σ) that rigidly displays T|Y and T|Y. Let t denote a temporal labelling for N.

We now construct a temporal tree-child network Nσ from N. We first consider the case that C1 is the empty sequence. We illustrate this case in Fig. 6 in terms of the fork-picking sequence considered in Fig. 5.

Fig. 6.

Fig. 6

The construction of the temporal tree-child network Nσ for the fork-picking sequence σ=(C1,S1,C2) for the phylogenetic trees T and T in Fig. 5 as described in the proof of Proposition 3. Note that C1 is the empty sequence. (i) Insertion of x4. (ii) Insertion of x2. (iii) Insertion of x3 and x5

Let Y1={y1,,yl}, l1, be such that S1=(o(y1),,o(yl)). To ease notation, put x=yl and, if l2, y=yl-1.

Assume first that l=1. Since o(x) is a type-1 operation, there exist pqY such that, without loss of generality, {p,x} is a cherry in T|Y{x} and {q,x} is a cherry in T|Y{x}. Subdivide the pendant edges in N incident with p and q by adding new vertices up and uq, respectively. Also, add in the leaf x below a newly added reticulation vertex v which has parents up and uq. Denote the resulting phylogenetic network by N. Clearly, since N is tree-child we also have that N is tree-child. Returning to the fork-picking sequence example considered in Fig. 5, we have for Fig. 6(i) that Y={x1,x6} and that T|Y, T|Y and N are all isomorphic. Also y=x2, x=x4, p=x1, and q=x6.

For wp and wq the parents of up and uq in N, respectively, choose some value t(uq) with max{t(wp),t(wq)}<t(uq)<min{t(p),t(q)} and put t(uq)=t(up)=t(v). Also, choose some t(x) such that t(v)<t(x) so that, together with t, we obtain a temporal labelling t for N. Clearly, N rigidly displays T|Y{x} and T|Y{x}. Putting Nσ=N this completes the proof of the theorem in case l=1 since h(N)=h(N)+1w(σ)+1=w(σ).

Assume now that l=2. We explain how to insert y into N. By definition of a special sequence, o(y) is a type-2 operation. Without loss of generality, we may assume that T|Y{x,y} has a fork (p, (xy)) and that T|Y{x,y} has a cherry {y,p} with p as in operation o(yl). Then we can construct a new temporal tree-child network N which rigidly displays T|Y{x,y} and T|Y{x,y} by inserting a pendant edge incident with y via a subdivision vertex s on the pendant edge of N incident with x to form a cherry {x,y} in N. Define the temporal labelling t of N by putting t(y)=t(x), choosing some value t(s) with t(v)<t(s)<t(x), and appropriately using t. Note that h(N)=h(N) and so the theorem is also proven for Nσ=N if l=2.

Assume next that l3. Suppose that we have a special sequence. Also, suppose that we have created a temporal tree-child network N from N by successively inserting, for some i0 all leaves yj, i+1jl-2, below the parent of the cherry {x,y} in N to create a pendant subtree T¯ of N with leaf set Y2={yi+1,,yl-2,y,x}Y1 so that N rigidly displays the phylogenetic trees T|YY2 and T|YY2. Without loss of generality, we may assume that T is the tree T in the definition of a special sequence for T and T.

We now show how to insert yi into N. Consider operation o(yi). Then, by definition of a special sequence, o(yi) is either a type-2 operation or a type-3 operation, for which the 3-fork and 4-fork, respectively, is a pendant subtree of T1=T|YY2{yi} and, in view of Property (iii) of a special sequence, yi is below lcaT(y,x). Put T1=T|YY2{yi}.

If o(yi) is a type-2 operation, then let (a,(yi,b)) denote the 3-fork of T1 where a,bX-{yi} distinct. Then {a,b} must be a cherry in T|YY2. Since yi is below lcaT(x,y), it follows that {a,b} is a cherry in T¯. We can therefore first insert yi into the pendant edge of N incident with b and then extend the temporal labelling of N so as to obtain a temporal tree-child network that rigidly displays T1 and T1.

If o(yi) is a type-3 operation, then let ((a,yi),(b,c)) denote the 4-fork in T1 where a,b,cX-{yi} are pairwise distinct. Then (a, (bc)) must be a 3-fork in T1|YY2. As yi is below lcaT(x,y) it follows that this 3-fork must also be a pendant subtree of T¯. We can therefore first insert yi into the pendant edge of N incident with a and then extend the temporal labelling of N so as to obtain a temporal network that rigidly displays T1 and T1. Again for the fork-picking sequence example considered in Fig. 5, we have for Fig. 6(iii) that o(x3) is a type-2 operation with a=x4 and b=x2 and o(x5) is a type-3 operation with a=x4 and {b,c}={x2,x3}.

In summary, we can insert all of the elements of Y into N in this way until we obtain a temporal tree-child network Nσ with h(Nσ)=h(N)+1 which rigidly displays T and T. It follows by induction that

w(σ)=w(σ)+1h(N)+1=h(Nσ),

which completes the proof of the theorem in case C1 is empty.

Now, suppose C1 is not empty. Then we first insert all elements of Y1 into N as described in the case that C1 is the empty sequence above to obtain a network N1 which rigidly displays T|YY1 and T|YY1 and for which w(σ)h(N1) holds. Into N1 we then insert all elements zX for which o0(z) is contained in C1 in the (reverse) order specified by C1, and at each step creating a cherry specified by o(z), to obtain a new temporal tree-child network Nσ which rigidly displays T and T. Clearly, h(Nσ)=h(N1). Since σ=(C1,S1,C2,S2,,Sk,Ck+1) is a fork-picking sequence for T and T and w(σ)=w(σ) the theorem holds in this case too.

The relationship between the rigid, temporal, weak and beaded hybrid numbers

As mentioned in the introduction, there are various ways to define hybrid numbers for two phylogenetic trees. In this section, we consider the relationship between the rigid hybrid number of two phylogenetic trees and three closely related hybrid numbers (the temporal, weak, and beaded hybrid numbers), in particular, showing that they can all be different from one another. From a biological viewpoint, the results that we present in this section are important as they show that in principle quite different estimates can arise for the number of reticulation events required to accommodate two phylogenetic trees depending on the model being used to embed the two trees in a phylogenetic network.

The temporal hybrid number

For two phylogenetic trees T and T on X that can be displayed by some temporal tree-child network, the temporal hybrid number of T and T (Humphries et al. 2013a) is defined as

ht(T,T)=min{h(N):Nisatemporaltree-childnetworkthatdisplaysTandT}.

Note that in case this number exists, the temporal hybrid number for the two phylogenetic trees is not necessarily equal to their hybrid number Humphries et al. (2013a, Figure 1; also p. 1889).

Now, given two phylogenetic trees T and T on X, Theorem 2 implies that the temporal hybrid number ht(T,T) of T and T exists if and only if the rigid hybrid number hr(T,T) exists. Clearly, if these numbers both exist, then hr(T,T)ht(T,T). In this section, we show that the difference ht(T,T)-hr(T,T) can grow at least as a linear function of |X|.

To this end, assume that m3. Consider two phylogenetic trees T and T with 2m+2 leaves, as given in Fig. 7. In that figure, T1 and T2 are both fully balanced phylogenetic trees with 2m-1 leaves each and T3 is a fully balanced phylogenetic tree with 2m leaves. We next describe the labeling of T and T. Let {S1,S2,,S2m-2} be the set of all pendant subtrees on four leaves in T3. Then, for each i{1,2,,2m-2}, we bijectively label the leaves of Si with {4i-3,4i-2,41-1,4i}. Turning to the subtrees T1 and T2 of T, let {S1,S3,,S2m-2-1} be the set of all pendant subtrees on four leaves in T1 and, similarly, let {S2,S4,,S2m-2} be the set of all pendant subtrees on four leaves in T2. Then, for each

i{1,3,,2m-2-1},

bijectively label the eight leaves of Si and Si+1 with {4i-3,4i-2,,4i+4} such that Si has cherries {4i-3,4i} and {4i+1,4i+4} and Si+1 has cherries {4i-2,4i-1} and {4i+2,4i+3}. For m=4, the leaf labeling of T and T is illustrated in Fig. 8.

Fig. 7.

Fig. 7

Two phylogenetic trees T and T on the set X={1,2,,2m+2} as defined in the text. Both are rigidly displayed by the depicted temporal tree-child network N on X

Fig. 8.

Fig. 8

The two phylogenetic trees T and T in Fig. 7 for the case m=4

Theorem 4

For the two phylogenetic trees T and T on X={1,,2m+2}, m3, pictured in Fig. 7, we have ht(T,T)-hr(T,T)|X|4-3.

Proof

First note that hr(T,T)=1, as T and T are rigidly displayed by the temporal tree-child network N pictured in Fig. 7.

We now show that ht(T,T)2m-2-1 from which the theorem follows. First note that, by Theorem 2, there must exist a temporal tree-child network that displays both T and T. By Humphries et al. (2013b, Theorem 3.3), it follows that ht(T,T) is equal to the number of components in a maximum temporal agreement forest for T and T minus 1, where such a forest is defined as follows. All phylogenetic trees considered in the definition are “planted" by adding a new root plus an edge to their roots, and trees with one leaf are also allowed. Let k=ht(T,T). Then a collection {T0,,Tk} of k+1 planted trees is a maximum temporal agreement forest for (planted versions of) T and T if the following three properties hold, where Xi=L(Ti), 0ik.

  1. The set Z={X0,Xk} is a partition of X.

  2. For all 0ik, TiT|XiT|Xi.

  3. Denoting for 0ik the root of Ti by ρi, there exist injective maps
    χ:{ρ0,ρ1,,ρk}V(T)andχ:{ρ0,ρ1,,ρk}V(T)
    such that any two trees in {T(Xi{χ(ρi)}):0ik} and {T(Xi{χ(ρi)}):0ik} are edge-disjoint rooted subtrees of T and T, respectively.

We now claim that for every subset {4i-3,,4i} of X with 1i2m-2, at least one of the sets {4i-3}, {4i-2}, and {4i-3,4i-2} must be contained in Z. This implies that k+12m-2 from which the theorem immediately follows. For simplicity, we prove the claim for i=1. Specifically, we show that, for the subset {1,2,3,4} of X, at least one of {1}, {2}, and {1,2} is an element of Z. The argument for the remaining cases 2i2m-2 is similar.

Assume that {1,2} is not an element of Z. Since Z is a partition of X, one of the following three cases must hold:

  1. {1} or {2} is an element in Z,

  2. {1,2}Y1 is an element in Z, or

  3. {1}Y2 and {2}Y3 are elements in Z,

where Y1, Y2, and Y3 are subsets of X-{1,2}. Now, observe that Case (c) contradicts Property (P3); particularly {1}Y2 and {2}Y3 are not edge-disjoint in T because T has a cherry {1,2}. Moreover, Case (b) contradicts Property (P2) because T|{1,2,y} and T|{1,2,y} are non isomorphic, where y is an element in Y1. Hence, Case (a) must hold. Thus, one of {1}, {2}, and {1,2} is an element of Z.

The weak and beaded hybrid numbers

We now consider the relationship between the rigid, temporal and so-called weak and beaded hybrid numbers. First, given two phylogenetic trees T and T on X, we define the weak hybrid number hwd(T,T) of T and T as

hwd(T,T)=min{h(N):Nis a phylogenetic network that weakly displaysTandT}.

Note that for any two phylogenetic trees T and T there always exists a phylogenetic network that displays T and T. Hence, hwd(T,T) is well-defined. The weak hybrid number has been considered implicitly in The Parental Tree Network Problem (Zhu et al. 2016, Definition 5).

Next, recall that a beaded tree B on X is a phylogenetic network on X in which parallel edges are allowed, and in which each reticulation vertex v has a unique parent u such that there are two parallel edges from u to v (van Iersel et al. 2018, Definition 7). In van Iersel et al. (2018, Definition 6) the concept of a weak embedding of a multi-labeled tree into a beaded tree is defined, which is closely related to our definition of a display map when restricted to phylogenetic trees.1 For two phylogenetic trees T and T on X, we define the beaded hybrid number hb(T,T) for T and T to be

hb(T,T)=min{h(B):Bis a beaded tree such that there exist weakembeddings ofTandTintoB}.

In the following result we summarize the relationship between the various hybrid numbers that we have considered:

Theorem 5

Let T and T denote two phylogenetic trees on X. Then

hb(T,T)hwd(T,T).

Moreover, if the rigid (or equivalently temporal) hybrid number for T and T exists, then

hb(T,T)hwd(T,T)hr(T,T)ht(T,T),

and in this case there exist trees T and T such that every inequality is strict.

Proof

To see that the first statement holds note that (van Iersel et al. 2018, Lemma 9) implies that any phylogenetic network N on X that weakly displays two phylogenetic trees T and T on X can be transformed into a beaded tree B on X such that there exist weak embeddings of T and T into B for which |Ret(N)|=|Ret(B)| (so in particular hb(T,T) exists for any pair of phylogenetic trees T and T). Hence, hb(T,T)hwd(T,T).

Now, consider the second statement in the theorem. Suppose that hr(T,T) exists (or equivalently, that ht(T,T) exists). By the first statement in the theorem hb(T,T)hwd(T,T), hwd(T,T)hr(T,T) clearly also holds, and as remarked in the previous section, hr(T,T)ht(T,T). Hence, all of the inequalities in the second statement hold.

We now show that there exist two phylogenetic trees T and T such that every inequality is strict. Consider the trees T1 and T1 on X={1,,6} depicted in Fig. 9. Note that hb(T1,T1)=1 since T1 and T1 are not isomorphic, and there exist weak embeddings for T1 and T1 into the pictured beaded tree N, respectively. Moreover, by considering fork-picking sequences it is straight-forward to check that hr(T1,T1)=2, and by considering cherry picking sequences and using Humphries et al. (2013a, Theorem 2), it is straight-forward to check that ht(T1,T1)=4. Hence, as T1 and T1 are not isomorphic, 1hwd(T1,T1)hr(T1,T1)=2. In the Appendix we show that hwd(T1,T1)1, and so it follows that hwd(T1,T1)=2.

Fig. 9.

Fig. 9

Two phylogenetic trees T1 and T1 on X={1,,6} whose beaded hybrid number hb(T1,T1) is one (N is a beaded tree into which T1 and T1 can be weakly embedded), and whose weak hybrid number hwd(T1,T1) is two (the network N weakly displays both trees)

Next consider the two phylogenetic trees T2 and T2 on the set Y={a,b,c,d,e} in Fig. 10 in the Appendix. Then using similar arguments to those used to determine the various hybrid numbers for the trees in Fig. 9, it is straight-forward to check that hb(T2,T2)=hwd(T2,T2)=1, hr(T2,T2)=2, and ht(T2,T2)=3.

Fig. 10.

Fig. 10

Two phylogenetic trees T2 and T2 on Y={a,,e} whose weak hybrid number is hwd(T2,T2)=1 (see N for a phylogenetic network that weakly displays both trees) and whose rigid hybrid number is hr(T2,T2)=2 (see N for a temporal tree-child network which rigidly displays T2 and T2)

Now, to complete the proof, consider the phylogenetic trees T and T on the set XY which are constructed as follows: For T (respectively, T) take a new root vertex and join this vertex via two edges to the roots of the trees T1 and T2 (respectively, T1 and T2). Then it is straightforward to check that h(T,T)=h(T1,T1)+h(T2,T2) for h=hb,hwd,hr,ht. It follows that

hb(T,T)=2<hwd(T2,T2)=3<hr(T2,T2)=4<hr(T2,T2)=7.

Discussion

Motivated by the fact that in some evolutionary scenarios the notion of a network displaying two phylogenetic trees might be too restrictive, we have introduced and studied the concept of a network rigidly displaying a pair of phylogenetic trees. We have characterized when two trees can be rigidly displayed by a temporal tree-child network and, provided this is possible, have shown that their rigid hybrid number is given by a minimum weight fork-picking sequence for them. In addition, we have shown that the rigid hybrid number is different from the closely related temporal, weak, and beaded hybrid numbers.

There remain several open problems. First, it is well-known that the hybrid number is closely related to the size of a maximum agreement forest for two phylogenetic trees (Baroni et al. 2005a). It would therefore be of interest to know if there is some analogue of a maximum agreement forest for rigidly displaying two trees. Results in Humphries et al. (2013b), including the one mentioned above, concerning temporal agreement forests for two phylogenetic trees to be displayed by temporal tree-child networks could be useful for studying this question. In addition, it could be interesting to define and study rigid hybrid numbers for three or more phylogenetic trees. For example, we could try to understand r-rigidly displaying, where r is the maximum number of edges that come together at each reticulation vertex (note that in this paper we have investigated the concept of r-rigidly displaying for r=3). Recently, there has been work on understanding the hybrid number for arbitrary sets of trees (Linz and Semple 2019) which might be relevant.

More generally, several questions remain concerning the notion of weakly displaying. For example, it would be interesting to know how large the difference can potentially be between the hybrid number and the weak hybrid number for a collection of phylogenetic trees. As this appears to be a difficult problem, it might be worth first restricting to the case of understanding the “weak temporal tree-child hybrid number"; how different can this number be from the rigid hybrid number, and can we decide when a set of trees is weakly displayed by a temporal tree-child network? To answer these questions it could be worth first trying to decide whether or not two phylogenetic trees are rigidly displayed by some temporal tree-child network if and only if they are weakly displayed by some temporal tree-child network. In another direction, it would be interesting to understand how the weak- and rigid hybrid numbers behave for other classes of networks besides temporal tree-child networks.

Finally, an important open problem is to develop practical algorithms to compute networks with minimum rigid and/or weak hybrid numbers for two (or more) phylogenetic trees. The complexity of computing the weak and rigid hybrid numbers is unknown (see also Zhu et al. 2016, p.278), but we suspect these will probably be NP-hard in general. We base this suspicion in part on the fact that it is known that computing the minimum hybrid and temporal hybrid number for two trees is NP-hard (Bordewich and Semple 2007b; Humphries et al. 2013a). However, by gaining a better understanding of fork-picking sequences it may be possible to at least develop fixed parameter tractable algorithms to construct optimal networks, an approach that has already proven successful for hybrid and temporal hybrid numbers (Bordewich and Semple 2007a; Humphries et al. 2013a).

Acknowledgements

KTH and VM thank the London Mathematical Society and SL thanks the New Zealand Marsden Fund for their financial support. All authors thank the Biomathematics Research Centre, University of Canterbury, and The Lorentz Center, Leiden, where they discussed parts of this work. Also, all authors thank the two reviewers for their helpful comments and suggestions to improve the paper.

Appendix

In this appendix we prove that hwd(T1,T1)1 for the two phylogenetic trees T1 and T1 on X={1,,6} pictured in Fig. 9. To simplify notation, we put T=T1 and T=T1.

Suppose to the contrary that hwd(T,T)=1. Then there exists a phylogenetic network N that weakly displays T and T such that h(N)=1. Let v be the unique vertex in Ret(N). Let uV(N)-{ρN} be a parent of v. Note that since N has no parallel edges, u must exist. Also note that u must be a tree vertex of N as v is the sole reticulation vertex of N. Finally, note that the other child of u cannot be v as N does not contain parallel edges. Denoting that child of u by x, we next claim that x must be a leaf of N.

To see the claim, assume for contradiction that x is not a leaf. Let T denote the subtree of N rooted at x. Let x be a leaf of T and, thus, of N. Then since N weakly displays T via a map ψ say, and v is the sole reticulation vertex of N we obtain γψ(u)1. Similarly, as N weakly displays T via a map ψ say, γψ(u)1 must hold. Thus, γψ,ψ(u)2. Since Lemma 2(i) implies that γψ,ψ(u)2 as v is the sole reticulation vertex of N, it follows that γψ,ψ(u)=2. Hence, by Lemma 2(ii), T is also a pendant subtree of T and of T; a contradiction as T and T are the two trees depicted in Fig. 9. Thus, x is a leaf of N, as claimed.

Since every element in X is contained in a cherry of T or T, we may choose some yX-{x} such that {x,y} is a cherry in T or T. Without loss of generality, assume that T is that tree. Since the only two cherries of T are {1,3} and {4,6} we may assume without loss of generality that {x,y}={1,3}. Let m denote the parent of x and y in T.

Let wV(N) be the parent of u which must exist as uρN. Then wρN as otherwise the fact that {x,y} is a cherry of T but not of T implies that y is below v. But then T is not weakly displayed by N because (ρN,u) is an edge of N and (ρT,m) is not an edge in T; a contradiction.

We next claim that (wv) cannot be an edge in N. To see this, assume for contradiction that (wv) is an edge in N. Then since T is weakly displayed by N and x is contained in a cherry of T but not of T it follows that y must be a leaf of N below v. If there existed another leaf of N below v then that leaf would have to be leaf 5. Since {5,6} is a cherry of T and T is weakly displayed by N it follows that that cherry must also be below v; a contradiction as T is the phylogenetic tree depicted in Fig. 9. Thus, y is in fact the sole leaf of N below v. But then {x,y} must also be a cherry of T; a contradiction since T and T are the phylogenetic trees depicted in Fig. 9. Thus, (wv) cannot be an edge in N, as claimed. Hence, the other child of w, call it z, is either a leaf of N or the root of a pendant subtree of N.

Note first that arguments similar to the case of x imply that z must be a leaf of N. Let pV(N) denote the parent of w. We next distinguish between the cases that z=y and zy.

If z=y then pρN. To see this, assume for contradiction that p=ρN. Then (ρN,w) is an edge in N. Since {x,y} is a cherry of T and the parent m of x and y is not adjacent with ρT it follows that T is not weakly displayed by N; a contradiction. Thus, pρN, as required.

We next claim that (pv) cannot be an edge of N. Assume for contradiction that (pv) is an edge of N. Then since {x,y} is a cherry of T and T is weakly displayed by N, similar arguments as before imply that leaf 5 must be the sole leaf of N below v and that the unique directed path from ρN to leaf 2 does not contain w. Since T is one of the two phylogenetic trees depicted in Fig. 9 it follows that T is not weakly displayed by N; a contradiction. Thus, (pv) cannot be an edge of N either.

Let q denote the other child of p. Then similar arguments as in the case of z imply that q must also be a leaf of N. Hence, the phylogenetic tree R on {q,x,y} with cherry {x,y} is a pendant subtree of T. Since p is a tree vertex of N, Lemma 2(ii) implies that R is a pendant subtree of T and T; a contradiction in view of Fig. 9. Hence, hwd(T,T)1 in case z=y.

Assume for the remainder that zy. Then y is a descendant of v in N. Since z is a leaf of N, it follows that the phylogenetic tree on {x,y,z} with cherry {x,y} is a pendant subtree of T. Since {x,y}={1,3} we must have z=5. But then x=6 as {5,6} is a cherry of T and T is weakly displayed by N; a final contradiction. Hence, hwd(T,T)1 and, so, Theorem 5 follows.

Footnotes

1

It differs from our definition of a display map since it allows edges to be mapped to paths of length 0.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Katharina T. Huber, Email: K.Huber@uea.ac.uk

Simone Linz, Email: s.linz@auckland.ac.nz.

Vincent Moulton, Email: v.moulton@uea.ac.uk.

References

  1. Bapteste E, van Iersel L, Janke A, Kelchner S, Kelk S, McInerney JO, Morrison DA, Nakhleh L, Steel M, Stougie L, et al. Networks: expanding evolutionary thinking. Trends Genet. 2013;29(8):439–441. doi: 10.1016/j.tig.2013.05.007. [DOI] [PubMed] [Google Scholar]
  2. Baroni M, Grünewald S, Moulton V, Semple C. Bounding the number of hybridization events for a consistent evolutionary history. J Math Biol. 2005;51(2):171–182. doi: 10.1007/s00285-005-0315-9. [DOI] [PubMed] [Google Scholar]
  3. Baroni M, Semple C, Steel M. A framework for representing reticulate evolution. Ann Comb. 2005;8(4):391–408. doi: 10.1007/s00026-004-0228-0. [DOI] [Google Scholar]
  4. Bordewich M, Semple C. Computing the hybridization number of two phylogenetic trees is fixed-parameter tractable. IEEE/ACM Trans Comput Biol Bioinform. 2007;4(3):458–466. doi: 10.1109/tcbb.2007.1019. [DOI] [PubMed] [Google Scholar]
  5. Bordewich M, Semple C. Computing the minimum number of hybridization events for a consistent evolutionary history. Discrete Appl Math. 2007;155(8):914–928. doi: 10.1016/j.dam.2006.08.008. [DOI] [Google Scholar]
  6. Cardona G, Llabres M, Rossello F, Valiente G. Metrics for phylogenetic networks ii: nodal and triplets metrics. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 2009;6(3):454–469. doi: 10.1109/TCBB.2008.127. [DOI] [PubMed] [Google Scholar]
  7. Cardona G, Rossello F, Valiente G. Comparison of tree-child phylogenetic networks. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 2009;6(4):552–569. doi: 10.1109/TCBB.2007.70270. [DOI] [PubMed] [Google Scholar]
  8. Degnan JH. Modelling hybridization under the network multispecies coalescent. Syst Biol. 2016;67(5):786–799. doi: 10.1093/sysbio/syy040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Döcker J, van Iersel L, Kelk S, Linz S. Deciding the existence of a cherry-picking sequence is hard on two trees. Discrete Appl Math. 2019;260:131–143. doi: 10.1016/j.dam.2019.01.031. [DOI] [Google Scholar]
  10. Huber KT, Moulton V. Phylogenetic networks from multi-labelled trees. J Math Biol. 2006;52(5):613–632. doi: 10.1007/s00285-005-0365-z. [DOI] [PubMed] [Google Scholar]
  11. Huber KT, Moulton V, Steel M, Wu T. Folding and unfolding phylogenetic trees and networks. J Math Biol. 2016;73(6–7):1761–1780. doi: 10.1007/s00285-016-0993-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Humphries PJ, Linz S, Semple C. Cherry picking: a characterization of the temporal hybridization number for a set of phylogenies. Bull Math Biol. 2013;75(10):1879–1890. doi: 10.1007/s11538-013-9874-x. [DOI] [PubMed] [Google Scholar]
  13. Humphries PJ, Linz S, Semple C. On the complexity of computing the temporal hybridization number for two phylogenies. Discrete Appl Math. 2013;161(7–8):871–880. doi: 10.1016/j.dam.2012.11.022. [DOI] [Google Scholar]
  14. Huson DH, Linz S. Autumn algorithm—computation of hybridization networks for realistic phylogenetic trees. IEEE/ACM Trans Comput Biol Bioinform. 2016;15(2):398–410. doi: 10.1109/TCBB.2016.2537326. [DOI] [PubMed] [Google Scholar]
  15. Huson DH, Rupp R, Scornavacca C. Phylogenetic networks: concepts, algorithms and applications. Cambridge: Cambridge University Press; 2010. [Google Scholar]
  16. Linz S, Semple C. Attaching leaves and picking cherries to characterize the hybridization number for a set of phylogenies. Adv Appl Math. 2019;105:102–129. doi: 10.1016/j.aam.2019.01.004. [DOI] [Google Scholar]
  17. McDiarmid C, Semple C, Welsh D. Counting phylogenetic networks. Ann Comb. 2015;19(1):205–224. doi: 10.1007/s00026-015-0260-2. [DOI] [Google Scholar]
  18. Moret BM, Nakhleh L, Warnow T, Linder CR, Tholse A, Padolina A, Sun J, Timme R. Phylogenetic networks: modeling, reconstructibility, and accuracy. IEEE/ACM Trans Comput Biol Bioinform. 2004;1(1):13–23. doi: 10.1109/TCBB.2004.10. [DOI] [PubMed] [Google Scholar]
  19. Nakhleh L (2010) Evolutionary phylogenetic networks: models and issues. In: Problem solving handbook in computational biology and bioinformatics. Springer, pp 125–158
  20. Steel M. Phylogeny: discrete and random processes in evolution. Philadelphia: SIAM; 2016. [Google Scholar]
  21. van Iersel L, Semple C, Steel M. Locating a tree in a phylogenetic network. Inf Process Lett. 2010;110(23):1037–1043. doi: 10.1016/j.ipl.2010.07.027. [DOI] [Google Scholar]
  22. van Iersel L, Janssen R, Jones M, Murakami Y, Zeh N (2018) Polynomial-time algorithms for phylogenetic inference problems. In: International conference on algorithms for computational biology. Springer, pp 37–49 [DOI] [PubMed]
  23. Zhu S, Degnan JH. Displayed trees do not determine distinguishability under the network multispecies coalescent. Syst Biol. 2016;66(2):283–298. doi: 10.1093/sysbio/syw097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Zhu J, Yu Y, Nakhleh L. In the light of deep coalescence: revisiting trees within networks. BMC Bioinform. 2016;17(14):415. doi: 10.1186/s12859-016-1269-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of Mathematical Biology are provided here courtesy of Springer

RESOURCES