Abstract
As phylogenetic networks grow increasingly complicated, systematic methods for simplifying them to reveal properties will become more useful. This paper considers how to modify acyclic phylogenetic networks into other acyclic networks by contracting specific arcs that include a set D. The networks need not be binary, so vertices in the networks may have more than two parents and/or more than two children. In general, in order to make the resulting network acyclic, additional arcs not in D must also be contracted. This paper shows how to choose D so that the resulting acyclic network is “pre-normal”. As a result, removal of all redundant arcs yields a normal network. The set D can be selected based only on the geometry of the network, giving a well-defined normal phylogenetic network depending only on the given network. There are CSD maps relating most of the networks. The resulting network can be visualized as a “wired lift” in the original network, which appears as the original network with each arc drawn in one of three ways.
Keywords: Phylogeny, Network, Phylogenetic network, CSD map, Normal network
Introduction
A phylogenetic tree is a directed tree whose vertices represent biological species, whose leaves typically correspond to known extant species, and whose branchings indicate speciation events, usually by genetic mutation. As such, internal vertices have in-degree one and out-degree at least two (except for the root with in-degree zero). In the last decades it has become clear that other events such as hybridization and lateral gene transfer are also important in evolution, even though they are not easily modeled using phylogenetic trees (Delwiche and Palmer 1996; Doolittle and Bapteste 2007; Inagaki et al. 2002; Jones et al. 2013). As a result there is interest in phylogenetic networks, in which some vertices have in-degree two or higher, corresponding to such events (Moret et al. 2004; Solís-Lemus et al. 2016). Overviews of phylogenetic networks may be found in Steel (2016) and Huson et al. (2010).
A phylogenetic X-network is an acyclic directed graph in which the leaves are identified with a particular collection X of species, usually extant species. We assume that a phylogenetic X-network describes gene flow, and each vertex corresponds to a biological species. Such phylogenetic networks can be quite complicated. The focus will be on simplifying such networks by recursively merging the ends of particular arcs in a natural manner. We will then apply the results to study simplification into a normal network.
In this paper, an X-network is a directed graph in which the leaves are identified with members of a particular set X. Our notion of an X-network is broad. Vertices can have in-degree and/or out-degree greater than two, so we are not assuming that the networks are binary. An exact definition is given in Sect. 2, along with other basic notions. In Sect. 3 we describe constructions which do not necessarily yield acyclic networks, and then find conditions that ensure that the results are acyclic. Hence in this paper an X-network need not be acyclic, and we will refer to one that is acyclic as an acyclic X-network. In turn, an acyclic X-network that has no vertices with both out-degree and in-degree equal to one is a phylogenetic X-network.
One measure of the complexity of an acyclic X-network is the number of vertices. In terms of bounds on the number of vertices we have the following comparisons between certain families of networks. The definitions of these families are given in Sect. 2. (The result is from Willson (2010) with slight changes since, in Willson (2010), X contained the root as well as the leaves.)
Theorem 1.1
Suppose is an acyclic X-network and , . Assume and there are no vertices of out-degree one.
(a) If N is a rooted tree, then .
(b) If N is normal, then .
(c) If N is regular, then .
(d) If N is tree-child, then v is unbounded.
The fact that for normal networks the number of vertices grows at worst quadratically with n indicates that normal networks are potentially a more tractable network type than regular or tree-child. Also indicative of their tractable nature is the fact (Steel 2016) that the number of hybrid vertices is at most . Yet another indication is that binary normal networks are determined by their caterpillars on three and four leaves (Linz and Semple 2020).
A vertex v of the X-network is visible (Francis et al. 2021; Huson et al. 2010) if there exists a leaf such that every path in N from the root to includes v. In a tree-child network, every vertex is visible (Cardona et al. 2009). Since any normal network is tree-child, every vertex of a normal network is visible, yielding another useful property of normal networks.
As in Pardi and Scornavacca (2015) we take the view that rather than try to deal with networks that are possibly not identifiable, it is desirable to focus instead on networks that are sufficiently tractable to be tested with data. Since every vertex of a normal network is visible, potentially every vertex of a normal network can be so tested, and simplification into uniquely determined normal networks will become useful.
This paper relies on results from Willson (2012). This earlier paper focused on networks that were not necessarily acyclic. This current paper extends the results to ensure that the constructed networks are acyclic. If N is a given network and D is a list of certain arcs in N satisfying a weak condition, this paper in Sect. 3 computes the result of merging the arcs in D as well as certain additional arcs required to ensure that is acyclic. Of interest will be the choice of D so as to obtain ultimately a normal network.
In Sect. 5 we study the result of removing all “redundant” arcs from N. In Sect. 7 we describe ways to find sets D of arcs of N such that is a normal network.
Combining these techniques we describe in Sect. 7 a method, given an X-network N, to construct a normal acyclic X-network which is a phylogenetic X-network depending only on the geometry of N. The construction makes no arbitrary choices such as between different parents or children.
As phylogenetic X-networks grow increasingly complicated, it will become useful to “simplify” them. Simplification into a normal network may make them easier to interpret since normal networks are potentially tractable.
If N and M are X-networks, a connected surjective digraph map (CSD map) is a surjective map with various properties. (See Willson 2012 and Sect. 2 of this paper.) The merging procedure in this paper always yields a CSD map . Results in Willson (2012) show that there is then a “wired lift” of into N, from which properties of can be visualized in N. The wired lift is not a subnetwork of N in the usual sense.
Section 6 of this paper generalizes the notion of “wired lift”. As a result we obtain a wired lift of into N, even though there is usually no CSD map from N to . The wired lift is visualized by drawing the diagram of N with each arc drawn in one of three different ways. Thus we can visualize the resulting normal network by looking at a redrawn diagram of N. The current author thinks such visualizations can provide a tool for better understanding complicated networks.
Section 8 contains two examples of the methods applied to published networks based on biological data. Section 9 contains some discussion.
Francis et al. (2021) describe an elegant procedure, given an acyclic X-network N, to find a related, uniquely determined, normal X-network, which I will denote . Its calculation is based on locating the visible vertices of N. The fast program PhyloSketch (Huson and Steel 2020) is available to compute it. The paper (Francis et al. 2021) assumes that non-root vertices have either in-degree one or out-degree one. Nevertheless, visibility of vertices is well-defined for the X-networks defined in this paper and their procedure applies to any acyclic X-network in our sense. I therefore use to represent the result of this extension of their method. We will occasionally compare with .
Basic Notions
Let be a directed graph, where V is a finite set of vertices and A is the set of arcs. An arc (u, v) is regarded as directed from u to v, so we call u a parent of v and v a child of u. We assume N is a simple graph: there are no loops (u, u); and there is at most one arc (a, b) for . We may sometimes denote or .
If is a directed graph, the corresponding undirected graph Und(N) is the graph where iff either or . Thus, arcs are replaced by edges and are not directed. In this paper, N will always refer to a directed graph unless otherwise specified.
The in-degree of a vertex v in N, denoted indeg(v) or indeg(v; N) , is the number of arcs (u, v), i.e. the number of parents of v. The out-degree of a vertex v, denoted outdeg(v) , is the number of arcs (v, u) , i.e., the number of children of v.
We shall not assume that our directed graphs are binary. Thus a vertex v may have or or both.
A leaf is a vertex with out-degree 0. A root is a vertex with in-degree 0. A vertex v is hybrid or reticulate if . A child u of v is a tree-child if , so (v, u) is the only arc coming into u. A vertex v is trivial if . A trivial vertex merely subdivides an arc, and we will often systematically suppress trivial vertices.
If u and v are vertices, a path or, for emphasis, a directed path from u to v is a sequence of vertices such that for all i, , . The length of the path is the number n of arcs. Note that the arcs are uniquely determined by the vertices in the list since N is a simple graph. No two successive vertices can be the same since there are no loops. We say the path contains arc for . In some situations we may focus on a certain part of the path such as ; we may refer to such a portion as a segment. (For example, in certain circumstances we might modify the path by replacing a segment by a segment .)
The path of length 0 consisting only of is the trivial path at . A path is closed if and . A closed path is a cycle.
Let X be a nonempty finite set. In the applications, X is usually a set of extant biological species. An X-directed graph N is a 4-tuple where (V, A) is a simple directed graph; is a distinguished node; and is a map .
An X-network N is an X-directed graph such that
(N1) V is a finite set (the set of nodes or vertices).
(N2) A (the set of arcs) is a finite set of ordered pairs (u, v) with u and v distinct members of V.
(N3) , called the root, is a node with in-degree 0.
(N4) The map is one-to-one.
(N5) Each leaf is a vertex with in-degree 1 and hence has a unique parent.
(N6) The image of is the set of leaves.
(N7) is the only vertex with in-degree 0.
(N8) For each there is a path from to v.
(N9) For each there is a path from v to some leaf.
An acyclic X-network is an X-network that also satisfies
(N10) N has no cycles.
Following Steel (2016), we define a phylogenetic X-network to be an acyclic X-network that contains no trivial vertices.
These assumptions are not intended to be the minimal possible; rather, they tell the properties we will utilize the most.
If the unique parent of by (N5) will be denoted p(x) or p(x; N). The arc of form for some will be called the x-arc. If x is not specified, any such arc will be called an X-arc.
Suppose N is an X-network. By (N4) and (N6) we may identify X with the set of leaves.
If there is a directed path from u to v then we write . The trivial path shows for all . (N9) says for each there is with . (N8) says that for any , . If the X-network is acyclic, then is a partial order; otherwise it is possible that for distinct vertices u and v we have .
Two X-networks or two X-directed graphs and are X-isomorphic iff there exists a map such that
(i) f is one-to-one and surjective.
(ii) .
(iii) (thus for , with the obvious interpretations).
(iv) iff .
In this situation, N and are essentially the same and we write .
For each , we write cl(v; N) (or cl(v) when N is understood) for . We call it the cluster of v. Note that by (N8). By (N9) for every vertex v, cl(v) is nonempty.
It is immediate that if then .
Let be the set of clusters of N.
Let N be an X-labeled graph. An arc (a, b) is redundant or a short-cut if there exists a path , , that does not contain the arc (a, b). Thus, there is no such that and . Such a path is called a lengthening or a lengthening path of (a, b). Examples will be seen in several figures later, such as Fig. 3. If then the arc cannot be redundant since any such lengthening path would have to satisfy by (N5).
Fig. 3.

A network N containing two lengthening paths of the redundant arc (7,13)
We shall have need of the following result:
Theorem 2.1
Suppose N is an acyclic X-network. Suppose there is a directed path in N from a to b. Then, a directed path in N from a to b of maximal length contains no redundant arc.
Proof
Since the vertex set is finite and there are no cycles, there is an upper bound to the length of a path. Suppose is a directed path P in N of maximal length k. If the result is false, we may assume that for some , is redundant. In that case there is a directed path with . We can then lengthen the path P by replacing the segment by , a contradiction.
There are several types of X-networks which will be of interest:
An acyclic X-network N is tree-child (Cardona et al. 2009) if every vertex that is not a leaf has a tree-child.
An acyclic X-network (possibly not satisfying (N5)) is regular (Baroni et al. 2004) if
(1) the cluster map is one-to-one, where P(X) is the power set of X;
(2) N has no redundant arcs; and
(3) iff .
An acyclic X-network N is normal (Willson 2010) if
(1) N is tree-child; and
(2) N contains no redundant arc.
Sometimes there are small differences in the definition of a network. In Baroni et al. (2004) and Willson (2010) the authors do not assume condition (N5). In Baroni et al. (2004) no vertex can have out-degree one. Particularly simple are normal networks in which no vertex has out-degree one, since these are regular (Willson 2010).
Let N and be acyclic X-networks. One interesting way to compare them is their Robinson-Foulds distance defined as the number of members of Cl(N) and which are present in one but not both (an extension of Robinson and Foulds (1981) for trees). It is symmetric and satisfies the triangle inequality. For certain classes of X-networks is a metric. As an example, for fixed X, it is a metric on the collection of regular X-networks (Baroni et al. 2004).
If N is a normal X-network, let S(N) denote the result of contracting every arc (u, v) such that . For example, suppose in N, for some , p(x) is hybrid and has out-degree one. Then in S(N) the arc in N will have been contracted, and S(N) will not satisfy (N5). Thus in S(N) a leaf can be hybrid. Moreover, any trivial vertices will have been suppressed.
The following result shows that two normal networks and such that are essentially the same.
Lemma 2.2
(1) If N is a normal X-network, then S(N) is a regular X-network and .
(2) Suppose and are normal X-networks and . Then .
Proof
(1) For any X-network N, if (u, v) is an arc and , it is immediate that . Hence . Moreover, if N is normal then S(N) remains normal and hence is a regular network (Willson 2010).
(2) If , then . Hence , so . The result follows from the fact (Baroni et al. 2004) that is a metric on regular X-networks.
The distance has the interesting property that since it is defined for all acyclic X-networks, it can be used to compare how well various networks of various types “approximate” a given network. For example, if N is a complicated acyclic X-network and T and are X-networks that are rooted trees, then T might be a better approximation to N than if .
In this paper we will be “simplifying” an acyclic X-network N into a normal X-network . From this point of view we would prefer that is as small as possible.
Let and be X-directed graphs. A connected surjective digraph (CSD) map (Willson 2012) is a map such that
(C1) is surjective.
(C2) For each arc , either or else . In the latter case we may write . (Thus is a digraph map).
(C3) For each , . More simply, .
(C4) .
(C5) For each there exists u, v in V such that , , and .
(C6) For each , consists of the vertices of a connected subgraph of N. Thus in the undirected graph Und(N) of N, if , the induced subgraph with vertex set W and edge set or is connected.
Note that if in N and is a CSD map, then in .
Let and be X-networks. A CSD-map is leaf-preserving if for each
(C7) is the only vertex in V such that ; thus ; and
(C8) the x-arc is taken to the x-arc ; thus .
If and are CSD maps, then it is proved in Willson (2012) that the composition is also a CSD map. If both maps are leaf-preserving, then it is easy to see that the composition is also leaf-preserving. We will use this fact repeatedly.
Note that in Willson (2012) the term “X-network” refers to what in this paper is an X-directed graph satisfying (N1), (N2), (N3), (N4), (N6), (N7), (N8), and (N9). Thus the networks in Willson (2012) were not required to be acyclic. Of interest in this current paper is the behavior when the final networks are required to be acyclic, as are phylogenetic networks in biology. The CSD maps become more useful to biologists when both N and are required to be acyclic.
Contraction of Arcs
Here is a summary of this fundamental section: The basic tool used in this paper is that of successively contracting arcs in an X-network. Suppose N is an X-network and D is a subset of its arcs. In this section under weak conditions we describe how to construct an X-network by merging just the arcs of D. In general may contain cycles. When D is “strongly closed” we show that is acyclic. Moreover, any D has a unique “strong closure” K(D) which contains D and is strongly closed. Hence, we are able to define as a uniquely determined acyclic X-network that results from contracting the arcs of D and also the other arcs needed for acyclicity. The sections after this one will rely on the iterated use of this construction. The fundamental problem studied in this paper is, roughly, how to choose D so that we can find a normal network from .
Let be an X-network. Suppose is an equivalence relation on V. Let [v] denote the equivalence class of . Let denote the set of equivalence classes of V under .
As in Willson (2012) the quotient digraph is defined by
(1) .
(2) .
(3) For each , .
(4) Let [u] and [v] be in . There is an arc if and only if and there exist and such that .
We will denote this quotient digraph by or . Note that by (4), contains no loops and is a simple graph.
The equivalence relation is connected if each equivalence class [v] is connected in N. An equivalence class [v] is convex if, whenever is a path in N with both and , then for all i, , . We say that is convex if each equivalence class [v] is convex.
The relation is root-preserving if the equivalence class is convex. The relation is leaf-preserving provided
(1) if and , then ; and
(2) for any , . Thus is the only vertex u such that .
Let and be X-networks. A CSD-map is leaf-preserving if for each
(1) is the only vertex in V such that ; thus ; and
(2) the x-arc is taken to the x-arc ; thus .
The following result is similar to Theorem 3.1 of Willson (2012). We outline the proof here again because some of the definitions have slightly changed, for example, to allow for (N5) and leaf-preserving CSD maps.
Theorem 3.1
Let be an X-network. Let be a connected leaf-preserving and root-preserving equivalence relation on V, and let be the quotient digraph . Then
(1) is an X-network.
(2) The natural map given by induces a leaf-preserving CSD map .
Proof
First we prove (1). (N1) and (N2) are immediate. (N4) is immediate since is leaf-preserving. If , then it is immediate that either or else . It follows that if there is a path from a to b in N, then there is a path from to in . Thus (N8) is true.
Each for is a leaf of since otherwise there would be an arc for some [u] and hence an arc from some to . But since is leaf-preserving, and so , contradicting that is a leaf of N.
Let . By (N9) and (N6) for N there is a path in N from v to for some . Then, there is a path from [v] to in , proving (N9) for .
Suppose [v] is a leaf of . If v is not a leaf of N, then there is a path in N from v to for some by (N9) and (N6). Let the path be with . Since [v] is a leaf, it follows since otherwise . If then so because is leaf-preserving. If , since otherwise while is a leaf. It follows by an easy inductive argument that . Thus, every leaf of has the form for some , proving (N6).
Suppose in there is an arc . Then, there exists and such that there is an arc . By (N8) there is a path in N from to and from there to via the arc . This path from to satisfies that , so, since is root-preserving, it follows that each vertex in the path lies in . In particular so , contradicting the arc . This proves (N3).
For (N7), suppose [u] has in-degree 0. By (N8) there is a path from to u, say . If then . Since [u] has in-degree 0, it follows that and so . If then ; since [u] has in-degree 0, it follows , so . Repeating this argument we see by induction that . But , whence , proving (N7).
For (N5), we know from (N6) above that each leaf of is of form . By (N5) for N, has a unique parent in N, denoted p(x), so . By the definition of either or . Since is leaf-preserving, the former is not possible, so . Suppose and . Since because is leaf-preserving, we may assume . This implies by (N5) in N, proving (N5) for .
This completes the proof of (1).
We now prove (2). (C1) is immediate since every vertex of has the form [v]. (C2) is immediate since if and , then by definition . (C3), (C4), and (C5) are immediate from the definition of . For (C6) note that for any , , the latter as a set. Since is connected, [v] is connected as well.
(C7) restates that , which is true since is leaf-preserving. (C8) restates the fact, proved above, that is the unique arc entering This proves is leaf-preserving, completing the proof of (2).
We do not claim that is acyclic, even if N is acyclic.
We will refer to the map as the projection from N to .
Let be an X-network. Let D be a subset of A. Define a relation on V by saying vertices a and b satisfy iff there is a sequence (not necessarily a path) such that for , either or .
Let D be a subset of A. A path is called a D-path provided that for each i, , . Call D root-preserving if whenever is a D-path, then every path from to a is a D-path.
Theorem 3.2
Let be an X-network. Let D be a subset of A. Then
(1) is a connected equivalence relation.
(2) is leaf-preserving if D contains no X-arc.
(3) is root-preserving if D is root-preserving.
Proof
(1) For any vertex v the trivial path at v shows . Suppose because of the path such that for , either or . Then because of the path . Transitivity is immediate. Since D is a subset of A, it is clear that each equivalence class is connected. This proves (1).
Write for the equivalence class of v under .
(2) If D contains no X-arc, then for each , . If then by (N4). This proves (2).
(3) Assume that D is root-preserving. We must show that the equivalence class is convex. Let be a path in N such that and . We must show that each . Choose a path from to ; such exists by (N8). Then is a path from to . Since , each vertex satisfies , proving (3).
Let (or [v] if D is understood) denote the equivalence class of v under . We call the equivalence relation determined by D. It is clearly the smallest equivalence relation (i.e., with the fewest pairs (u, v) satisfying ), such that, for each arc , .
Theorem 3.3
Let be an X-network. Let D be a subset of A. Assume D contains no X-arc and D is root-preserving. Then the quotient digraph is an X-network. Moreover, the projection is a leaf-preserving CSD map.
Proof
By Theorem 3.2, is a connected leaf-preserving and root-preserving equivalence relation. The conclusions follow from Theorem 3.1.
Henceforth if D contains no X-arc and is root-preserving, we will write for . We may call it the quotient X-network of N under D and refer to its formation as contracting or merging the arcs of D. In general, may contain cycles even if N is acyclic.
It is not hard to program a computer to find .
Figure 1a shows an acyclic X-network N. Let . Write for , so and . Note that D contains no X-arc. Note the root and is convex. Then is defined and shown in (b). Note and .
Fig. 1.

a An acyclic X-network N. Let . b . Note that D is not closed
Let be an X-network. A subset of arcs is closed if it satisfies the following: Suppose . Then for every path with length , for each i satisfying , . In particular if (a, b) is an arc and , then .
In Fig. 1, D is not closed since yet .
Theorem 3.4
Let be an X-network. Let be a subset of arcs. If D is closed then each equivalence class of is convex.
Proof
Suppose D is closed. Let . We wish to prove that [v] is convex. Write for . Let be a path in N where and . By transitivity, . By closure, each arc , proving convexity.
Figure 1 shows that the converse of Theorem 3.4 is false. is acyclic and is convex even though D is not closed.
Figure 1 also illustrates the fact that often when an arc (u, v) is merged, the number of vertices drops by one, reducing the resolution. In Fig. 1, D contained 4 arcs, and the number of vertices dropped from 13 in (a) to 9 in (b). On the other hand, if , then and the merging of (7,9) does not further reduce the number of vertices.
Let be an X-network. A subset of arcs is strongly closed if it satisfies the following: Suppose there are vertices in V with such that , , , , , , , , and in addition . Then for k such that each of the arcs lies in K.
Theorem 3.5
Let be an X-network. Let be a subset of arcs. If D is strongly closed, then D is closed. Hence D is root-preserving.
Proof
Suppose D is strongly closed. Suppose there is a sequence (not necessarily a path) such that for , either or . Let be a path from a to b. Then . Trivially, we have , , , , , , , . Since D is strongly closed, for , the arc . Hence D is closed.
By Theorem 3.4 it follows that each equivalence class of is convex. In particular is convex, so D is root-preserving.
Figure 2 shows a set D that is closed but not strongly closed. Indeed, , , , , yet and . Note that contains a cycle since there are both arcs ([8], [9]) and ([9], [8]), where and .
Fig. 2.

A network N. Suppose . Then D is closed but not strongly closed. has both arcs ([8], [9]) and ([9], [8])
The next result shows that, if D is strongly closed, then is acyclic.
Theorem 3.6
Suppose is an X-network. Assume contains no X-arc. If D is strongly closed, then is acyclic.
Proof
Write for . Assume D is strongly closed. By Theorem 3.5, D is root-preserving. By Theorem 3.3, is an X-network. Suppose contains a cycle hence a path with . Let be the projection CSD map. Since is a CSD map, for each arc with there is an arc with and . Thus satisfies , , , , , , , , and . Thus , , , , , , , , and . Since D is strongly closed, each arc . Hence . Thus, all the points on the cycle were the same, a contradiction.
The following theorem shows that from a given D we can construct a uniquely determined strongly closed set K that contains D.
Theorem 3.7
Let be an X-network and be a subset of arcs. There exists a unique such that
(i) ,
(ii) K is strongly closed, and
(iii) for every strongly closed such that , it follows that .
Thus K is the unique minimal strongly closed subset of A containing D.
Proof
We define a sequence of subsets of A. Let and . Let .
If is not strongly closed, there are vertices in V such that , , , , , , , , and in addition , but not every arc lies in . Let be obtained from by including also all the arcs for . If is strongly closed, we are done. Otherwise replace n by and repeat the argument.
By construction . Since A is a finite set, the chain must terminate with some , at which point is strongly closed. Let . Then K contains and is strongly closed. Moreover, any strongly closed set C that contains for any must necessarily also contain by the strong closure property. Hence, C must contain K.
If D is a set of arcs in the X-network N, the strong closure of D is the smallest set K of arcs that contains D and is strongly closed. By Theorem 3.7K is uniquely determined.
The next theorem is the main result of this section.
Theorem 3.8
Let be an X-network. Assume contains no X-arc. Let K(D) be the strong closure of D. Let . Then
(1) is an acyclic X-network.
(2) Each equivalence class [v] of is convex.
(3) The projection is a leaf-preserving CSD map.
Proof
Write for . K(D) exists by Theorem 3.7 and is root-preserving by Theorem 3.5. It contains no X-arc since otherwise D would contain an X-arc. Then (1) follows from Theorem 3.6. Note K(D) is closed by Theorem 3.5. Hence (2) follows from Theorem 3.4. Then (3) follows from Theorem 3.3.
Call the merged acyclic X-network for D. Note that in general, some arcs not in D need to be merged to produce an acyclic network. We nevertheless call D the merging set for .
The strong closure K(D) can be computed in practice using the method of the proof of Theorem 3.7. For hand calculation the following is often easier: Given N and D, since K(D) must be closed by Theorem 3.6, we adjoin to D all arcs in any directed path between two vertices u and v such that . If necessary, repeat the process. Call the resulting set of arcs C. When C cannot be enlarged in this way, we compute . Let be the projection CSD map. If has a cycle, add to C any arcs (u, v) in N such that is an arc on a cycle of .
For Fig. 2, with the indicated D, we find D is closed. We then find , also shown, where [8] represents [6,7,8] and [9] represents [9,10,11]. Let be the CSD projection map. In there is a cycle [8], [9], [8]. Since (11,6) in N satisfies in , we must adjoin (11,6) to D. Since (8,9) in N satisfies in , we must adjoin (8,9) to D. Hence . We see is acyclic, so .
For Fig. 3, suppose . The path 7, 8, 9, 10, 13 shows that we must adjoin (7,8), (8,9), (9,10), (10,13), so now . But now 8 and 10 are vertices satisfying , so the path 8, 11, 12, 10 shows we must adjoin (8,11), (11,12), (12,10) to C. This enlarges C to . This enlarged is closed and is acyclic, so .
In the case where N is a cyclic X-network and , we find is nonempty using this procedure. On the other hand, if N is an acyclic X-network then .
Theorem 3.9
Let be an X-network. Assume contains no X-arc. Let be the projection. An arc is in K(D) iff .
Proof
is defined utilizing the equivalence relation . If it follows that so , whence . Conversely, suppose and . Since K(D) is strongly closed, it is closed by Theorem 3.6. By closure it follows that .
Deriving an SCD Network from N
This section gives a general construction, given an X-network N, to produce a uniquely determined acyclic X-network called in which, for almost all arcs (u, v), the clusters are distinct (i.e., ). The only possible exceptions occur when v is a leaf. For complicated N, can be very much simpler than N. Moreover, .
We need to consider the behavior of clusters under contraction of arcs.
An acyclic X- network is successively cluster-distinct (SCD) if, whenever , then unless for some , and . The exception at the end is intended to make the definition consistent with the condition (N5), which often forces p(x) to have out-degree one and therefore . (In Willson (2012) a network N was called SCD without the exception, but the networks there could fail (N5).)
In this section, we show that it is often easy to simplify a network N greatly so as to make it SCD.
Let and let be a connected leaf-preserving and root-preserving equivalence relation on V. Suppose . A generalized path or g-path in N from a to b is a sequence of vertices such that , , , , , , , , , .
In a g-path one always either utilizes an arc , or else one stays within an equivalence class (but ignoring the direction of any arcs within the equivalence class).
Lemma 4.1
Let be an X-network, let contain no X-arc, and let . Let be the projection.
(1) Suppose there is a path in N from v to w. Then, there is a path in from to .
(2) .
(3) If in N, then .
(4) Suppose there is a path in from [a] to [b]. Then in N there is a g-path from a to b.
Proof
Write for and note .
(1). Let be a path in N. Since is a CSD map the sequence of vertices satisfies that for each i, either or else is an arc of . From this the result is clear.
(2) in . By (1) if it follows there is a path from to , so .
(3) Since , .
(4) Suppose is a path in . For , since , we may choose , such that because is a CSD map. Then is a g-path because for , and , and .
Lemma 4.2
Let be an X-network. Let and b is not a leaf. Then, D is strongly closed and contains no X-arcs.
Proof
It is immediate that D contains no X-arcs since such were specifically excluded. Note that if then there exists a sequence such that for each i, either or . Since there is no X-arc in D, no is a leaf. Hence .
To see that D is strongly closed, suppose there are vertices in V such that , , , , , , , , and in addition . We must show that each of the arcs lies in D. Then . But since , we know so the chain in inclusions must be a chain of equalities. Thus each .
Theorem 4.3
Let be an X-network. Let and b is not a leaf. Form , and let be the projection CSD map. Then
(1) is an acyclic X-network.
(2) For every , .
(3) For every arc , either or else the arc satisfies , or else b is a leaf.
(4) is successively cluster-distinct (SCD).
(5) No vertex of (other than possibly p(x) for some ) has out-degree one.
(6) .
Proof
(1) follows from Theorem 3.8 and Lemma 4.2.
(2) By Lemma 4.1, . Conversely, suppose . Let . There is a path in from to . Since , by Lemma 4.1 there is a g-path in N from v to . Let be this g-path. Note , , for , and for .
Since , it follows . Since , . Since , by the definition of D it follows that , so . Since , it follows . Since it follows , so . We repeat the argument. By induction we find and since , so . This proves (2).
For (3) suppose and b is not a leaf. If , then is an arc of since is a CSD map. Moreover, and by (2). Since , . If , then so , a contradiction. It follows that , whence .
For (4) suppose (u, v) is an arc of and v is not a leaf. There exists such that since is a CSD map, and b is not a leaf since is leaf-preserving. Since is an arc, , and by (3), proving that is SCD.
For (5) write . Suppose a vertex has out-degree one with unique child c. Then . Since is SCD, it follows that for some , and so by (N5).
For (6), note by (2), using the fact that is surjective.
Recall that a vertex v is trivial if . Write the SCD acyclic X-network of Theorem 4.3 as . It is possible that contains a trivial vertex v with unique child c. When this happens, , and, by Theorem 4.3(5), for some , and . Such trivial vertices are a nuisance and it is easy to remove them. Since p(x) is trivial, it has a unique parent u(x). By Theorem 4.3, u(x) satisfies and . Hence, the trivial vertex p(x) can be merged with u(x) and hence removed. We state this as a theorem:
Theorem 4.4
Suppose is an X-network and is the acyclic SCD network of Theorem 4.3. Write p(x) for . Let
Define . Then
(1) is an acyclic SCD X-network.
(2) There is a leaf-preserving CSD map .
(3) contains no trivial vertices.
(4) is a phylogenetic X-network.
(5) satisfies .
Proof
It is immediate that E contains no X-arcs. It is easy to see that E is strongly closed. Hence, is an acyclic X-network, proving part of (1).
There are leaf-preserving CSD maps and , so their composition is a leaf-preserving CSD map from N to , proving (2). We see ; since u(x) did not have out-degree one in , [u(x)] does not have out-degree one in and [u(x)] is not trivial. Thus, has no trivial vertices, proving (3). Note , so is SCD and , completing the proof of (1) and proving (5). Then (4) follows from (1) and (3).
A very similar network was described in Willson (2012) by a different approach.
Example 1
Figure 4 gives an example of an X-network N, and Fig. 5 shows . In this case is a tree, clearly indicating the main features of N and much simpler than N. Vertices in are labeled by a representative vertex of N with the same cluster.
Fig. 4.

A network N that greatly simplifies to (shown in Fig. 5) if and b is not a leaf and trivial vertices are removed
Fig. 5.

The network for the network in Fig. 4
Removing Redundant Arcs from an X-Network
Our goal in this paper is the construction of normal networks which by definition contain no redundant arcs. A crucial step will be removing from an X-network N all its redundant arcs to form . This short section studies this process. Unfortunately, the natural map is not a CSD map unless , causing complications later in this paper.
If N is an X-network, let denote the directed graph obtained from N by removing all redundant arcs. More precisely, if then where is obtained from A by removing all arcs redundant in N.
Theorem 5.1
Suppose is an acyclic X-network. Then is an acyclic X-network.
The proof is straight-forward and is omitted.
If is a CSD map, then f may not induce a CSD map from to . Figure 6 shows an X-network M. Let and , also shown. M has no redundant arcs, while N has two redundant arcs, indicated by dashes. The projection map is a CSD map. But , is N without the dashed arcs, and is not a CSD map since (6,10) is an arc in but and there is no such arc in . Indeed, it is easy to see that there is no CSD map .
Fig. 6.

There is no CSD map from to nor from N to
In the same figure, one sees easily that there is no CSD map from N to .
Theorem 5.2
Suppose N is an acyclic X-network. For each . Moreover, and .
Proof
Suppose . There is a path from v to in N. By Theorem 2.1 a path from v to in N of maximal length contains no redundant arc, hence lies in . It follows that . Conversely, suppose . There is a path in from v to , so the same path is a path in N from v to x, proving . Hence . The rest follows easily.
Generalized Wired Lifts
Let and be X-networks. Suppose is a CSD map. In Willson (2012) a wired lift of into N is described. It provides a method for visualizing within N. In this section we modify and generalize the notion so that does not quite need to be a CSD map but is only a connected map. This will let us obtain wired lifts from a process that includes both CSD maps and removing redundant arcs.
Let and be X-networks, and let . We say f is a connected map if
(K1) is surjective,
(K2) ,
(K3) for all ,
(K4) for each there exists such that and ,
(K5) for each the set is connected.
It is immediate that a CSD map is connected, but a connected map need not be CSD. Suppose and are CSD maps. Since the composition of and is defined. We shall see in Theorem 6.4 that f is a connected map while in general it is not a CSD map.
Let be a connected map, and let denote the set of subsets of V. A (generalized) wired lift of f (or of into N) is a pair where is the map given by and where satisfies the following two conditions:
(W1) For each arc , and . Denote .
(W2) For every arc , there exists such that . We will say the arc (u, v) represents or is a pre-arc of .
Call the members of the representative arcs since each represents an arc of .
Note that the collection of all for is a partition of V. Thus for all , ; if are in , then ; and where the union is over all .
Suppose is a CSD map. A backwards map g is a map which satisfies that, for all , if then . Thus for all . Since f is a CSD map, for each such a (u, v) exists, and provides a unique choice of a pre-arc of .
There are several situations that give rise to wired lifts. We describe three of them in the next theorem. A fourth will be given in Theorem 6.4.
Theorem 6.1
Let and be X-networks, and let . Suppose is a CSD map.
(1) Let . Then is a wired lift of .
(2) Suppose is a backwards map. Let . Then is a wired lift of .
(3) Let be the result of removing all redundant arcs from . Let . Then is a wired lift of .
Proof
(1) and (2) are immediate from the definitions since a CSD map is connected. For (3), note that , so the map f can be regarded as a map . This map will not be a CSD map if has any redundant arcs, but it is a connected map. Then (3) follows.
Given a connected map , a wired lift can be visualized using a diagram of N. An example is shown below in Fig. 7. The diagram is exactly the diagram of N except that each arc may be wide solid, thin solid, or thin dashed. Suppose . For every arc such that draw (u, v) a wide solid arrow if and a thin dashed arrow if . For each arc such that draw the arc as a thin solid arrow. (If color is available, one might instead color red the arcs satisfying for vividness.) Thin solid arcs make the sets apparent in N and each vertex of corresponds to a connected component of the thin solid arcs. Each arc has a corresponding wide solid arc , justifying the word “lift”. The “wires” are the thin solid arcs. Paths in can be recognized in the wired lift as g-paths using allowed steps, which we will now describe.
Fig. 7.

An example of a wired lift. Thin solid arcs indicate identification of the vertices and can be followed in either direction. Wide solid arcs must be followed in their direction. Dashed arcs cannot be used in g-paths
Let and be X-networks, with a connected map, and suppose is a wired lift of f. If u and v are in V, we say there is an allowed step from u to v if either , or ( and ), or ( and ). Note that the step either follows a wide solid arc in forwards or else follows a thin solid arc, possibly forwards, possibly backwards. Dashed arcs cannot be used.
Theorem 6.2
Let and be X-networks, with a connected map, and suppose is a wired lift of f. Let be the equivalence relation on V defined by if and only if . Suppose . The following are equivalent:
(1) In N there is a sequence of vertices such that , , , , , , , , , .
(2) There is a sequence of vertices in N such that, for i such that , there is an allowed step from to .
Proof
Suppose there is a sequence of type (1). If then since and is connected, there is a sequence such that each lies in and, for , either or . Thus, there is an allowed step from to . Hence given a sequence of type (1), there is a sequence of type (2).
Conversely, given a sequence of type (2), if the allowed step from to satisfies then . If but and then we may replace by simply . Thus, there is a sequence of type (1).
We will call a sequence of either type a generalized path or g-path from a to b in . For specification they may be called type (1) or type (2).
Theorem 6.3
Let and be X-networks. Let be a connected map, and let be a wired lift of f.
(1) Suppose is a g-path in N (of type (2)). Then , , yields a path in , possibly by suppressing multiple successive copies of the same vertex.
(2) Suppose is a path in , , and . Then, there is a g-path in N from a to b.
Proof
If there is an allowed step from to , either , in which case is an arc of by (W1), or else , proving (1).
Conversely suppose is a path in , , and . For , since , by (W2) there exists such that and . Note , , and . Hence is a g-path (of type (1)), proving (2).
In Willson (2012) there was a backwards map and, instead of all arcs in , only the arcs in some spanning tree in containing each vertex in which lies on an arc in were included. But this feature is not essential.
Example 2
Figure 7 shows a wired lift that arises from a connected map . All the arcs and vertices are from N; thus if we ignore thickness and dashing and include all arcs with their indicated directions, whether thin, wide, or dashed, the diagram exhibits N. A vertex of with more than one preimage may be identified with a connected component of thin solid arcs. It is also convenient to identify each vertex by the members of inside square brackets. One sees immediately that the vertex f(10) of satisfies (from the component of thin arcs). We shall designate it [8, 10, 11, 16] or less formally [10], the equivalence class of 10. Similarly f(15) has inverse image and is written [15, 20]. Other vertices include [9, 18] and [17, 21]. Still other vertices have singleton inverse images such as [13] with , but the brackets may be omitted.
The dashed arcs are not permitted on g-paths, and wide solid arcs must be followed in their direction. Thin solid arcs can be followed in either direction. Thus, 16, 10, 8, 9, 15, 1 is a g-path showing that has a path from f(16) to f(1). The corresponding path in is formally written [8, 10, 11, 16], [9, 18], [15], [1] or informally as [16],[9],15,1. There is clearly no path in N from 16 to 1. Similarly the g-path 21,17,4 shows that in there is a path from [21] to [4]. Thus .
Suppose , , and are X-networks. Suppose and are CSD maps, where denotes a simplification of . Let be the composition . In general is not a CSD map since there is no CSD map from to . The following result shows that f is nevertheless a connected map and there is a wired lift of f. Consequently, we are able to visualize simplifications of .
Theorem 6.4
Suppose for , we have is an X-network. Write where is the set of arcs in which are not redundant in . Suppose and are CSD maps. Let be the composition of the vertex maps, . Thus, for . Define :
(i) and and
(ii) and . Then
(1) is a connected map.
(2) is a wired lift of f.
Proof
To see (1), note that f is well defined since and have the same vertex set . (K1), (K2), and (K3) are immediate. To see (K4) assume . Since is CSD, there exists such that and . But and is CSD. Hence there exists such that and . Thus and , proving (K4). The argument for (K5) is the same as that of Theorem 3.3 in Willson (2012), used to prove that the composition of CSD maps is CSD. This completes the proof of (1).
For (2), to prove (W1) suppose . By (i) , so since is a CSD map either or . The latter applies by (ii), proving (W1).
For (W2), given any arc there exists such that since is a CSD map. Then since is a CSD map, there exists such that . Hence for every there exists such that . This proves (W2) and hence (2).
In the situation of Theorem 6.4, to draw the wired lift of f on the diagram of , it follows that, for every arc , we draw the arc in one of three ways:
(1) a thin solid arc (u, v) if ,
(2) a thin dashed arc (u, v) if and is a redundant arc in ,
(3) a wide solid arc (u, v) if and is an arc in that is not redundant.
Deriving a Normal Network from an X-Network
This section concerns methods, given an X-network N, to produce an acyclic X-network with desirable properties. Often, an important step may be to remove redundant arcs, thus obtaining .
In particular, we shall want to find D such that is normal. Call a network N pre-normal if is normal. Thus, we seek D such that is pre-normal.
Let N be an X-network. A vertex v of N is a pre-normal obstacle or (more simply) an obstacle if (1) v is not a leaf, and (2) every child of the vertex v in is hybrid. Thus v may be regarded as an “obstacle” to being tree-child. Since contains no redundant arcs, this is an “obstacle" to being normal, or equivalently to N being pre-normal.
We must ignore redundant arcs when deciding our strategies concerning which arcs to merge. To make these decisions we need to have a notion of in-degree and out-degree that does not count redundant arcs.
Suppose v is a vertex of an X-network N. The non-redundant in-degree of v, denoted nrindeg(v), is the number of non-redundant arcs (p, v); hence it is the number of parents of v by non-redundant arcs. If then by Theorem 2.1. Clearly . The non-redundant out-degree of v, denoted nroutdeg(v), is the number of non-redundant arcs (v, c), hence the number of children of v by non-redundant arcs. If v is not a leaf, then by Theorem 2.1 it has a non-redundant child, whence .
A vertex v of N is nonr-hybrid if . A vertex c is a nonr-child of v if (v, c) is a non-redundant arc; we also say v is a nonr-parent of c. A nonr-child c of v is a nonr-tree-child of v if . A path is a nonr-path if no arc is redundant, for .
It is immediate that v is a pre-normal obstacle iff (1) v is not a leaf, and (2) every nonr-child of v is nonr-hybrid.
Figure 8 shows an acyclic X-network with redundant arcs. Note that 8 is an obstacle since both its children 12 and 13 are nonr-hybrids. But 9 is not an obstacle since the only nonr-parent of 10 is 9 and .
Fig. 8.

An X-network N with redundant arcs (6,8) and (6,10). Vertex 8 is an obstacle but 9 is not
An X-network N is obstacle-free if it contains no pre-normal obstacle.
Theorem 7.1
Suppose N is an acyclic X-network that is obstacle-free. Then is a normal X-network and N is pre-normal.
Proof
By hypothesis, for every vertex v that is not a leaf, there is a non-redundant arc (v, c) with . It follows that in , c is a tree-child of v. Since has no redundant arcs, it follows that is normal and N is pre-normal.
Theorem 7.1 further justifies the use of the term “pre-normal obstacle”. It is easy to see that a tree-child X-network is always pre-normal, but a pre-normal network need not be tree-child.
Theorem 7.1 suggests our strategy for normalization: Given an arbitrary X-network N, when we seek a normal network M, we know by Lemma 2.2 that S(M) will be regular; by (1) in the definition of regular network, S(M) is SCD. We are therefore seeking a network that is very close to being an SCD network, and it is plausible to start with the very general SCD network . We then recursively remove obstacles until there are no obstacles remaining. Next we remove redundant arcs to obtain a normal network. If we seek to obtain a uniquely determined normalization we are careful not to make arbitrary choices about which arcs to merge.
Now we show that there are different types of pre-normal obstacles. Let N be an acyclic X-network. Suppose c is an obstacle. An allowable 1-fold parent chain of c is a path such that is not redundant and has a nonr-tree-child (so , whence necessarily every other parent of d is via a redundant arc). An obstacle c is of type 1 if c has an allowable 1-fold parent chain. If c has type 1 and is an allowable 1-fold parent chain, let .
Suppose c is an obstacle and is an integer. An allowable k-fold parent chain for c is a nonr-path such that has a nonr-tree-child d distinct from . An obstacle c is of type k if
(a) c is not of type ; and
(b) c has an allowable k-fold parent chain.
In this situation, for this k-fold parent chain write
.
Theorem 7.2
Let N be an acyclic X-network. Then every pre-normal obstacle c has a unique type.
Proof
It is clear that the type, if it exists, is unique.
Consider a path from to c which has maximal length k. Write this path as . By Theorem 2.1 this is a nonr-path. If has a nonr-child d other than , then this path is an allowable k-fold parent chain of c, so c has type at most k. If, instead, is the only nonr-child of , then every other child q of satisfies that is redundant. There is a lengthening nonr-path by Theorem 2.1, whence is a nonr-child of ; since is the only such nonr-child, it follows . Indeed, every nonr-path from to any vertex other than or must begin with . If has a nonr-child d other , then is an allowable -fold parent chain and c has type . Otherwise is the only nonr-child of . Thus any nonr-path from to a vertex other than must begin . We repeat the argument. If at any stage we have r such that has a nonr-child , then is an allowable -fold parent chain. Otherwise every nonr-path from to a vertex other than must start with .
If no such occurs, then we find that is a nonr-path and every nonr-path from to any vertex other than must begin with . But c is not a leaf hence must have a nonr-child e. Since c is an obstacle, so e has a nonr-parent . Every nonr-path from to q must start , so there is a nonr-path from c to q, hence a nonr-path from c to q to e, showing that (c, e) is redundant, a contradiction. Hence some such must occur, and c has an allowable -fold parent chain.
The following result shows a simple way to remove a type 1 obstacle:
Lemma 7.3
Suppose N is an acyclic X-network and c is a type 1 obstacle with allowable 1-fold parent chain p, c, where p has nonr-tree-child . Let . Form and let be the projection. Then is not an obstacle in .
Proof
Since c is an obstacle, it is not a leaf, so D contains no X-arc. Moreover, D is strongly closed since (p, c) is not redundant, and is an acyclic X-network. Note and in there is an arc ([p, c], d). If q is any parent of d in N other than p, then (q, d) is redundant since d is a nonr-tree-child of p. Hence by Theorem 2.1 it has a lengthening of maximal length, ending with a non-redundant arc into d. Thus the lengthening must include the nonr-parent p of d and there is a path in N with non-redundant arcs from q to p to d. Since is a CSD map, there is a path in from to to , showing that (q, d) is redundant in , so .
The result above often generalizes to obstacles of type k. The next result assumes for simplicity that D is strongly closed.
Lemma 7.4
Let N be an acyclic X-network with pre-normal obstacle c of type k. Suppose is an allowable k-fold parent chain, where has nonr-tree-child . Let . Assume D is strongly closed. Form and let be the projection. Then has nonr-child d and , so is not an obstacle in .
Proof
Note that D contains no X-arcs since c is not a leaf. By Lemma 7.3, when we identify and , d becomes a nonr-tree-child of . When we next identify with , d becomes a nonr-tree-child of . This continues until we conclude that has the nonr-tree-child d.
The following lemma shows that, often, once an obstacle is removed, it does not reappear when subsequent arcs are merged.
Lemma 7.5
Suppose (p, c) is a non-redundant arc in the acyclic X-network N and is obtained by identifying p and c. Let be the projection. Suppose (a, b) is a non-redundant arc in N and b is a nonr-tree-child of a (so ). Assume , . Then is a non-redundant arc of and has nonr-indegree 1.
Proof
must be an arc unless . But in the only identification is . If , this would contradict that , , so is an arc in . If is a non-redundant arc in , then since a is the only nonr-parent of b in N. Hence .
It will be useful to remove trivial vertices which may have been created in the construction process. Suppose is an X-network. Let . Define . Note that v will have unique parent u and also a unique child since v is trivial. The next result shows that has desirable properties.
Theorem 7.6
Let be an acyclic X-network. Then is an acyclic X-network. Moreover
(1) contains no trivial vertices and hence is a phylogenetic X-network.
(2) There is a leaf-preserving CSD map .
(3) .
(4) If N is normal, then is normal.
Proof
Note that E contains no X-arc because a leaf does not have out-degree one. Since E is clearly strongly closed, is an acyclic X-network. (1) and (2) follow as in Theorem 4.4. (3) is obvious since if and u is trivial, then .
For (4) we first show that since N is tree-child, must also be tree-child. It suffices to prove this in the case where is obtained from N by removing one trivial vertex t with parent q and child c. In every non-leaf vertex except q obviously still has a tree-child, the same one as in N. We must show that q has a tree-child in . But t has a tree-child in N which must be c so c has no other nonr-parent than t in N. Hence q in has child c which has no other nonr-parent and is therefore a tree-child. This proves is tree-child.
For (4) we must also prove that has no redundant arc. Again we may assume that is obtained from N by the removal of a single trivial vertex t with parent q and child c. The only possible redundant arc in is the new arc (q, c). If it is redundant, there is a path in from q to c other than the arc, hence a path in N from q to c not through t. Such a path of maximal length by the proof of Theorem 2.1 contains no redundant arc, so c has a nonr-parent besides t. This contradicts that N was normal, since t in N has no tree-child.
We now show how, given an X-network N, to compute a uniquely determined normal X-network. We first compute a uniquely determined pre-normal acyclic X-network , which we call the pre-normalization of N. The computation uses the procedure PRENORM described below. Briefly, if N is not already a pre-normal acyclic X-network, we compute . If contains no obstacles then . Otherwise, for each obstacle c we compute its type k and find all the allowable k-fold parent chains for c. Let D(c) be the union of for all such allowable chains for c. Let D be the union of the D(c) for all the obstacles c. We then compute . If this has no obstacles then . If not, we repeat the process.
Here is a more detailed description of the computation of :
The network output by the procedure will be denoted . The integer output will be called the height of and denoted r. Thus . Note that if N is acyclic and pre-normal; otherwise, r is 1 more than the number of times the procedure passes through steps 5 , 6, 7. The height r is a crude indicator of the complexity of the calculation. The examples in this paper satisfy , including those examples with real data. The author has worked examples with .
The next theorem shows that the procedure works.
Theorem 7.7
Let be an X-network.
Apply procedure PRENORM to N. Then
(1) The procedure terminates and outputs an acyclic X-network which is pre-normal.
(2) depends only on the geometry of N.
(3) The projection is a leaf-preserving CSD map.
(4) Let . Then is a wired lift of into N that contains no dashed arcs.
Proof
If the procedure returns N and , then (1) is immediate. Otherwise, by Theorem 4.4, is an acyclic X-network which contains no trivial vertices. If it contains no obstacles, it is pre-normal by Theorem 7.1, , and (1) follows. Otherwise, it contains at least one obstacle c. For each obstacle c of type k and each allowable k-fold parent chain for c the set contains no X-arc since the obstacle c cannot be a leaf. Hence D(c) contains no X-arc, so D contains no X-arc. By Theorem 3.8, is an acyclic X-network, which we expect will be pre-normal by Lemmas 7.3 and 7.4. If contains no obstacles, it is pre-normal by Theorem 7.1, and , so (1) is true. Otherwise it contains an obstacle and the procedure returns to step 5.
Each time step 5 is utilized on , the corresponding set D is nonempty, so more arcs are contracted and contains fewer vertices and fewer arcs. Since N is finite, the procedure must terminate. This proves (1).
The procedure never involves a choice, such as favoring some obstacles or some allowable parent chains over others. Hence (2) is true.
We wish to see (3). If then and is the identity map. Otherwise, let be the projection from Theorem 4.4. If then and proves (3). Suppose . For let be the projection. Then is the composition and proves (3), since it is a composition of leaf-preserving CSD maps.
For (4), since is a CSD map, is a wired lift by Theorem 6.1(1). It has no dashed arcs since redundant arcs are allowed in .
Since is a pre-normal acyclic X-network, we remove the redundant arcs to form , which will be normal. It may, however, contain trivial vertices, so we define , which will be normal and contain no trivial vertices. We call the normalization of N. The next theorem records its basic properties.
Theorem 7.8
Suppose N is an X-network. Let be the projection CSD map. Then
(1) is a normal acyclic X-network containing no trivial vertices, hence a phylogenetic X-network.
(2) The definition of depends only on the geometry of N.
(3) The projection is a leaf-preserving CSD map.
(4) The composition as maps of vertices from N to is a connected map.
(5) There is a wired lift of into N which may contain dashed arcs.
(6) .
Proof
For (1) note is an acyclic X-network by Theorem 5.1 which is normal by Theorem 7.1. Hence (1) follows from Theorem 7.6. Moreover, (2) is obvious since depends only on the geometry of N. Then (3) follows from Theorem 7.6. Next (4) and (5) follow from Theorem 6.4. Finally (6) follows from Theorems 5.2 and 7.6.
Remark
There is an interesting variant of the procedure PRENORM. Define the procedure VARIANT PRENORM to be exactly like PRENORM except that step (5c) is replaced by
(5) Select exactly one allowable k-fold parent chain for c, and let .
We will abbreviate the name of the procedure to VARIANT. Thus while PRENORM uses all allowable k-fold parent chains for each obstacle c of type k, VARIANT would use just one allowable parent chain for each such obstacle. The following theorem shows that the output of VARIANT has interesting properties. The proof is like those of Theorems 7.7 and 7.8 and is omitted.
Theorem 7.9
Let be an X-network.
Apply procedure VARIANT PRENORM to N. Then
(1) The procedure terminates and outputs an acyclic X-network which is pre-normal.
(2) The projection is a leaf-preserving CSD map.
(3) There is a wired lift of into N that contains no dashed arcs.
Let Then
(4) is a normal acyclic X-network containing no trivial vertices, hence a phylogenetic X-network.
(5) The projection is a leaf-preserving CSD map.
(6) The composition as maps of vertices is a connected map.
(7) There is a wired lift of into N.
The output of VARIANT is called a variant prenormalization of N and is denoted or for simplicity . Here C records the particular choice of allowable k-fold parent chain for c made in step (5) each time there was more than one allowable k-fold parent chain for c to choose among. Similarly is called a variant normalization of N and is denoted or . Note that and will not necessarily depend only on the geometry of N; instead, the result will depend on all the choices C of the parent chains when there is more than one possible. In certain circumstances, however, it might be preferable. For example, the researcher might have additional information suggesting that the relevant gene flow is much more likely along one parent chain than another, in which case the least probable parent chain should be selected for merging.
It frequently happens that has more vertices and hence higher resolution than . The following lemma indicates one source of this inequality. To obtain a short proof, we assume very strong hypotheses.
Lemma 7.10
Suppose is an X-network. Assume when that in step 6 of PRENORM the merging set of arcs in is D, while in step 6 of VARIANT the merging set is E. Then
(1) .
(2) If for some obstacle c of type k there is more than one allowable k-fold parent chain, then .
(3) If u and v are vertices of and , then .
(4) .
(5) If u and v are vertices of and , then .
(6) If , then has strictly more vertices than .
(7) If and for both PRENORM and VARIANT, then has strictly more vertices than .
Proof
(1) is true since for a given obstacle c of type k, E contains exactly one allowable k-fold parent chain for c, while D contains all allowable k-fold parent chains for c. Then (2) is obvious and (3) follows from (1). For (4), go through the proof of Theorem 3.7 and note that every time an arc is added to make K(E), it necessarily must be added also to K(D). Then (5) follows from (4). For (6), the vertices of are the equivalence classes of under , while the vertices of are the equivalence classes of under . Since , (6) follows. Now (7) is clear.
A calculation of is illustrated below in Example 7.
Other variants making other choices of parent chains are possible as well.
Example 3
This example continues Example 1 in Sect. 4. Consider the network N in Fig. 4. The first step is to compute , shown in Fig. 5. Since is a tree, it has no obstacle, the height and . Since there are no redundant arcs, we find . Moreover, . It is easy to see in this example that is the star tree consisting of the root 7 together with an arc from 7 to each of the six leaves. Moreover (and therefore N has four more distinct clusters than the tree . Hence . In this example, outperforms .
Example 4
Consider N in Fig. 9. Let , so since N is already SCD. has two obstacles: 8 and 11. Obstacle 8 is type 1 with one allowable parent chain 6,8 and . Obstacle 11 has type 1 with allowable chain 7,11 and . (Note that 8,11 is not an allowable chain since 12 is hybrid.) Hence .
Fig. 9.

An SCD X-network N
D is strongly closed, and we find , shown in Fig. 10 with redundant arcs ([7,11],13) and ([6,8],12). There are no obstacles, so the height and . We remove the redundant arcs to find , which contains the trivial vertex 13. Then we compute to remove the trivial vertex by merging the arc (10,13) as in Theorem 7.6 to yield , shown in Fig. 11.
Fig. 10.

for N in Fig. 9, where . Note that and the normalization is obtained by removing the redundant arcs ([7,11],13) and ([6,8],12) and then the resulting trivial vertex 13
Fig. 11.

for N in Fig. 9, where . Note the removal of the trivial vertex 13 from Fig. 10
The projection map has and . For other vertices v, . Without the redundant arc ([7,11],13), would not be a CSD map since N contains the arc (11,13).
Let be the projection and be the composition of the vertex maps. Note f is a connected map. Figure 12 shows Fig. 9 redrawn to exhibit the wired lift of into N. The thin solid arcs show immediately that there were identifications , , and . The dashed arcs correspond to the redundant arcs ([7,11],13) and ([6,8],12) in Fig. 10, which are not present in . We see, for example, there is a unique g-path from 8 to 2, given by 8,6,7,10,13,2. Hence in there is a path from to .
Fig. 12.

Figure 9 redrawn to exhibit the wired lift of into N. Thin solid arcs are merged in . Dashed arcs give rise to redundant arcs of and are not included in . Wide solid arcs are in
It is easy to compute that using Theorems 7.6 and 5.2. We identify the two relevant clusters by noticing that N contains vertices with clusters and that are not in , while every cluster of and hence of is also a cluster of N. In this example, .
Example 5
Figure 13 shows the wired lift of into an X-network N with a single obstacle 16 of type 3. N is seen by changing all thin solid or dashed arcs to wide solid. N is easily verified to be already SCD, so . One 3-fold parent chain in N is 11, 14, 17, 16; another is 13, 14, 17, 16. The chain 13,14,16 is not an allowable parent chain because (14,16) is redundant. To find , use , .
Fig. 13.

The wired lift of for an X-network N with a single obstacle 16 of type 3. N is seen if all arcs are instead made wide solid. The wired lift of is seen if the arcs (9,11), (16,22), and (19,22) are all instead made wide solid
is shown in Fig. 14. Note has no obstacles, hence is pre-normal. Thus and the height . The point [11,13,14,16,17] is labeled 16. If is the projection, then ,. contains redundant arcs (9, 16) and (16, 22), arising from the arcs (9,11) and (16,22) in N. When these arcs are removed from and the resulting trivial vertex 22 is removed, we obtain .
Fig. 14.

for the N of Fig. 13. The vertex [11, 13, 14, 16, 17] is labeled 16. The arcs (9,16) and (16,22) are redundant. is found by removing the two redundant arcs and then using to remove the resulting trivial vertex 22 by merging (19,22)
As maps of vertex sets and can be composed to yield the resulting connected map ; it is not a CSD map because the vertex map from to is not CSD. The wired lift of is shown in Fig. 13. The arcs (9,11) and (16,22) are dashed because they are pre-arcs to the redundant arcs of , which are not arcs of , hence are not in of the wired lift, using Theorem 6.3. Arcs (u, v) such that are thin solid. Hence arcs in the induced subgraph of , are thin solid. The arc (19, 22) is thin solid because it was merged to remove the trivial vertex 22. Note the g-path 16,17,19,22,4 from 16 to 4; but 16,22,4 is not a g-path.
The wired lift of would be seen in Fig. 13 if the arcs (9,11), (16,22), and (19,22) are made wide solid. The first two would be wide since in both (9,16) and (16,22) are arcs. Arc (19,22) would be wide since 22 is not a trivial vertex in and is not removed.
Examples with Real Data
This section contains two examples from real biological data.
Example 6
Glémin et al. (2019) study pervasive hybridizations of wheat relatives. Their Fig. 5 shows their proposed scenario for the history of diploid Aegilops/Triticum species. Let N be their graph. A wired lift of is shown in our Fig. 15. The network N is seen if each arc in Fig. 15 is made wide solid. In we find only a single pre-normal obstacle 21 of type 1. The height of the computation is . When we compute , there is a single redundant arc. contains 23 vertices and 29 arcs; it is thus simpler than N, which contains 31 vertices (of which eight are hybrid), and 38 arcs. We find . It is interesting that our dashed arc (21,22) is also dashed in Glémin et al. (2019) to indicate a less likely event. It turns out that in this case .
Fig. 15.
The wired lift of for the diploid Aegilops/Triticum species in Glémin et al. (2019). Wide arcs are in . Thin solid arcs represent identifications; thus , , , , , and . The dashed arc (21,22) corresponds to a redundant arc in and may not be used for g-paths. If all arcs are instead wide solid, we obtain N
Example 7
Marcussen et al. (2015) exhibit a network N for the angiosperm genus Viola in their Fig. 4. Our methods find has 2 obstacles. One obstacle is type 1 with two allowable 1-fold parent chains. The other is type 2 with one allowable 2-fold parent chain. Thus for computing , D contains 4 arcs. has no obstacles, so and the height is two. A wired lift of is shown in Fig. 16. We see that has 29 vertices (equivalence classes under thin solid arcs). It turns out to have 31 arcs, while the wired lift has 34 wide solid arcs. If is the connected map, more than one wide solid arc (u, v) can map to the same arc of . Thus (14,10) and (17,18) map to the same arc in , as do (28,37) and (28,36), and also (11,12) and (11, 21).
Fig. 16.
The wired lift of for the Viola data N in Marcussen et al. (2015). The entire vertical line labeled 42 represents one vertex with out-degree 7
is drawn in Francis et al. (2021) in their Fig. 3. We compute that while . Thus is a better approximation to N than but lacks a wired lift.
If we use instead VARIANT PRENORM, there are two possible normal networks that can result, depending on the choice of the 1-fold parent chain for the type 1 obstacle [13,14,15,17] in . One allowed parent chain is 12, [13,14,15,17]; the other is [21,22], [13,14,15,17]. Thus for computing , the set D has 3 rather than 4 arcs, leading to one more vertex in compared to . For both networks , , so they both approximate N as well as does but depend on the choice of 1-fold parent chain.
Both such have wired lifts into N by Theorem 7.9. The wired lift of each is very similar to that for with one additional wide solid arc replacing a thin solid arc. For one the wired lift is given by making the arc (12,13) in Fig. 16 be wide solid; for the other , the only change is that the arc (22,13) in Fig. 16 is wide solid.
Both variant normalizations satisfy , so neither agrees with .
Further comments concerning this example are given in Sect. 9.
Discussion
Comparison of and Let be an X-network. It is interesting to contrast with , defined in Francis et al. (2021). Both are uniquely determined normal phylogenetic X-networks depending only on the geometry of N. Both allow vertices of N to have in-degree greater than 2 or out-degree greater than 2, and both apply quite generally.
is fast to compute using Huson and Steel (2020) and very elegant. It works by locating the “visible” vertices of N. A vertex v is visible if there exists such that every path from to contains v. This set of visible vertices forms the initial vertex set of . Hence each initial vertex of can be highlighted in the diagram for N, as is done in Francis et al. (2021). At the end, trivial initial vertices of are suppressed. In a tangled network like our Fig. 4, the only visible vertices are the root and the leaves, since there is a great multiplicity of possible paths from the root to a given leaf. In such a situation, does not perform well. For less tangled networks such as Example 7 the computation works well. Perhaps it would be useful in general to compute .
The arcs of are harder to interpret than the vertices. In there is an arc (u, v), where u and v are distinct visible vertices of N, precisely when in N and there is no third visible vertex w such that and . Thus, for example, two different arcs and emerging from the same u could be present because of directed paths in N from u to and from u to such that the paths have significant overlap, invisible in .
Consider again Fig. 16 where N is for the Viola genus of Marcussen et al. (2015). The diagram of N is exactly Fig. 16 in which all arcs are drawn wide solid. The vertices of , before suppression of the trivial vertices, are the 43 visible vertices out of the 61 vertices of N. None of the vertices 13, 14, 15, 17 of N (those relevant to the obstacle [13,14,15,17] crucial to our discussion of the VARIANT calculation) are visible and hence they do not appear in . has arc (22,18) because (a) 22 and 18 are visible, (b) 22, 13, 14, 10, 8, 18 is a path in N, and (c) there is no other path from 22 to 18 containing a third visible vertex. Similarly has arc (12,20) because of the path 12, 13, 14, 15, 17, 20, and it has arc (12,18) because of the path 12, 13, 14, 10, 8, 18 in N. Thus these three distinct arcs in arise from overlapping paths in N involving 13 and 14.
In contrast, the arcs of are easy to interpret. The wide solid arcs highlight the arcs N that appear in ; the thin dashed arcs indicate redundant arcs in and must be avoided in g-paths; the thin solid arcs tell what arcs must be merged to obtain the normal network . The use of g-paths lets us understand from just the wired lift.
Software The author has written software using Xcode which implements the calculation of somewhat interactively. It was essential for the examples based on real data. It computes , , , and and locates all obstacles. It finds all allowable 1-fold and 2-fold parent chains, but obstacles of type must be handled interactively. The software is far from ready for general use, but it shows that the calculations can be automated.
Future work One can ask whether there are other classes of networks besides normal networks for which a similar construction could be used to simplify a network N into one of this other class. Suppose, given an X-network N, we sought a tree-child X-network C. Since a tree child network may contain redundant arcs, we should like a construction that depends only on the geometry of N and yields a CSD map . At first glance we might think that we could use the CSD map ; but while is tree-child, need not be tree-child. The author is currently looking at such problems for tree-child and some other classes of networks.
Acknowledgements
The author is indebted to the anonymous referees for numerous suggestions to improve the exposition and for catching some errors, gaps, and omissions. He also thanks Thomas Marcussen in Oslo for helpful comments and suggesting Glémin et al. (2019).
Funding
None.
Declarations
Conflict of interest
None
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Baroni M, Semple C, Steel M. A framework for representing reticulate evolution. Ann Comb. 2004;8:391–408. doi: 10.1007/s00026-004-0228-0. [DOI] [Google Scholar]
- Cardona G, Rosselló F, Valiente G. Comparison of tree-child phylogenetic networks. IEEE/ACM Trans Comput Biol Bioinf. 2009;6(4):552–569. doi: 10.1109/TCBB.2007.70270. [DOI] [PubMed] [Google Scholar]
- Delwiche CF, Palmer JD. Rampant horizontal transfer and duplication of Rubisco genes in Eubacteria and plastids. Mol Biol Evol. 1996;13(6):873–882. doi: 10.1093/oxfordjournals.molbev.a025647. [DOI] [PubMed] [Google Scholar]
- Doolittle WF, Bapteste E (2007) Pattern pluralism and the tree of life hypothesis. In: Proceedings of the Nat’l Academy of Sciences USA 104:2043–2049. 10.1073/pnas.0610699104 [DOI] [PMC free article] [PubMed]
- Francis A, Huson DH, Steel M. Normalising phylogenetic networks. Mol Phylogenet Evol. 2021;163:107215. doi: 10.1016/j.ympev.2021.107215. [DOI] [PubMed] [Google Scholar]
- Glémin S, Scornavacca C, Dainat J, Burgarella C, Viader V, Ardisson M, Sarah G, Santoni S, David J, Ranwez V. Pervasive hybridizations in the history of wheat relatives. Sci Adv. 2019;5(5):eaav9188. doi: 10.1126/sciadv.aav9188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huson DH, Steel M (2020) PhyloSketch. http://ab.inf.uni-tuebingen.de/software/phylosketch
- Huson DH, Rupp R, Scornavacca C. Phylogenetic Networks: Concepts Algorithms and Applications. Cambridge: Cambridge University Press; 2010. [Google Scholar]
- Inagaki Y, Doolittle WF, Baldauf SL, Roger AJ. Lateral transfer of an EF-1 gene: origin and evolution of the large subunit of ATP sulfurylase in Eubacteria. Curr Biol. 2002;12:772–776. doi: 10.1016/S0960-9822(02)00816-3. [DOI] [PubMed] [Google Scholar]
- Jones G, Sagitov S, Oxelman B. Statistical inference of allopolyploid species networks in the presence of incomplete lineage sorting. Syst Biol. 2013;62(3):467–478. doi: 10.1093/sysbio/syt012. [DOI] [PubMed] [Google Scholar]
- Linz S, Semple C. Caterpillars on three and four leaves are sufficient to reconstruct binary normal networks. J Math Biol. 2020;81:961–980. doi: 10.1007/s00285-020-01533-7. [DOI] [PubMed] [Google Scholar]
- Marcussen T, Heier L, Brysting AK, Oxelman B, Jakobsen KS. From gene trees to a dated allopolyploid network: insights from the angiosperm genus Viola (Violaceae) Syst Biol. 2015;64(1):84–101. doi: 10.1093/sysbio/syu071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moret BME, Nakhleh L, Warnow T, Linder CR, Tholse A, Padolina A, Sun J, Timme R. Phylogenetic networks: modeling, reconstructibility, and accuracy. IEEE/ACM Trans Comput Biol Bioinf. 2004;1(1):13–23. doi: 10.1109/TCBB.2004.10. [DOI] [PubMed] [Google Scholar]
- Pardi F, Scornavacca C. Reconstructible phylogenetic networks: do not distinguish the indistinguishable. PLoS Comput Biol. 2015;11(4):e1004135. doi: 10.1371/journal.pcbi.1004135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson DF, Foulds LR. Comparison of phylogenetic trees. Math Biosci. 1981;53:131–147. doi: 10.1016/0025-5564(81)90043-2. [DOI] [Google Scholar]
- Solís-Lemus C, Yang M, Ané C. Inconsistency of species tree methods under gene flow. Syst Biol. 2016;65(5):843–851. doi: 10.1093/sysbio/syw030. [DOI] [PubMed] [Google Scholar]
- Steel M (2016) Phylogeny: Discrete and Random Processes in Evolution. Society for Industrial and Applied Mathematics
- Willson SJ. Properties of normal phylogenetic networks. Bull Math Biol. 2010;72:340–358. doi: 10.1007/s11538-009-9449-z. [DOI] [PubMed] [Google Scholar]
- Willson SJ. CSD homomorphisms between phylogenetic networks. IEEE/ACM Trans Comput Biol Bioinf. 2012;9:1128–1138. doi: 10.1109/TCBB.2012.52. [DOI] [PubMed] [Google Scholar]


