Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2011 Nov 6;311(21):2512–2542. doi: 10.1016/j.disc.2011.07.028

The number of maximum matchings in a tree

Clemens Heuberger a,, Stephan Wagner b
PMCID: PMC3226351  PMID: 22158732

Abstract

We determine upper and lower bounds for the number of maximum matchings (i.e., matchings of maximum cardinality) m(T) of a tree T of given order. While the trees that attain the lower bound are easily characterised, the trees with the largest number of maximum matchings show a very subtle structure. We give a complete characterisation of these trees and derive that the number of maximum matchings in a tree of order n is at most O(1.391664n) (the precise constant being an algebraic number of degree 14). As a corollary, we improve on a recent result by Górska and Skupień on the number of maximal matchings (maximal with respect to set inclusion).

Keywords: Maximum matchings, Trees, Bounds, Structural characterisation

1. Introduction and statement of main results

Many problems in graph theory can be described as follows: for a certain class of graphs and a graph parameter, determine the largest and smallest possible values of the parameter, given the order of a graph (and possibly other conditions). One family that is particularly well-studied in this regard is the family of trees, not only because of their simplicity, but also in view of their many applications in various areas of science.

On the other hand, lots of natural graph parameters are defined as the number of vertex or edge subsets of a certain kind; we mention, for example, the number of independent vertex subsets [15,18], the number of matchings [7], the number of dominating or efficient dominating sets [2,3] or the number of subtrees [13,21]. Some of them play an important role in applications as well, for instance the number of matchings that is known as Hosoya index in mathematical chemistry [9,12] and is also connected to the monomer–dimer model of statistical physics [10]. The same can be said of the number of independent sets, which is studied under the name Merrifield–Simmons index in chemistry [17] and which is related to Hard Models in physics [1]. For both these parameters, the minimum and the maximum among all trees of given order are well known and are obtained for the star and the path respectively. A tremendous number of publications deal with related problems, concerning restricted classes of trees or tree-like graphs; the interested reader is referred to [22] and the references therein.

It is natural to consider variants of these graph parameters: instead of the number of matchings, one might be interested in the number of maximal matchings (maximal with respect to inclusion) or maximum matchings (matchings of largest possible cardinality). The same holds, of course, for the number of independent sets.

The number of maximal independent sets is treated in [19,23]—the maximum turns out to occur for an extended star. More recently, maximal matchings were studied by Górska and Skupień [5], who determined exponential upper and lower bounds for the maximum number of maximal matchings among all trees of given order. To the best of our knowledge, however, there are no analogous results on the number of maximum matchings, i.e., matchings of largest possible cardinality. Clearly, any maximum (cardinality) matching is also maximal with respect to inclusion, but the converse is not true. In fact, graphs for which every maximal matching is also a maximum matching are known as equimatchable [16].

In the following, we denote the number of maximum matchings in a graph G by m(G). Our goal is to characterise the trees of given order n for which the maximum and the minimum of this parameter are attained. This problem also has an algebraic interpretation: it is well known that the characteristic polynomial of a tree T of order |T|=n coincides with the matching polynomial [16]

ϕ(T,x)=k=0n/2(1)kak(T)xn2k,

where ak(T) is the number of matchings of cardinality k in T. This is a special case of a general theorem on the coefficients of the characteristic polynomial; see for instance [4]. It follows that m(T) is precisely the (absolute value of the) last nonzero coefficient of ϕ(T,x) and thus the product of the absolute values of all nonzero eigenvalues. In this sense, m(T) is a multiplicative analogue of the so-called energy of a graph [8,9], which is defined as the sum of the absolute values of all eigenvalues.

The lower bound for m(T) is almost trivial, and the trees that attain it can also be characterised easily:

Theorem 1.1

For any tree T of even order n, m(T)1 with equality if and only if T has a perfect matching. For a tree T of odd order n>1, m(T)2 with equality if and only if T is obtained from a tree T of order n1 with a perfect matching by doubling one of the leaves (i.e., choosing a leaf v and attaching a second leaf to v’s unique neighbour).

We note that a path of even order is an example of a tree of even order admitting a perfect matching.

The analogous problem asking for the largest possible number of maximum matchings appears to be much harder. The bound provided by Górska and Skupień for the number of maximal matchings immediately provides an upper bound for the number of maximum matchings, so that we have m(T)=O(1.395337n) (the constant being a root of the algebraic equation x42x1) by the result stated in [5]. We improve this to the following:

Theorem 1.2

For n{6,34}, there is a unique tree Tn of order nthat maximises m(T) . For n=6 and n=34, there are two such trees. Asymptotically,

m(Tn)cnmod7λn/7,

where λ=12(11+85)10.1097722286464 is the larger root of the polynomial x211x+9 and the constants cj, j{0,,6}, are given in Table 1.

Table 1.

Constants cj in the asymptotics of m(Tn).

j cj
0 67λ71765 ≈ 0.792620574273610
1 11λ1885λ1/7 ≈ 0.787947762616490
2 101047λ90171614125λ2/7 ≈ 0.783080426542439
3 4996λ444821675λ3/7 ≈ 0.788434032505851
4 27λ2185λ4/7 ≈ 0.790280714748050
5 3209λ28177225λ5/7 ≈ 0.785510324593434
6 6451616λ574340810440125λ6/7 ≈ 0.784269603628599

While the improvement in the constant (from 1.395337 to λ1/71.391664) seems modest, the main part of the theorem is the characterisation of the trees Tn, which will be stated explicitly in Section 3. Fig. 1 shows T181 as an example. Since maximum matchings are automatically maximal matchings, the theorem also improves on the lower bound for the maximum number of maximal matchings that was given by Górska and Skupień in [5], which is Ω(1.390972n) (the precise constant being 51+510214).

Fig. 1.

Fig. 1

Unique optimal tree of order 181.

The paper is organised as follows: in the following section, we deal with the simple lower bound (Theorem 1.1), the rest is devoted to the proof of Theorem 1.2. The structure of the “optimal” trees Tn is described explicitly in Section 3, making use of the concept of an outline graph. Then, some important preliminary results (Section 4) and information about the local structure (Section 5) are gathered. The global structure is discussed in Section 6. The proof is rather long and technical—one of the reasons we consider this inevitable is the fact that seven different cases occur in the structure of the optimal trees, and that there is also a number of exceptions from the general pattern (note the case n=34 in Theorem 1.2: the precise characterisation of the structure is only valid for n35). Another reason is that there are many trees that almost reach the upper bound, as can be seen from some of the estimates made on the way to our main result.

2. The lower bound

Let us start with the simple lower bound; as stated in Theorem 1.1, the minimum of m(T) is either 1 or 2, depending on the parity of the order.

Proof

In the case of even n, the inequality is trivial, so that we only have to determine the cases of equality. If T has a perfect matching, then this perfect matching can be reconstructed uniquely, starting from the leaves. Hence equality holds in this case. Otherwise, consider a tree T of order n and a maximum matching M. Since it is not a perfect matching, there is a vertex v that is not covered by the matching. Now choose an arbitrary neighbour w of v. Then w must be covered by the matching M, since one could otherwise add the edge vw to M to obtain a larger matching, contradicting the choice of M. Now replace the edge that covers w by the edge vw to obtain a second matching of the same cardinality as M, which shows that m(T)2 unless T has a perfect matching.

Now let us determine which trees of odd order satisfy m(T)=2. Consider once again a maximum matching. Since the above argument can be carried out for any vertex that is not covered by M, we can only have m(T)=2 if there is exactly one vertex v that is not covered. Furthermore, v must be a leaf: otherwise, we could apply the exchange procedure for each of its neighbours to obtain at least 3 distinct maximum matchings. Let w be v’s unique neighbour and assume that w is covered by an edge vw in M. Then v must also be a leaf, since we could otherwise replace vw by vw and repeat the argument. This shows that equality can only hold in the described case. □

As we will see in the following sections, the analogous question for the maximum of m(T) is much harder and requires a completely different approach. Let us first give a precise description of the trees Tn in Theorem 1.2.

3. The upper bound: description of the optimal trees

As mentioned in the introduction, we define m(T) to be the number of matchings of maximal cardinality of a tree T. A tree T is called an optimal tree if it maximises m(T) over all trees of the same order.

The results on the global structure are formulated in terms of leaves, forks, and chains.

Definition 3.1

  • (1)

    The graph of order 1 is also denoted by L (leaf).

  • (2)

    The rooted tree in Fig. 2(a) (with root r) is denoted by F (fork).

  • (3)
    Chains are defined recursively: for a rooted tree (T,r), we define the rooted tree (CT,s) as in Fig. 2(b). For k1 and a rooted tree T, we set
    CkTC(Ck1T)andC0T=T.

Fig. 2.

Fork and chain (Definition 3.1).

graphic file with name gr2a.jpg

(a) F.

graphic file with name gr2b.jpg

(b) CT.

Using these definitions, we can see five copies of C3F and one copy of C4F as rooted subtrees of the optimal tree in Fig. 1.

Formulating as much as possible using the notations L, F and Ck turns out to give compact representations for optimal trees. Let us formalise this concept:

Definition 3.2

Let T be a tree. We construct the outline graph of T as follows: first, all occurrences CkF and CL as rooted subtrees of T are replaced by special leaves “CkF” and “CL”, respectively (where replacement takes place by decreasing order of the replaced rooted subtree). In a second step, we consider all occurrences of subtrees CkT where T has a unique branch T. Every such subtree is replaced by the subtree T, linked to the rest by a special edge “Ck”.

As an example, the outline graph of the tree from Fig. 1 is shown in Fig. 3.

Fig. 3.

Fig. 3

Outline of the unique optimal tree of order 181.

We are now able to state our main theorem fully describing optimal trees.

Theorem 3.3

Let n4 and n{6,10,13,20,34} . Then there is a unique optimal tree Tn of order n.

  • (1)

    If n1(mod7), then Tn=C(n1)/7L.

  • (2)
    If n2(mod7), then Tn is shown inFig. 4(a), where
    k0=max{0,n3735},kj={n2+7j35 if n37,n9+7j35 if n30
    for j{1,2,3,4}.
  • (3)
    If n3(mod7), then Tn is shown inFig. 4(b), where
    kj=n17+7j28
    for j{0,1,2,3}.
  • (4)

    If n4(mod7), then Tn=C(n4)/7F.

  • (5)
    If n5(mod7), then Tn is shown inFig. 4(c), where
    kj=n5+7j21
    for j{0,1,2}.
  • (6)
    If n6(mod7), then Tn is shown inFig. 4(d), where
    kj=n27+7j49
    for 0j6.
  • (7)
    If n0(mod7), then Tn is shown inFig. 4(e), where
    k=n77.

If n{1,2,3,10,13,20}, there is also a unique optimal tree Tn of order n . For n{1,2,3}, there is only one tree of order n . For n{10,13,20}, Tn is shown in Fig. 5.

For n{6,34}, there are two non-isomorphic optimal trees Tn,1 and Tn,2 of order n . For n=6, T6,1 (the star of order 6) and T6,2 are shown in Fig. 5.

For n=34, both T34,1 and T34,2 have the shape as in Fig. 4(d). We have (k0,k1,k2,k3,k4,k5,k6)=(0,0,0,0,0,0,1) for T34,1 (this corresponds to the general case n6(mod7) as described above) and (k0,k1,k2,k3,k4,k5,k6)=(1,0,0,0,0,0,0) for T34,2.

Fig. 4.

Optimal trees.

graphic file with name gr4a.jpg

(a) Tn for n2(mod7) and n9.

graphic file with name gr4b.jpg

(b) Tn for n3(mod7) and n17.

graphic file with name gr4c.jpg

(c) Tn for n5(mod7).

graphic file with name gr4d.jpg

(d) Tn for n6(mod7) and n27, n34. For n=34, T34,1 and T34,2 also have this shape.

graphic file with name gr4e.jpg

(e) Tn for n0(mod7).

Fig. 5.

Optimal trees for n{6,10,13,20}.

graphic file with name gr5a.jpg

(a) T6,1.

graphic file with name gr5b.jpg

(b) T6,2.

graphic file with name gr5c.jpg

(c) T10.

graphic file with name gr5d.jpg

(d) T13.

graphic file with name gr5e.jpg

(e) T20.

Remark 3.4

The quasi-periodicity of length 7 is somewhat reminiscent of the situation encountered for dominating sets [2,3], even though there are certain differences.

4. The upper bound: preliminaries

4.1. The bipartition condition

A tree may always be seen as a bipartite graph. In the case of an optimal tree, however, the bipartition of the vertices corresponds to a specific behaviour in terms of maximum matchings, as will be shown in this section. This will also allow us to somewhat decompose the problem.

We start with a few definitions.

Definition 4.1

Let T be a forest. The matching number μ(T) is the maximum cardinality of a matching of T. Hence a matching of T is a maximum matching if it has cardinality μ(T). Denoting the empty graph by , it is convenient to set μ()=0 and m()=1.

Definition 4.2

A forest T is called an optimal forest if it maximises m(T) over all forests of the same order.

We now define the type of a vertex. These types will later be seen to correspond to the bipartition of the set of vertices of optimal trees.

Definition 4.3

Let T be a forest. A vertex v is said to be of type A if T admits a maximum matching that does not cover v. Otherwise, v is said to be of type B.

A first step towards the main result on the bipartition holds for all trees: there are no edges between vertices of type A:

Lemma 4.4

Let T be a tree, sV(T) of type A, and t a neighbour of s in T . Then t is of type B . Denoting the connected components of Tst by Ts and Tt with sTs and tTt, cf. Fig. 6, we have

μ(Tss)=μ(Ts),μ(Ttt)=μ(Tt)1,
μ(T)=μ(Ts)+μ(Tt),m(T)=m(Ts)m(Tt)+m(Tss)m(Ttt).

Fig. 6.

Fig. 6

Decomposition of T for Lemma 4.4 and Proposition 4.7.

Proof

We first note that

μ(Tv)μ(T)μ(Tv)+1

holds for any vertex v of any tree T, as any maximum matching of Tv is a matching of T and any maximum matching of T minus possibly the edge covering v is a matching of Tv.

Any maximum matching M of the tree T either contains the edge st or it does not contain the edge st. In the first case, M decomposes into a maximum matching of Tss, a maximum matching of Ttt and the edge st, which implies that μ(T)=μ(Tss)+μ(Ttt)+1. In the second case, M decomposes into a maximum matching of Ts and a maximum matching of Tt, which implies that μ(T)=μ(Ts)+μ(Tt). We conclude that

μ(T)=max{μ(Tss)+μ(Ttt)+1,μ(Ts)+μ(Tt)}. (4.1)

As s is of type A, there is a maximum matching M of T not covering s, hence we have μ(T)=μ(Tss)+μ(Tt). In view of (4.1), this implies that μ(Tt)μ(Ttt)+1, i.e., μ(Ttt)=μ(Tt)1, and μ(Tss)μ(Ts), i.e., μ(Tss)=μ(Ts).

In this case, we also have μ(Tt)=μ(Ttt)+μ(Ts)<μ(Tt)+μ(Ts)=μ(T), i.e., t is of type B. Finally, m(Ts)m(Tt) counts the number of maximum matchings of T not containing st and m(Tss)m(Ttt) counts the number of maximum matchings of T containing st, their sum is therefore m(T). □

We now show that in almost all cases, optimal forests are trees, so we may restrict our attention to trees afterwards. Nevertheless, at one point, we will also use this result as a technical tool when considering trees.

Lemma 4.5

Let T be an optimal forest of order at least 3. Then T is connected, i.e., T is a tree.

Proof

Let T1 and T2 be connected components of T. For simplicity, we may assume that these are the only connected components of T; otherwise, we use the following argument inductively.

As T is optimal, each of its connected components has to be optimal.

If both T1 and T2 are of order 1, then they both only admit the empty matching, inserting an edge between these two vertices does not alter the number of maximum cardinality matchings.

Next, we note that for n3, the star Sn on n vertices satisfies m(Sn)=n1>1. Thus an optimal forest of order at least 3 does not admit a perfect matching, as perfect matchings of trees are unique (see Theorem 1.1). This implies that an optimal forest of order at least 3 has a vertex of type A. As the unique vertex of a tree of order 1 is also of type A, we conclude that all optimal trees except the tree of order 2 have a vertex of type A.

As any neighbour of any vertex of type A is of type B by Lemma 4.4 and the vertices of the tree of order 2 also are of type B, we conclude that every optimal tree of order at least 2 has a vertex of type B.

If T1 and T2 are both of order 2, then there is no vertex of type A, thus T is not optimal.

So we may now assume that vT1 is of type A and wT2 is of type B. If we insert the edge vw, we obtain a new graph T=T+vw. As in Lemma 4.4, we obtain

μ(T)=max{μ(T1v)+μ(T2w)+1,μ(T1)+μ(T2)}.

As v is of type A (with respect to T1) and w is of type B (with respect to T2), we have μ(T1v)=μ(T1) and μ(T2w)+1=μ(T2). This implies that μ(T)=μ(T1v)+μ(T2), i.e., v is of type A with respect to T and Lemma 4.4 can be applied to yield

m(T)=m(T1)m(T2)+m(T1v)m(T2w)>m(T1)m(T2)=m(T),

contradiction.

Thus the only disconnected optimal forest is the forest consisting of exactly two isolated vertices. □

We can now formalise what we will call the bipartition condition.

Definition 4.6

Let T be a tree. We say that T fulfils the bipartition condition if the two classes in T’s unique bipartition contain precisely the vertices of type A and B respectively.

It turns out that indeed almost all optimal trees satisfy this condition.

Proposition 4.7

Let T be an optimal tree of order at least 3. Then T fulfils the bipartition condition.

Let st be an edge of T where s is of type A and t is of type B . The connected components of Tst are denoted by Ts and Tt with sTs and tTt . Then s and t are of types A and B with respect to the trees Ts and Tt, respectively. Furthermore,

m(T)=m(Ts)m(Tt)+m(Tss)m(Ttt). (4.2)

Proof

Assume that s and t are two adjacent vertices of type B.

If we have μ(Tss)+μ(Ttt)+1=μ(Ts)+μ(Tt), then (w.l.o.g.) μ(Tss)=μ(Ts) and μ(Ttt)=μ(Tt)1. In this case, we obtain μ(T)=μ(Tss)+μ(Tt), i.e., s is of type A. Contradiction.

Next, we consider the case that μ(Tss)+μ(Ttt)+1<μ(Ts)+μ(Tt)=μ(T), i.e., the case that st is not contained in any maximum matching of T. Deleting the edge st resulting in a forest T=Tst does not alter the number of maximum matchings, i.e., m(T)=m(T). By Lemma 4.5, T and therefore T are not optimal, contradiction.

Finally, we consider the case μ(T)=μ(Tss)+μ(Ttt)+1>μ(Ts)+μ(Tt), i.e., the case that st is contained in every maximum matching of T. Deleting all edges incident to s or t leads to a disconnected forest of the same order and the same number of maximum matchings. Contradiction.

Thus exactly one of s and t, say s, is of type A by Lemma 4.4 and the remaining assertions of this proposition are restatements of the results of Lemma 4.4. □

4.2. Rooted trees

For many of our arguments, we will designate a vertex of a tree as the root and recursively consider subtrees. To this end, we collect a few definitions as well as some recursive formulæ for the number of maximum matchings.

We assume that all rooted trees are non-empty. A rooted tree with underlying tree T and root r will be denoted by the pair (T,r); frequently, we will simply write T if the root is clear from the context. An important operation that we will frequently apply is to choose another vertex sV(T) as the new root. We will usually denote the resulting rooted tree by a new symbol (T,s) (and thus abbreviated to T) although the underlying unrooted trees T and T are identical.

As usual, the branches of a rooted tree (T,r) of the shape as Fig. 7 are the rooted trees (T1,r1),,(Tk,rk).

Fig. 7.

Fig. 7

Rooted tree with branches.

A rooted subtree (T,v) of an unrooted tree T is a connected component of Tvw for some edge vw of T such that vT. Note that this definition forces T to be a proper subtree of T.

A rooted subtree (T,v) of a rooted tree (T,r) is the connected component of Tvw containing v, where w has to be the parent of v, i.e., T is the subgraph induced by all the successors of v. We will also write T=T(v) in this case.

Let T be a tree and v be a vertex of T with neighbours r1,,rk. The connected components of Tv are denoted by T1,,Tk such that rjTj for all j. Then the rooted trees (T1,r1),,(Tk,rk) are said to be the rooted connected components of Tv (and usually, the roots rj will not be mentioned).

Definition 4.8

Let (T,r) be a rooted tree.

  • (1)

    We define m1(T) to be the number of maximum matchings of T covering the root r.

  • (2)

    We define m0(T) to be m(Tr), the number of maximum matchings of Tr.

  • (3)

    The type of T is defined to be the type of the root as a vertex of the unrooted tree, i.e., (T,r) is of type A if μ(Tr)=μ(T) and of type B if μ(Tr)=μ(T)1. We sometimes write type(T)=A and type(T)=B, respectively.

Thus (T,r) is of type A if and only if it admits a maximum matching not covering the root r.

We have

μ(L)=0,m(L)=1,m0(L)=1,m1(L)=0 (4.3)

for the rooted tree L of order 1, which implies that it is a rooted tree of type A.

Definition 4.9

We define the bipartition condition for rooted trees recursively as follows: a rooted tree of order 1 (rooted at its only vertex) is said to satisfy the bipartition condition. If (T,r) is a rooted tree with branches (T1,r1),,(Tk,rk), then the rooted tree (T,r) is said to fulfil the bipartition condition if all branches (Tj,rj) fulfil the bipartition condition and the type of (T,r) is not equal to the type of any of the branches (Tj,rj).

Remark 4.10

Let T be an optimal tree of order at least 3 and (S,r) be a rooted subtree of T. Then the type of r as vertex of T coincides with the type of S and S fulfils the bipartition condition for rooted trees by Proposition 4.7.

The main goal behind the definition of the two different types is to provide a recursive method to compute m(T). Note first that for a rooted tree (T,r), we have

m(T)={m0(T)+m1(T), if (T,r) is of type A,m1(T), if (T,r) is of type B.

We now give recursive formulæ for these quantities in terms of the branches of a rooted tree. Here, for technical reasons, we do not assume the bipartition condition for rooted trees, but a weaker version only, and derive the bipartition condition for rooted trees.

Lemma 4.11

Let (T,r) be a rooted tree and (T1,r1),,(Tk,rk) its branches. We assume that T1,,Tk are of the same type. Then T is of the other type and we have

m0(T)=j=1km(Tj), (4.4)
m1(T)=m0(T)j=1km0(Tj)m(Tj). (4.5)

Proof

If (T,r) is of order 1, then there are no branches, and the product in (4.4) and the sum in (4.5) are empty, which coincides with the values for L given in (4.3). Thus we may focus on the case that the order of (T,r) is at least 2.

As Tr consists of the connected components T1,,Tk, we clearly have μ(Tr)=i=1kμ(Ti) and (4.4). Furthermore,

μ(T)=max({μ(Tr)}{1+μ(Tjrj)+ijμ(Ti):j{1,,k}}),

as a maximum matching either does not cover r or contains the edge rrj for some j.

If all branches are of type B, i.e., μ(Tjrj)=μ(Tj)1 for all j, then 1+μ(Tjrj)+ijμ(Ti)=iμ(Ti)=μ(Tr) for all j. This implies that μ(T)=μ(Tr), T is of type A and each of the edges rrj can be used in a maximum matching.

If all branches are of type A, i.e., μ(Tjrj)=μ(Tj) for all j, then 1+μ(Tjrj)+ijμ(Ti)=1+iμ(Ti)=1+μ(Tr) for all j. This implies that μ(T)=μ(Tr)+1, T is of type B and again, each of the edges rrj can be used in a maximum matching.

There are m(T1)m(Tj1)m0(Tj)m(Tj+1)m(Tk) maximum matchings of T containing the edge rrj. Summing over all j yields (4.5). □

If T1,,Tk are rooted trees of type A, then the rooted tree with branches T1,,Tk is also denoted by B(T1,,Tk). It is of type B by Lemma 4.11.

Similarly, if T1,,Tk are rooted trees of type B, then the rooted tree with branches T1,,Tk is also denoted by A(T1,,Tk). It is of type A by Lemma 4.11. If k=1, we will omit the parentheses and simply write AT1.

The crucial quantity in our investigation will be the following quotient:

Definition 4.12

For a rooted tree (T,r), we set ρ(T)=m0(T)/m(T).

We note that by definition, ρ(T)>0 for all rooted trees (T,r).

We now reformulate the recursive formulæ for m and m0 to yield recursive formulæ for ρ.

Lemma 4.13

Let (T,r) be a rooted tree fulfilling the bipartition condition with branches (T1,r1),,(Tk,rk) . Then

ρ(T)={11+j=1kρ(Tj), if (T,r) is of type A,1j=1kρ(Tj), if (T,r) is of type B.

Proof

This is a simple consequence of (4.4) and (4.5). □

4.3. α-optimality

It turns out that a rooted subtree of an optimal tree no longer needs to be optimal. Instead, we introduce the auxiliary notion of α-optimality.

Definition 4.14

Let α be a non-negative real number. A rooted tree (T,r) is said to be α-optimal if it fulfils the bipartition condition and if

m(T)+αm0(T)=max{m(T)+αm0(T):(T,r) is a rooted tree fulfilling the bipartition condition with |T|=|T|andtype(T)=type(T)}. (4.6)

Note that 0-optimality is just ordinary optimality. This definition is motivated by the fact that any rooted subtree of an optimal tree is indeed α-optimal for an appropriate value of α:

Proposition 4.15

Let T be an optimal tree, st an edge of T and Ts and Ttthe connected components of Tst, with sTs and tTt . Then (Ts,s) is a ρ(Tt)-optimal tree and (Tt,t) is a ρ(Ts)-optimal tree.

Proof

If the order of T is 2, the statement holds trivially.

Reformulating (4.2) in terms of the function ρ shows that

m(T)=m(Tt)(m(Ts)+ρ(Tt)m0(Ts)).

If Ts was not ρ(Tt)-optimal, we could replace it by a ρ(Tt)-optimal tree and this would increase m(T), contradiction. The same argument applies to Tt. □

We note the fact that ρ(T)1 holds for all rooted trees of type A by Lemma 4.13, where equality holds if and only if T=L. Thus, by Proposition 4.15, we may restrict ourselves to the investigation of α-optimal trees of type A with α[0,) as well as α-optimal trees of type B with α[0,1].

A few rooted trees will be considered repeatedly in our proofs. These are shown in Fig. 8. One could indeed show that these trees are α-optimal for some α>0, but we do not need this information. On the other hand, we will later need to know that some rooted trees are not α-optimal for some ranges of α. We list these trees (together with a replacement T) in Table 3 in the Appendix. Similarly, we list a few non-optimal trees in Table 2, where Tn is given in Theorem 3.3. We will simply refer to the entries of these two tables by (R1)(R11). These tables can be verified using a Sage [20] program available in [11].

Fig. 8.

Fig. 8

Some important rooted trees. All trees are given with the pair (m(T),m0(T)).

Table 3.

Replacements for rooted subtrees: |T|=|T|, type(T)=type(T) and m(T)+αm0(T)>m(T)+αm0(T) hold for the given range of α. Here, the additional abbreviation CLkS=AB(L,L,CLk1S) and CCLkS=AB(L,CL,CLk1S) for k1 have been used, where, as usual, CL0S=S and CCL0S=S.

T |T| m(T) m0(T) T m(T) m0(T)
(R4) BA3 4 1 2 B(L,L,L) 3 1 α<2
B(L,A3) 5 3 2 B(L,L,L,L) 4 1 α<1
B(A3,A3) 7 4 4 B(L,CLL) 7 4 α0
B(A3,A3,A3) 10 12 8 B(L,F,F) 21 9 α0
(R5) AB(F,F,A24) 34 59049 39366 CCL2CL2F 58999 41839 α>502473
AB(F,A14,A14) 34 59049 39366 CCL2CL2F 58999 41839 α>502473
AB(F,A14,A24) 44 1594323 1062882 CCCL2C2L 1618650 1139139 α0
AB(F,A24,A24) 54 43046721 28697814 C2AB(L,C2L,C3L) 44259488 31126973 α0
AB(A14,A14,A14) 44 1594323 1062882 CCCL2C2L 1618650 1139139 α0
AB(A14,A14,A24) 54 43046721 28697814 C2AB(L,C2L,C3L) 44259488 31126973 α0
AB(A14,A24,A24) 64 1162261467 774840978 C9L 1209774005 850782533 α0
AB(A24,A24,A24) 74 31381059609 20920706406 C10F 33062296902 23251305273 α0
(R6) B(L,A10) 12 34 21 B(L,L,CL2L) 41 15 α<76
(R7) B(L,A14) 16 135 81 B(L,L,CL3L) 153 56 α<1825
B(L,A24) 26 3645 2187 B(L,L,CLCCLCL) 4235 1551 α<295318
(R8) B(A10,A7,A7) 25 2512 1344 B(L,CLCCLCL) 2684 1551 α0
B(A10,A10,A7) 28 6573 3528 B(F,CL,C2L) 7759 3696 α0
B(A10,A10,A10) 31 17199 9261 B(F,CL,C2F) 20967 9999 α0
(R9) B(A7,F,F) 16 141 72 B(L,L,CL3L) 153 56 α<34
B(A7,A7,F) 19 368 192 B(L,L,CL3F) 418 153 α<5039
B(A7,A7,A7) 22 960 512 B(L,CL4F) 989 571 α0
(R10) B(L,L,L,CLL) 9 15 4 B(L,A7) 13 8 α>12
B(L,L,CL4L) 20 571 209 B(L,F,A14) 567 243 α>217
(R11) B(L,F,CA14) 27 5751 2430 B(L,F,AB(F,F,CF)) 5742 2457 α>13
B(L,F,CA24) 37 155277 65610 B(L,F,AB(F,F,AB(F,F,CF))) 155007 66420 α>13

Table 2.

Replacements for trees: |T|=|Tn|=n and m(T)<m(Tn) hold.

T n|T| m(T) m(Tn)
(R1) B(L,L,L,L,AB(L,L,L)) 10 19 21
B(L,L,L,L,A6) 11 24 30
(R2) B(A7,F,F,L) 17 213 216
(R3) AB(L,L,AB(L,L,AB(L,L,AB(L,L,L)))) 17 209 216

4.4. Exchanging subtrees

In order to derive information on the structure of optimal trees, we will compare optimal trees with trees where some rooted subtrees have been exchanged. In order to estimate the effect of such exchange operations, we need an extension of our recursive formulæ (4.4) and (4.5) to finer decompositions of a tree. These extensions will be formulated in terms of continuants and continued fractions.

We therefore fix some notations and definitions in the context of continuants and continued fractions. We follow Graham, Knuth and Patashnik [6], Section 6.7.

Definition 4.16 [6, (6.127)]

The continuant polynomial Kn(x1,,xn) has n parameters, and it is defined by the following recurrence:

Kn(x1,,xn)=Kn1(x1,,xn1)xn+Kn2(x1,,xn2) (4.7)

for n2 and K0()=1, K1(x1)=x1.

We will omit the index n in Kn whenever it is clear from the context.

We need the following additional properties of continuants:

Lemma 4.17

We have

K(x1,,xn)=K(xn,,x1) (4.8)
Kn(x1,,xn)=x1Kn1(x2,,xn)+Kn2(x3,,xn), (4.9)

Proof

The symmetry relation (4.8) is [6, (6.131)], the recursion (4.9) is a consequence of the symmetry relation (4.8) and the defining recursion (4.7), cf. [6, (6.132)]. □

The following lemma shows how continuants can be used to determine m(T). We use the Iversonian notation [expr]=1 if expr is true and [expr]=0 otherwise, cf. Knuth [14].

Lemma 4.18

Let T be a tree fulfilling the bipartition condition of the shape given in Fig. 9for some k0, integers ri0 for 0ik, and rooted trees Si,j, 0ik, 1jri.

Then

m(T)=K(ρ0,ρ1,,ρk1,ρk)h=0kj=1rhm(Sh,j),

where

ρi=[typevi=A]+j=1riρ(Si,j).

Fig. 9.

Fig. 9

Shape of T for the exchange lemma.

Proof

We set

Mi=h=ikj=1rhm(Sh,j)

and consider v0 as root of T. We claim that

m(T(vi))=MiK(ρi,ρi+1,,ρk1,ρk),
m0(T(vi))=MiK(ρi+1,,ρk1,ρk)

holds for 0ik. This can be shown by reverse induction on i using only the recursive formulæ (4.4), (4.5) and (4.9). □

We now turn to continued fractions.

Definition 4.19

We set

CF(x0,x1,,xn)=x0+1x1+1x2+1+1xn.

As usual, for a sequence (xk)k0, the infinite continued fraction CF(x0,x1,) is defined as the limit limkCF(x0,x1,,xk).

The connection between continuants and continued fractions is stated in the following result.

Lemma 4.20 [6, (6.136)]

We have

CF(x0,x1,,xn)=K(x0,x1,,xn)K(x1,,xn).

We are now able to formulate our main exchange lemma. It comes in several flavours: first, the most general version is stated, which might be cumbersome to use. Next, in a mostly symmetric case, we get a neat formulation, which will be frequently used. Finally, we give two estimates for the asymmetric case, which are not best possible, but sufficient for our purposes. For these estimates, we make some assumptions on the occurring values of ρ which will be fulfilled in the applications later on.

Lemma 4.21

Let T be an optimal tree of the shape given in Fig. 9 for some even k2, integers ri0 for 0ik, and rooted trees Si,j, 0ik, 1jri . We set

ρi=[typevi=A]+j=1riρ(Si,j).

Let 0s0r0 and 0skrk and set

xj=1s0ρ(S0,j),yj=1skρ(Sk,j),
a[typev0=A]+j=s0+1r0ρ(S0,j),b[typevk=A]+j=sk+1rkρ(Sk,j)

so that ρ0=x+a and ρk=y+b . Assume that y+a>0, x+b>0 and ρ1, ρk1>0.

  • (1)
    If x>y, then
    CF(a,ρ1,ρ2,,ρk2,ρk1)CF(b,ρk1,ρk2,,ρ2,ρ1). (4.10)
  • (2)

    If x>y and (ρ1,,ρk1)=(ρk1,,ρ1), then ab.

  • (3)
    If x>y, ρj=1 for odd j and ρju for all even j with 2jk2 and for fixed 0<u, then
    a<b+U0(,u),
    where
    U0(,u)=1CF(1,u,1,u,1)1CF(1,,1,,).
    In particular, we have
    U0(1,2)<0.1153,U0(2,3)<0.0597,U0(3,4)<0.0373.
  • (4)
    If x>y, ρj=1 for even j with 2jk2 and ρju for all odd j and for fixed 0u, then
    a<b+U1(,u),
    where
    U1(,u)=1CF(,1,)1CF(u,1,u,1,).
    In particular, we have
    U1(1,2)<0.3007,U1(2,3)<0.1113,U1(3,4)<0.0596.

Proof

  • (1)
    Set M=h=0kj=1rhm(Sh,j) and let T be the tree arising from T by exchanging S0,1,,S0,s0 against Sk,1,,Sk,sk. As k is even and a+y>0 and b+x>0, the types of all vj are the same in T and T. As T is an optimal tree, we have
    0m(T)m(T)M=K(x+a,ρ1,,ρk1,y+b)K(y+a,ρ1,,ρk1,x+b)=((x+a)(y+b)(y+a)(x+b))K(ρ1,,ρk1)+((x+a)(y+a))K(ρ1,,ρk2)+((y+b)(x+b))K(ρ2,,ρk1)=(xy)K(ρ1,,ρk1)(ba+K(ρ1,,ρk2)K(ρ1,,ρk2,ρk1)K(ρ2,,ρk1)K(ρ1,ρ2,,ρk1))=(xy)K(ρ1,,ρk1)(ba+1CF(ρk1,,ρ1)1CF(ρ1,,ρk1))=(xy)K(ρ1,,ρk1)(CF(b,ρk1,,ρ1)CF(a,ρ1,,ρk1))
    by Lemma 4.18, (4.7), (4.9), (4.8), Lemma 4.20 and the obvious recursion formula for continued fractions. The result follows upon division by the positive quantity (xy)K(ρ1,,ρk1).
  • (2)

    The symmetry implies that CF(ρk1,,ρ1)=CF(ρ1,,ρk1) and the result follows from (4.10).

  • (3)
    If k4, then the assertion follows from Lemma 4.21(2). So we may assume k6. By (4.10) we have
    a+1CF(1,,1,,)=CF(a,1,,1,,)<CF(a,1,,,,1)CF(a,1,ρ2,,ρk2,1)CF(b,1,ρk2,1,ρk4,1,,ρ2,1)CF(b,1,u,1,u,1,,u,1)CF(b,1,u,1,u,1)=b+1CF(1,u,1,u,1),
    as decreasing the entries at even-numbered indices of a continued fraction CF(x0,x1,) decreases the continued fraction, and increasing entries at odd-numbered indices also decreases the continued fraction.
  • (4)
    If k2, then the assertion follows from Lemma 4.21(2). So we may assume k4. By (4.10) we have
    a+1CF(u,1,u,1,)=CF(a,u,1,u,1,)<CF(a,u,1,,1,u)CF(a,ρ1,1,,1,ρk1)CF(b,ρk1,1,,1,ρ1)CF(b,,1,)=b+1CF(,1,).
     □

This exchange lemma will be used repeatedly in the following to deduce information about the structure of optimal trees. To simplify explanations, we will call the vertices v0 and vk in Fig. 9 pivotal vertices.

5. The upper bound: local structure

We have now gathered enough auxiliary tools to start with the proof of Theorem 3.3 and thus Theorem 1.2. To abbreviate some statements, we introduce the following definitions.

Definition 5.1

Let T be a tree. We say that it fulfils the local conditions (LC), if all of the following conditions are fulfilled:

  • (LC1)

    T fulfils the bipartition condition,

  • (LC2)

    each vertex of type A has degree 1 or 2,

  • (LC3)

    each vertex of type B has degree at least 3,

  • (LC4)

    each vertex of degree 3 is adjacent to at least two leaves,

  • (LC5)

    each vertex has degree at most 4,

  • (LC6)

    no vertex is adjacent to 3 leaves.

By Proposition 4.7, an optimal tree of order 3 fulfils LC1.

The following theorem will be shown step by step in Sections 5.1–5.3:

Theorem 5.2

Let S={T2,T3,T5,T6,1,T6,2,T8,T9,T10,T12,T13,T16,T20} and T be an optimal tree with TS.

Then T fulfils the local conditions LC1–LC6.

We note that it is debatable whether LC6 shall be considered to be part of the local structure as T8, T9, T12, T16 fulfil LC1–LC5 and are contained in the generic cases described in Theorem 3.3. So these trees may simply be seen as degenerated cases of the generic cases even though LC6 is violated. On the other hand, T13 and T20 fulfil LC1–LC5, but not LC6, and these two trees are not contained in one of the generic families of Theorem 3.3. Since the overall proof is simpler when excluding the trees in S at this stage, this is the route we proceed on.

5.1. Vertices of type A and estimates for vertices of type B

We first aim to show that almost all optimal trees fulfil LC2 and LC3. As a first step, we will show that almost all rooted subtrees of optimal trees contain a k-claw for k{2,3,4}, i.e., a rooted subtree with k branches all of which are single vertices, see Fig. 13(a). In a second step, the existence of k-claws will provide us with bounds for ρ(S) for rooted subtrees S of optimal trees. These bounds will be quite weak, but sufficient for using our general exchange lemma (Lemma 4.21) to give a useful technical result on decompositions of optimal trees along a path. This almost immediately yields LC2. We then characterise all optimal trees containing a B2 or a 4-claw as a rooted subtree (there are only very few), such that from the end of this subsection, we can work exclusively with 2- and 3-claws.

Fig. 13.

k-claw and tree T for the proof of Lemma 5.4.

graphic file with name gr13a.jpg

(a) k-claw.

graphic file with name gr13b.jpg

(b) T.

Lemma 5.3

Let S be a rooted subtree of an optimal tree TT6,2 . Then S is isomorphic to L, A3, B2 or it contains a k-claw for some k2, i.e., a rooted subtree as in Fig. 13(a).

Proof

If |S|2, then S{L,B2} and there is nothing to show. We assume that |S|>2 and that S does not contain a k-claw for any k2.

Let v1 be a leaf of S of maximum height. Then v1 is of type A. If its parent v2 (which is of type B) has other branches, they have to be leaves by the choice of v1 and we found a k-claw for k2, contradiction. Thus v2 has only one branch, v1.

The parent of v2 is called v3. It has to be of type A. So all branches of v3 are of type B, thus they cannot be leaves. By construction, all branches of v3 are isomorphic to B2, cf. Fig. 10(a).

Denote the branches of the rooted tree S0S(v3) by B1,B2,,Bk and assume that k2. Then we have m0(Bj)=1 and m(Bj)=1. Thus m0(S0)=1 and m(S0)=k+1. If we remove B2 and add the two vertices as children of v2, cf. Fig. 10(b), the resulting branch B1 has m0(B1)=1 and m(B1)=3. The modified tree S0 has m0(S0)=3 and m(S0)=3(1+k2+1/3)=3k2k+2>m(S0), contradiction to Proposition 4.15. Thus v3 has only one child.

If S=S0, then S=A3 and there is nothing to show. Otherwise, the parent of v3 is called v4. Then T has the shape shown in Fig. 11 for some k0 and rooted trees A0,,Ak of type A.

Each of the Aj, j{1,,k} is either a leaf (with ρ(Aj)=1) or an A3 with ρ(Aj)=1/2. As ρ(A0)>0, Lemma 4.21(2) (with v4 and v2 as pivotal vertices) yields ρ(A1)++ρ(Ak)1, i.e., either k1 or k=2 and both A1 and A2 are isomorphic to A3. Thus S(v4) is one of the trees in Fig. 12. The trees S1, S3, S4 are not α-optimal for any α[0,1], cf. (R4), contradiction to Proposition 4.15. The tree S2 is not α-optimal for α<1, cf. (R4), so we must have ρ(A0)=1 and therefore A0=L. Thus we must have T=T6,2, which has been excluded. □

Knowing now that almost every rooted subtree of an optimal tree contains a k-claw with k2, we show that no k-claws with k5 occur.

Fig. 10.

Shape of S0=S(v3) and S0 in the proof of Lemma 5.3.

graphic file with name gr10a.jpg

(a) S0=S(v3).

graphic file with name gr10b.jpg

(b) S0.

Fig. 11.

Fig. 11

Shape of S(v4) in the proof of Lemma 5.3.

Fig. 12.

S(v4) in the proof of Lemma 5.3.

graphic file with name gr12a.jpg

(a) S1.

graphic file with name gr12b.jpg

(b) S2.

graphic file with name gr12c.jpg

(c) S3.

graphic file with name gr12d.jpg

(d) S4.

Lemma 5.4

For k5, a k-claw does not occur as rooted subtree of an optimal tree.

Proof

Let (T,b) be a k-claw, cf. Fig. 13(a), and (T,b) be the rooted tree in Fig. 13(b) of the same order and type.

We have

m0(T)=1k=1,m(T)=m1(T)=1(1++1)=k,
m0(T)=31k4=3,m(T)=m1(T)=3(23+(k4))=3k10.

For k5, we have m(T)k=m(T) and m0(T)=3>1=m0(T), thus T cannot be α-optimal for any α>0, so it is not a subtree of an optimal tree by Proposition 4.15. □

We are now able to prove lower bounds for ρ(S) for rooted subtrees S of optimal trees. The key idea is the following: changing the root of S to another root can only alter m0(S), but m(S) remains unchanged. Changing the root of S cannot increase ρ(S), since this would increase m(S)+αm0(S), contradiction to the α-optimality of S. The new roots used for comparison will be leaves or roots of k-claws.

We start with lower bounds for rooted subtrees of type A.

Lemma 5.5

Let S be a rooted subtree of type A of an optimal tree T . Then ρ(S)1/2 with equality if and only if S=A3.

Proof

Let T consist of the rooted subtrees (S,s) and (Tt,t) of types A and B, respectively, and of the edge st.

For |S|3, we have S{L,A3} and there is nothing to show, so we assume |S|>3.

By Lemmata 5.3 and 5.4, S contains an -claw for some 24. We switch the root of S to a leaf of the -claw, obtaining a new rooted tree (S,a1) shown in Fig. 14. The rooted tree (A,r) arises from the rooted connected component Tr of Tbr by removing the rooted subtree Tt of type B. Removing a rooted subtree of type B from a rooted tree of type A fulfilling the bipartition condition for rooted trees yields a rooted tree of type A, so A is of type A.

The ρ(Tt)-optimality of S together with m(S)=m(S) implies that

ρ(S)ρ(S)=11+11+ρ(A)>11+11=12.

 □

Fig. 14.

Fig. 14

Tree S in the proof of Lemma 5.5.

We are now able to prove a lower bound for ρ(S) for rooted subtrees S of type B.

Lemma 5.6

Let (S,r) be a rooted subtree of type B of an optimal tree T with |T|3 . If S contains a k-claw for some k1, then ρ(S)11+k.

Proof

By Proposition 4.7, T and S fulfil the bipartition condition and the bipartition condition for rooted trees, respectively.

If S has only one branch, say S=B(S), then S is of type A by the bipartition condition for rooted trees and ρ(S)1. Thus ρ(S)=1/ρ(S)1, as required. So we assume that S has more than one branch.

If S is the k-claw, then ρ(S)=1/k>1/(k+1).

Let T consist of the rooted subtrees (S,r) and (Tt,t) of types B and A, respectively, and of the edge rt.

We change the root of S to the root of the k-claw, which results in a rooted tree (S,s) with m(S)=m(S) shown in Fig. 15. Here A arises from a rooted subtree of T of type A by removing the rooted subtree (Tt,t). Since it was assumed that S has more than one branch, we conclude that A is still of type A by Lemma 4.11 and that ρ(A)1. As S is ρ(Tt)-optimal by Proposition 4.15, this yields

ρ(S)ρ(S)=1k+ρ(A)1k+1.

 □

Fig. 15.

Fig. 15

Tree S in the proof of Lemma 5.6.

Next, we give a preliminary upper bound for ρ(S) for rooted subtrees of type A:

Lemma 5.7

Let S be a rooted subtree of type A of an optimal tree T . Then S=L or ρ(S)5/6.

Proof

Assume that SL. Let S=A(B1,,B) for suitable branches B1,,B for some 1. By Lemmata 5.6, 5.3 and 5.4, we have ρ(B1)1/5, which implies

ρ(S)=11+ρ(B1)++ρ(B)11+ρ(B1)56.

 □

We have now collected the necessary (weak) bounds for ρ(S) for rooted subtrees of optimal trees. These suffice for the following path decomposition lemma, using the exchange lemma (Lemma 4.21) to derive bounds for ρi along a path (as in Fig. 9) when the two ends of the path are roots of claws.

Lemma 5.8

Let T be an optimal tree of the shape as in Fig. 9 for some even k2 with S0,1==S0,r0=L, Sk,1==Sk,rk=L, i.e., v0 and vk are the roots of an r0-claw and an rk-claw, respectively. We assume that r0rk1 and r02 and set

ρi=[typevi=A]+j=1riρ(Si,j).

Then vi is of type A and (ρi,ri)=(1,0) for odd i and r01ρirk for even i with 0<i<k . In particular, we have r0rk+1.

Proof

By the bipartition condition and the fact that v0 and vk are the roots of an r0-claw and an rk-claw, respectively, we conclude that vi is of type A for odd i and of type B for even i. We define the rooted trees (R,v0) and (L,vk) as two rooted versions of T, so that the notations R(vi) and L(vi) are defined.

We prove the lemma by induction on i, where we first only prove that

vi is of type A and (ρi,ri)=(1,0) for odd i and r01ρir0 for even i with 0<i<k, (5.1)

i.e., we relax the upper bound for ρi in the case of even i.

We first consider the case of odd i, i.e., vi is of type A. If ρi>1, we conclude that ρ(R(vi+1))<0+U1(r01,r0) from Lemma 4.21(4) (with vi and S0,1=L as pivotal vertices). By Lemma 5.6, we have ρ(R(vi+1))1/(rk+1)1/(r0+1). As U1(d1,d)<1/(d+1) for d{2,3,4}, this is a contradiction. So ρi=1 and therefore ri=0.

Next, we consider the case of even i, i.e., vi is of type B. If ρi>ρ0=r0, then Lemma 4.21(3) (now with vi and v0 as pivotal vertices) yields ρ(R(vi+1))U0(r01,r0)0.1153, a contradiction to Lemma 5.5. Thus we have ρiρ0=r0.

For the lower bound on ρi, we assume that ρi<r01=ρ(S0,2)++ρ(S0,r0). Then Lemma 4.21(3) implies 1=ρ(S0,1)ρ(R(vi+1))+U0(r01,r0)<ρ(R(vi+1))+0.1153. As ρ(R(vi+1))5/6 by Lemma 5.7, this is a contradiction. This concludes the proof of (5.1).

Finally, rirk is again a consequence of Lemma 4.21(3), as ρ(L(vi1))1/2>0.1153.

If k>2, then r01r2rk. If k=2, then Lemma 4.21(2) and ρ(S0,r0)>0 imply that r01=ρ(S0,1)++ρ(S0,r01)ρ(S0,1)++ρ(S0,rk)=rk, as required. □

Combining the description of rooted subtrees without any k-claw with k2 with the path decomposition lemma (Lemma 5.8) shows that B2 is forbidden in almost all optimal trees.

Lemma 5.9

Suppose that B2 is a rooted subtree of an optimal tree T . Then T{T3,T6,2}.

Proof

Assume first that T does not contain an -claw for any 2. As T contains a B2 as a rooted subtree, we have |T|3. Consider a leaf s of T. Then by Proposition 4.7, the rooted subtree Ts of T is of type B and therefore equals B2 by Lemma 5.3. We conclude that T=T3.

So we may now assume that T contains an -claw with 2. Thus T can be decomposed as in Lemma 5.8 with r02 and rk=1 for some k2. By Lemma 5.8, we have r01+1=2, so r0=2. If k=2, then we have T=T6,2.

So we may assume that k>2. By Lemma 5.8 again, we have ρk2=1, which by Lemmata 5.5 and 5.7 implies that rk2=1 with Sk2,1=L or rk2=2 with Sk2,1=Sk2,2=A3. Thus R(vk2) is a rooted subtree of T of type B and order at least 5 containing no -claw for any 2, contradiction to Lemma 5.3. □

Remark 5.10

Having excluded B2, we can also exclude the presence of A3 as a rooted subtree in the following, which will be important in many arguments.

As a direct consequence of the path decomposition lemma (together with the information that it can always be applied as B2 has now been excluded), we have shown LC2.

Proposition 5.11

Let TT2 be an optimal tree. Then T fulfils LC1–LC2.

Proof

Let v be a vertex of type A in T of degree at least 2. Then T may be represented as in Lemma 5.8 with v=vi for some odd i: choose a longest path that contains v. The ends of this path are leaves, their unique neighbours are the pivotal vertices v0 and vk. All but one of the neighbours of v0 have to be leaves by the choice of the path, and there has to be more than one such neighbour in view of Lemma 5.9. The same applies to vk. Hence ri=0 by Lemma 5.8, i.e., degv=2. □

We conclude this subsection by excluding 4-claws in almost all cases. To do so, we will use a direct substitution for those cases which are allowed by the path decomposition lemma.

Lemma 5.12

Let T be an optimal tree containing a 4-claw as a rooted subtree. Then T=T6,1.

Proof

We denote the root of the 4-claw by w. The neighbour of w which is not contained in the 4-claw is denoted by v. By Proposition 5.11, T fulfils LC1–LC2. As w is of type B, v is of type A and degv2. If v is a leaf, then T=T6,1 and we are done. So we assume the contrary and denote the neighbour of v different from w by u. Then T is of the shape shown in Fig. 16(a), where A1,,A denote some trees of type A with ρ(A1)ρ(A2)ρ(A). If A1==A=L, then {3,4} by Lemmata 5.4 and 5.8. Both cases do not lead to an optimal tree, cf. (R1).

So we may assume that ρ(A)<1, whence Lemma 4.21(2), with u and w as pivotal vertices, yields ρ(A1)++ρ(A1)3. Thus we have =4 and A1=A2=A3=L or 5.

We consider the tree T shown in Fig. 16(b) of the same order as T. Using the abbreviations a=ρ(A3)++ρ(A), b=ρ(A1)+ρ(A2) and M=m(A1)m(A) as well as the optimality of T, Lemma 4.18 yields

M(5a+5b+4)=MK(a+b,1,4)=m(T)m(T)=3MK(a,1,b+23)=M(3ab+5a+3b+2),

which implies

2b(3a2).

As b=ρ(A1)+ρ(A2)1 and a3/2 (for 5 as well as for =4 and ρ(A3)=1) by Lemma 5.5, this is a contradiction. □

Fig. 16.

Trees considered in Lemma 5.12.

graphic file with name gr16a.jpg

(a) T.

graphic file with name gr16b.jpg

(b) T.

5.2. Lower degree bounds for vertices of type B

We now want to show that almost all optimal trees fulfil LC1–LC4. In order to facilitate the discussion, we introduce the notion of a “light” vertex.

Definition 5.13

Let T be a tree fulfilling the bipartition condition and v a vertex of type B in T. Then v is said to be a light vertex if it has degree 3 and is adjacent to at most one leaf.

The sum of the ρ-values of the rooted connected components of Tv is quite small for a light vertex v. The exchange lemma then forbids vertices whose rooted connected components have a high sum of ρ-values.

The path decomposition lemma can be used to derive a description of light vertices:

Lemma 5.14

If T{T2,T3,T6,2} is an optimal tree, then T fulfils LC1–LC3. If v is a light vertex of T, then degv=3 and v is adjacent to exactly one leaf.

Proof

If T has no light vertex, then T fulfils LC1–LC4. So we assume that v is a light vertex. Denote the rooted connected components of Tv by T0,,Tk1 with |T0||T1||Tk1|. As v is a light vertex, we have k3. If |T0|=1, i.e., T0==Tk1=L, we have k2 and T{T2,T3}, which have been excluded. If |T1|=1, then v is the root of a (k1)-claw. As v is light, we have k=2, contradiction to Lemma 5.9.

So both T0 and T1 contain an 0-claw and an 1-claw, respectively, for some 02, 12 by Lemmata 5.3 and 5.9. By Lemma 5.8, we obtain ρ(T2)++ρ(Tk1)1. As k3 by assumption, we have ρ(T2)++ρ(Tk1)=ρ(T2)1, thus T2 is a leaf, as required. □

We now describe vertices of type B when a light vertex is present.

Lemma 5.15

Let T{T2,T3,T6,2} be an optimal tree, v be a light vertex and w be a vertex of type B of T . Then either degw=4 and w is not adjacent to any leaf or degw=3 and w is adjacent to one or two leaves.

Proof

If w is light, then there is nothing to show by Lemma 5.14. Otherwise, either degw4 or degw=3 and w is adjacent to two leaves, as required. So we now assume that degw4.

Denote the rooted connected components of Tw by T0,,T1 with vT0 and ρ(T1)ρ(T1). The rooted connected component of Tv which does not contain w and is not a leaf is denoted by S. By Lemma 5.5, we have ρ(T2)+ρ(T1)>ρ(L) (since A3 has been excluded). Now we make use of a combination of Lemma 5.8 and Lemma 4.21(3)—the following argument will be used several times, so we only explain it in detail here: the path between v and w can be extended to a longest path ending in an r0- and an rk-claw, with 2r0, rk3 by Lemmata 5.9, 5.4 and 5.12. Application of Lemma 5.8 now shows that the vertices on the path between v and w satisfy the necessary conditions to make Lemma 4.21(3) applicable (with v and w as pivotal vertices), which yields

ρ(T1)++ρ(T3)ρ(S)+0.1153. (5.2)

From Lemmata 5.5 and 5.7, we obtain

32<ρ(T1)++ρ(T3)ρ(S)+0.115356+0.1153<1,

which yields <5 and degw==4. Furthermore, (5.2) together with Lemma 5.7 yields ρ(T1)56+0.1153<1, i.e., 1>ρ(T1)ρ(T2)ρ(T3), so w is not adjacent to any leaf. □

Light vertices correspond to low values of ρ(A) for rooted subtrees of type A. This correspondence is described in the following two lemmata.

Lemma 5.16

Let T{T2,T3,T6,2} be an optimal tree and A be a rooted subtree of T of type Athat contains no light vertex of T . Then

ρ(A)23

with equality for A{F,A14,A24}.

Proof

We prove the result by induction on the order of A. If A has order 1, then ρ(A)=1.

Otherwise, A=AB(A1,,Ak) for suitable rooted trees A1,,Ak of type A by Proposition 5.11. As A does not contain any light vertex, we have A=F with ρ(A)=2/3 or k3.

We now turn to the case k3. We have ρ(Aj)2/3 by the induction hypothesis and therefore

ρ(A)=11+1ρ(A1)++ρ(Ak)11+1323=23.

Equality holds for k=3 and ρ(A1)=ρ(A2)=ρ(A3)=2/3. By the induction hypothesis, we conclude that A1, A2, A3{F,A14,A24}. We have A14=AB(F,F,F) and A24=AB(F,F,A14). Next, the two trees AB(F,F,A24) and AB(F,A14,A14) are not α-optimal for α>50/2473, cf. (R5), whence they do not occur as rooted subtrees of optimal trees by Proposition 4.15 and Lemma 5.6. A further six cases have to be checked, but none of these is an α-optimal tree for any α0, cf. again (R5). □

Lemma 5.17

Let T{T2,T3,T6,2} be an optimal tree containing a light vertex and A be a rooted subtree of T of type A . Then A is a leaf or

ρ(A)23,

where equality holds if and only if A does not contain a light vertex.

Proof

We prove the assertion by induction on the order of A. For |A|>1, we have A=AB(A1,,Ak) for suitable rooted trees A1,,Ak. By Lemma 5.15, we have k{2,3}.

If k=2, then ρ(A1)+ρ(A2)1+12 where equality holds if and only if both A1 and A2 are leaves, i.e., A does not contain a light vertex.

If k=3, then none of A1, A2, A3 is a leaf by Lemma 5.15, so the induction hypothesis yields ρ(A1)+ρ(A2)+ρ(A3)2 with equality if and only if none of A1, A2, A3 contains a light vertex. In both cases, we get

ρ(A)=11+1ρ(A1)++ρ(Ak)11+12=23.

 □

We are now ready to prove local condition LC4.

Proposition 5.18

Let T be an optimal tree containing a light vertex. Then T{T2,T3,T6,2,T10} . In other words, all optimal trees except T2,T3,T6,2,T10 fulfil LC1–LC4.

Proof

We assume that T{T2,T3,T6,2}. As the tree in Fig. 17 with root r is not α-optimal for α<7/6, cf. (R6), and any vertex of degree 3 is adjacent to at least one leaf by Lemma 5.14,

rooted subtrees S of type A of T containing a light vertex and no vertex of degree 4 are isomorphic to A7=AB(L,F)orA10=AB(L,A7). (5.3)

By (R7), Lemma 5.17, Proposition 4.15 and the fact that T contains a light vertex, we see that

neither B(L,A14) nor B(L,A24) occurs as a rooted subtree of T. (5.4)

Let now S be a rooted subtree of type A of T containing a vertex of degree 4 and a light vertex. We choose S in such a way that its order is minimal among all rooted subtrees with these properties. We write S=AB(S1,,Sd) for some d{2,3} (by Lemma 5.15).

We first consider the case d=2. By Lemma 5.15, we have S1=L. As S contains a vertex of degree 4, so does S2. By minimality of S, S2 does not contain a light vertex. By Lemmata 5.16 and 5.17, we have ρ(S2)=2/3 and therefore S2{A14,A24}, contradiction to (5.4).

Thus we are left with the case d=3. By the minimality of S, each of the Sj either contains a light vertex and does not contain a vertex of degree 4, whence Sj{A7,A10} by (5.3), or does not contain a light vertex, whence Sj{F,A14,A24} by Lemmata 5.16 and 5.17.

We first consider the case that one of S1, S2, S3, say S1, is an A10. Then T can be decomposed as in Fig. 18(a) for some rooted tree S0 of type A. As ρ(S0)+ρ(S3)>ρ(L)=1, Lemma 4.21(2) yields ρ(S2)ρ(A7)<2/3. Analogously, we get ρ(S3)<2/3. Thus we have {S2,S3}{A7,A10}. By (R8), B(S1,S2,S3) is not ρ(S0)-optimal, contradiction to Proposition 4.15.

So we are left with the case that {S1,S2,S3}{F,A7,A14,A24}. As S contains a light vertex, we may assume that S1=A7. If S2{A14,A24}, we note that switching S2 and the F of S1=A7=AB(L,F) yields the tree T shown in Fig. 18(b) with m(T)=m(T) by Lemma 4.18. But T is not optimal by (5.4). We conclude that {S2,S3}{F,A7}. By (R9), Lemma 5.17 and Proposition 4.15, the only remaining case is (S0,S1,S2,S3)=(L,A7,F,F). This case is ruled out by (R2).

So there is no rooted subtree S of type A containing both a light vertex and a vertex of degree 4. By Lemmata 5.15 and 5.3, T contains a 2-claw. Removing this 2-claw from T yields a rooted tree S of type A of T containing a light vertex. We conclude that S does not contain a vertex of degree 4, thus S{A7,A10} by (5.3). The case S=A7 yields T=T10, the case S=A10 is impossible in view of (R6) (cf. Fig. 17). □

Fig. 17.

Fig. 17

Subtree of T in Proposition 5.18.

Fig. 18.

Decomposition of T and T in Proposition 5.18.

graphic file with name gr18a.jpg

(a) T.

graphic file with name gr18b.jpg

(b) T.

5.3. Upper degree bounds for vertices of type B

We now conclude the proof which shows that almost all optimal trees fulfil LC1–LC6.

If an optimal tree contains a 2-claw, the upper degree bound LC5 for degrees of type B is a consequence of the exchange lemma together with the improved lower bound for ρ(S) for rooted subtrees S of type A obtained by the exclusion of light vertices.

Lemma 5.19

Let T{T2,T3,T6,2,T10} be an optimal tree. If T contains a 2-claw as a rooted subtree, then T fulfils LC1–LC5.

Proof

Assume that there is a vertex w of type B of degree k5. We denote the rooted connected components of Tw by T0,T1,,Tk1, where a 2-claw is contained in T0. Now we combine Lemma 5.8 and Lemma 4.21(3) as before: since ρ(Tk1)+ρ(Tk2)>1=ρ(L) (where L is one of the leaves of the 2-claw contained in T0), we have

ρ(T1)+ρ(T2)++ρ(Tk3)<ρ(L)+0.1153.

As T contains no light vertex by Proposition 5.18, Lemma 5.16 yields

23(k3)<1.1153.

We conclude that k<5, as required. □

We are now left with optimal trees containing 3-claws. The arguments are somewhat similar as in the case of light vertices, except that we now have to deal with “heavy” vertices.

Lemma 5.20

Let T be an optimal tree containing a 3-claw and containing a vertex v of degree 4 which is adjacent to at most 2 leaves.

Then v is adjacent to exactly 2 leaves. Furthermore, each vertex of degree k>4 is adjacent to at most 2 leaves.

Proof

Denote the rooted connected components of Tv by T1,,T4, where we assume that T1 contains a 3-claw and T4 is not a leaf. By Lemma 5.8, we have ρ(T2)+ρ(T3)2, i.e., T2 and T3 are indeed leaves.

We have therefore shown that every vertex of degree 4 is adjacent to 2 or 3 leaves.

Assume that w is a vertex of degree k5 in T. Still denoting the rooted connected components of Tv by T1, T2=L, T3=L, T4, we may now assume that w is contained in T4 (T4 also contains a 3-claw, since otherwise Lemma 5.19 would apply, so we can interchange the roles of T1 and T4 if necessary). The rooted connected components of Tw are denoted by S0,S1,,Sk1 with the assumption that v is contained in S0 and ρ(S1)ρ(S2)ρ(Sk1). Combining Lemmata 5.8 and 4.21(3) again (with v and w as pivotal vertices and ρ(Sk1)>0.1153), we get

ρ(S1)+ρ(S2)+ρ(S3)ρ(S1)+ρ(S2)++ρ(Sk2)ρ(T1)+ρ(T2)+ρ(T3)<3,

which implies that ρ(S3)<1 and therefore w is adjacent to at most 2 leaves. □

The lower bound in the following lemma is the same as in Lemma 5.16; but instead of considering a rooted subtree of an optimal tree, we only assume LC1–LC4.

Lemma 5.21

Let T be a tree fulfilling LC1–LC4 and A be a rooted subtree of type A . Then

ρ(A)2/3.

Proof

Analogous to Lemma 5.16. □

We will exclude the occurrence of three or more 3-claws in an optimal tree by substituting two 3-claws by 2-claws and use the additional vertices in order to create a light vertex. To make this work, we have to analyse the effects of these substitutions. As the intermediate steps do not necessarily lead to optimal trees, we can only use the above bound.

Lemma 5.22

Let T be a tree fulfilling LC1–LC4.

  • (1)

    If one 3-claw in T is replaced by a 2-claw, we have m(T)/m(T)8/11 and |T|=|T|1 for the resulting tree T . Furthermore T fulfils LC1–LC4.

  • (2)

    If one 3-claw in T is replaced by a B(L,F), we have m(T)/m(T)21/11 and |T|=|T|+2 for the resulting tree T.

Proof

Let T consist of a 3-claw, a rooted tree S of type A and the edge connecting the root of the 3-claw and the root of S. We have ρ(S)2/3 by Lemma 5.21.

  • (1)
    By Lemma 4.4, we have
    m(T)m(T)=1m0(S)+2m(S)1m0(S)+3m(S)=ρ(S)+2ρ(S)+3=11ρ(S)+31123+3=811.
  • (2)

    Analogous. □

We now deal with optimal trees containing a 3-claw. The restrictions are now so strict that we can discuss all cases.

Proposition 5.23

Let T be an optimal tree which contains a 3-claw. Then T{T5,T8,T9,T12,T13,T16,T20}.

Proof

Assume that T contains 3 rooted subtrees isomorphic to a 3-claw. Replacing two of them by a 2-claw and the third by a B(L,F) yields a tree T with |T|=|T| and

m(T)m(T)8118112111>1

by Lemma 5.22, contradiction.

We conclude that T contains at most 2 rooted subtrees isomorphic to a 3-claw.

We now assume that T has a vertex v of degree k5. By Lemma 5.19, T does not contain a 2-claw and thus no vertices of degree 3 by Proposition 5.18. Thus every rooted subtree of Tv either contains a 3-claw or is a leaf by Lemmata 5.3, 5.4, 5.9 and 5.12. As there are at most 2 subtrees isomorphic to a 3-claw in T, there are at least k2 leaves. By Lemmata 5.4 and 5.12, we conclude that v is adjacent to exactly k2 leaves. Denote the rooted connected components of Tv by T1,T2,L,,L. By Lemma 5.8, we have

k2=ρ(L)++ρ(L)3,

i.e., k=5. By Lemma 5.20, we conclude that all vertices of degree 4 in T are adjacent to 3 leaves, i.e., they are the root of a 3-claw.

Thus T is of the shape given in Fig. 19(a) for some k1 (as there is no vertex of degree 5 for k=0). As the tree in Fig. 19(b) with root r is not α-optimal for α>1/2, cf. (R10), we conclude from Proposition 4.15, Lemmata 5.5 and 5.9 that this tree does not occur as a subtree of an optimal tree. Thus T has no vertex of degree 5, i.e., T fulfils LC1–LC5.

So by Lemma 5.20, every vertex of T of type B is adjacent to 2 or 3 leaves and has degree 3 or 4. We conclude that T=T5 or it is a caterpillar tree of the shape given in Fig. 20 for some k0 and some {2,3}. We note that the tree in Fig. 21 with root r is not α-optimal for α>2/17, cf. (R10), thus this tree does not occur as a rooted subtree of an optimal tree by Proposition 4.15 and Lemma 5.5. This implies that k2 or (k,)=(3,2). For (k,)=(2,3), the resulting tree is not optimal, cf. (R3). The remaining cases correspond to T{T8,T9,T12,T13,T16,T20}. □

Fig. 19.

Shape of T in Proposition 5.23.

graphic file with name gr19a.jpg

(a).

graphic file with name gr19b.jpg

(b).

Fig. 20.

Fig. 20

Shape of T in Proposition 5.23.

Fig. 21.

Fig. 21

Subtree of T in Proposition 5.23.

We are now able to prove Theorem 5.2.

Proof of Theorem 5.2

As TS, T has no light vertex by Proposition 5.18 and fulfils LC1–LC4. By Proposition 5.23, T contains no 3-claw as a rooted subtree, so T fulfils LC1–LC4 and LC6. By Lemmata 5.3, 5.4, 5.9, 5.12 and LC6, T contains a 2-claw and fulfils LC5 by Lemma 5.19. □

We conclude this section with refined bounds on ρ for subtrees of type A of optimal trees. The bounds only depend on the LC.

Lemma 5.24

Let T be a tree fulfilling the LC and let A be a rooted subtree of T of order >1 and type A . Then

23ρ(A)<310.7321.

If no branch of the unique branch of A is a leaf, then

ρ(A)<0.688.

Proof

We prove the result by induction on the order of A. By the LC, we have A=AB(T1,,Tr) for some r{2,3} and rooted trees T1,,Tr. If r=2, then both T1 and T2 are leaves by the LC and A=F with ρ(F)=2/3.

We now consider the case r=3. By the LC, there are at most two leaves among T1, T2, T3. Thus 2ρ(T1)+ρ(T2)+ρ(T3)<2+(31) by the induction hypothesis. We obtain

23=11+12ρ(A)=11+1ρ(T1)+ρ(T2)+ρ(T3)<11+11+3=1+32+3=31.

If none of T1, T2, T3 is a leaf, we use the upper bound ρ(T1)+ρ(T2)+ρ(T3)<3(31) to obtain

ρ(A)=11+1ρ(T1)+ρ(T2)+ρ(T3)<11+13(31)=213323<0.688.

 □

If there is a vertex of type B and degree 4 which is adjacent to two leaves (e.g., in a CL) in an optimal tree, this has consequences to every vertex of type B, as the following lemma shows.

Lemma 5.25

Let TS be an optimal tree. If there is a vertex v of degree 4 of T which is adjacent to two leaves, then every vertex w of T of type B is adjacent to at least one leaf.

Proof

We assume the contrary and denote the rooted connected components of Tw by S0, S1, S2, S3 where v is contained in S0 and the rooted connected components of Tv by T0, T1, L, L with w contained in T0. As 1=ρ(L)>ρ(S3), Lemmata 5.8 and 4.21(3) yield

531+ρ(T1)ρ(S1)+ρ(S2)+0.1153<2(31)+0.1153<1.58,

a contradiction. □

6. The upper bound: global structure

6.1. Outline graph

Now we start with the discussion of the global structure of optimal trees. Let us first collect a few results on the outline graph of an optimal tree.

Lemma 6.1

Let TS be an optimal tree of order n and T its outline graph as defined in Definition 3.2. Then T has the following properties.

  • (1)

    The leaves of T correspond to rooted subtrees of type A of T, the non-leaves of T correspond to vertices of type B of T.

  • (2)
    If there is a vertex of degree 3 in T, then n0(mod7), T has the shape as given inFig. 4(e) and
    k=n77.
  • (3)

    There is no vertex v in T which is adjacent to an L and an F.

  • (4)

    If T is of order 1, then n1(mod7) and T=C(n1)/7L or n4(mod7) and T=C(n4)/7F.

Proof

  • (1)

    By construction, all special leaves of T correspond to rooted subtrees of type A of T. A leaf in T is either contained in some larger special leaf or is eventually seen as an L in T. All non-leaves of T of type A are either contained in some larger special edge or special leaf or they are transformed into a C0. Thus all non-leaves of T have to correspond to vertices of type B of T.

  • (2)

    We denote the vertex of degree 3 by v. By Theorem 5.2, two of the neighbours of v are leaves. Thus we have one of the situations in Fig. 22 for appropriate k0 and rooted trees T1, T2, T3. The first case is a contradiction to the construction of the outline, as a subtree CkF would have been contracted earlier than the Ck. The second case is also not a correct outline, as this graph is isomorphic to a CkF (use the L in the present CkL as the new root).

    So we are left with the third case. In this case, we have n=|T|=3+7k+4, which immediately implies n0(mod7) and k=(n7)/7. And this is exactly the situation in Fig. 4(e).

  • (3)
    We assume that there is a vertex v in T which is adjacent to an L and an F. This could mean one of the situations in Fig. 23, where Sj{L,F} for j{0,1,2}.
    • (a)
      In the situation in Fig. 23(a), a Ck1+k2+1 would have been used in the outline of T instead of v, L, F, Ck1 and Ck2.
    • (b)
      In the situation in Fig. 23(b), a Ck1+k2+1S0 would have been used in the outline of T instead of v, L, F, Ck1 and Ck2S0.
    • (c)
      We consider the situation in Fig. 23(c).
      If S1=S2=F, the graph is isomorphic to the graph in Fig. 4(e), i.e., a Ck1+k2+1F would have been combined with a vertex of degree 3 and two leaves in the outline of T.
      If S1=L, the graph is isomorphic to Ck1+k2+1S2, and this would have been taken in the outline of T.
  • (4)

    If T is of order 1, then the unique vertex of T must be a CkF or a CkL for a suitable k. In the first case, we have n=7k+4, in the second n=7k+1. □

Fig. 22.

Fig. 22

Possible cases in Lemma 6.1.

Fig. 23.

Fig. 23

Possible cases in Lemma 6.1.

6.2. Chains

In the global structure of optimal trees, chains as introduced in Definition 3.1 occur prominently. This subsection is devoted to the computation of the relevant parameters and to some further necessary optimality criteria in relation to chains. Recall that the growth constant λ10.1097722286464 in Theorem 1.2 is defined as the larger root of the polynomial x211x+9. In the following, the other root of this polynomial is denoted by λ¯0.890227771353556.

Lemma 6.2

  • (1)
    Let T be a rooted tree of type A . Then
    (m(CT)m0(CT))=M(m(T)m0(T)),ρ(CT)=σ(ρ(T)),
    with
    M=(8353),σ:R+R+;x138+3x.
  • (2)
    We have
    m(CkL)=Gk+1,m0(CkL)=Gk+13Gk,
    m(CkF)=3Gk+13Gk,m0(CkF)=2Gk+1Gk,withGk=λkλ¯kλλ¯.
  • (3)

    Let ρlim=(λ8)/30.7032574 and x>0 . If x<ρlim, then the sequence σk(x) is strictly increasing, if x>ρlim, then the sequence σk(x) is strictly decreasing. In both cases, limkσk(x)=ρlim.

    In particular, ρ(CkF) is strictly increasing and ρ(CkL) is strictly decreasing.

Proof

  • (1)

    This is a straightforward consequence of the recursive formulæ (4.4) and (4.5) for m, m0 and ρ.

  • (2)

    The eigenvalues of M are λ and λ¯. Thus the sequences m(CkL), m0(CkF), m(CkF), m0(CkF) for k0 are elements of the linear space spanned by λk and λ¯k. Another basis of this linear space is given by Gk+1 and Gk. It therefore suffices to check the formulæ for k=0 and k=1.

  • (3)

    It is easily checked that ρlim is the unique positive fixed point of σ. The assertions on σk(x) are easy consequences of the definition of σ. Finally, the assertions on ρ(CkF) and ρ(CkL) follow from ρ(F)=2/3 and ρ(L)=1. □

Next we show that an L and a CkF never occur as neighbours of the same vertex in the outline of an optimal tree.

Lemma 6.3

Let TS be an optimal tree, v a vertex of degree 4 and T1, T2, T3, T4the rooted connected components of Tv . We assume that T1=L and T2=CkF for some k0 . Then F{T2,T3,T4} . In particular, v is not in the outline of T.

Proof

We assume k>0. Without loss of generality, we also assume that ρ(T3)ρ(T4). The tree T has the shape shown in Fig. 24(a). If ρ(T3)>ρ(F)=2/3, we have ρ(L)+ρ(T4)ρ(L)+ρ(F) by Lemma 4.21(2), which implies that ρ(T4)=2/3 and ρ(T3)=2/3 by Lemma 5.24, contradiction.

Therefore, we have ρ(T3)=2/3, thus T3{F,A14,A24} by Lemma 5.16. In the case T3=F, Lemma 6.1 yields the result. Thus we are left with T3{A14,A24}.

We consider the tree T where F and T3 have been exchanged, cf. Fig. 24(b). From Lemma 4.18 we conclude that m(T)=m(T), i.e., T is also an optimal tree. We rewrite T as in Fig. 24(c). For T3{A14,A24}, the rooted tree S is not α-optimal for any α>1/3, cf. (R11). As ρ(Ck1T4)2/3 by Lemma 5.24, this is a contradiction to Proposition 4.15 and the optimality of T. □

Fig. 24.

Trees T and T considered in Lemma 6.3.

graphic file with name gr24a.jpg

(a) T.

graphic file with name gr24b.jpg

(b) T.

graphic file with name gr24c.jpg

(c) T.

We now prove a necessary optimality condition involving one chain element.

Lemma 6.4

Let TS be an optimal tree, v a vertex of type B and degree 4 of T and CS1, S2, S3, S4the rooted connected components of Tv for some rooted trees S1, S2, S3, S4.

  • (1)
    If neither S3 nor S4 is a leaf, then
    ρ(S1)ρ(S2).
  • (2)
    If S3 is a leaf and ρ(S4)>2/3, then
    ρ(S1)ρ(S2).

Proof

If ρ(S1)>ρ(S2), then 5/3=ρ(L)+ρ(F)ρ(S3)+ρ(S4) by Lemma 4.21(2), i.e., S3 or S4 is a leaf by Lemma 5.24.

If ρ(S1)<ρ(S2), then 5/3ρ(S3)+ρ(S4) by Lemma 4.21(2), i.e., either both S3 and S4 are non-leaves or S3, say, is a leaf and ρ(S4)2/3.

The contrapositions are the statements of the lemma. □

The following lemma lists some consequences of this result.

Lemma 6.5

Let TS be an optimal tree, v a vertex of degree 4 and T0, T1, T2, T3the rooted connected components of Tv.

  • (1)

    Let 4 and let Tj=CkjSj for some kj0 and some rooted tree Sj with ρ(Sj)<7/10 for j{0,,1} . Further assume that Tj is not a leaf for j{,,3} and that k0k1.

    Then k1k0+1 and, if kj<kj+1 for some 0j<1, then ρ(Sj)ρ(Sj+1).

  • (2)

    If T0=L, T1=Ck1S1, T2=Ck2S2 with k1, k20, ρ(Cj+1L)<ρ(Sj)ρ(CjL) with j{0,1} for j{1,2} and finally ρ(T3)>2/3, then we have k2+2k1+1+1.

    Furthermore, if σ1(ρ(S1))>σ2(ρ(S2)), then k2max{0,k1+12}.

Proof

  • (1)
    Assume that k1>k0+1. Then
    ρ(Ck11F)ρ(Ck11S1)ρ(Ck0S0)<ρ(Ck0+1F)
    by the monotonicity of σ, Lemmata 5.24, 6.4 and the fact that σ(2/3)=7/10. We conclude that k11<k0+1, i.e., k1k0+1, contradiction. Thus k1k0+1.
    If kj<kj+1, then kj+1=kj+1. From Lemma 6.4 we see that
    ρ(CkjSj+1)=ρ(Ckj+11Sj+1)ρ(CkjSj),
    which yields ρ(Sj+1)ρ(Sj) in view of the monotonicity of σ.
  • (2)
    Assume that k2>0. By Lemma 6.4, we have
    ρ(Ck2+21L)ρ(Ck21S2)ρ(Ck1S1)>ρ(Ck1+1+1L),
    which yields k2+21<k1+1+1 by Lemma 6.2 and therefore k2+2k1+1+1. If k2=0, then the same inequality holds trivially as 2 has been assumed to be at most 1.
    Now we turn to the second assertion and assume k2>0 and σ1(ρ(S1))>σ2(ρ(S2)). We have
    σk21(ρ(S2))=ρ(Ck21S2)ρ(Ck1S1)=σk1+1(σ1(ρ(S1)))>σk1+12(ρ(S2))
    by Lemma 6.4, which yields k21<k1+12, i.e., k2k1+12, as required. For k2=0, there is nothing to show. □

6.3. Switching forks and leaves

So far, we mainly compared optimal trees to trees where some subtrees have been switched between two positions. It turns out that more invasive operations are needed in order to obtain information on the global structure of optimal trees.

The basic idea is the following: if 7 forks are replaced by 7 leaves, the order of the tree is reduced by 7⋅3=21. As a chain element requires 7 vertices, these 21 “free” vertices can be used to introduce 3 chain elements. If all this is done at the “right” positions, then m(T) increases. In some circumstances, however, the inverse operation may be beneficial.

Before we state the main lemma regarding such exchange operations, we collect two technical details concerning the floor function in the following lemma.

Lemma 6.6

  • (1)
    For real x and positive integers d, the identity
    dx=j=0d1x+jd
    holds.
  • (2)
    Let d0 and k0,,kd1 be integers with
    k0k1kd1k0+1.
    Then
    kj=k+jdwith k=k0+k1++kd1
    for j{0,,d1}.

Proof

  • (1)

    Cf. Graham, Knuth and Patashnik [6, (3.26)].

  • (2)
    Choose 1rd such that k0==kr1<krkd1=k0+1. Then k=dk0+(dr) and
    k+jd=k0+d+jrd=k0+[d+jrd]=k0+[jr]=kj
    for j{0,,d1}. □

Now we are able to provide the required exchange operations. Their consequences will be exploited afterwards. The proof relies on similar ideas as the proof of Lemma 5.22.

Lemma 6.7

Let T be a tree fulfilling the LC and let T be a tree that is obtained from T by replacing one rooted subtree S by S, where S and S will be specified below.

  • (1)

    If S=B(Ck0F,Ck1F,Ck2F) with kj=(k+j)/3 for some k0 and S=B(Ck/2+1L,C(k+1)/2+1L,L), then m(T)/m(T)5.211 and |T||T|=5.

  • (2)

    If S=B(Ck0F,Ck1F,Ck2F) with kj=(k+j)/3 for some k0 and S=B(Ck/2+1L,C(k+1)/2L,L), then m(T)/m(T)0.5154 and |T||T|=2.

  • (3)

    If S=B(Ck0F,Ck1F,Ck2F) with kj=(k+j)/3 for some k0 and S=B(CkF,F,L), then m(T)/m(T)0.3726 and |T||T|=3.

  • (4)
    If
    S=B(Ck+(j+1)/3F,Ck+(j+2)/3F,CkAB(Ck+(i+1)/4F,Ck+(i+2)/4F,Ck+(i+3)/4F))
    and S=B(L,F,C2k+i+jAB(C2k+1L,C2k+1L,L)) for some k0, i{0,1,2,3}, j{0,1,2}, then m(T)/m(T)1.943 and |T||T|=2.
  • (5)

    If S=B(L,Ck/2L,C(k+1)/2L) and S=B(L,F,Ck1F) for some k1, then m(T)/m(T)0.722 and |T||T|=1.

  • (6)

    If S=B(L,Ck/2L,C(k+1)/2L) and S=B(C(k1)/3F,Ck/3F,C(k+1)/3F) for some k1, then m(T)/m(T)27/14 and |T||T|=2 . If k2, then m(T)/m(T)1.9302.

  • (7)

    If S=B(Ck+1+tL,L,CkAB(Ck+1+(s+1)/3L,Ck+1+(s+2)/3L,L)) and S=B(F,L,Ck1+(s+t+2)/4AB(Ck+(s+t+3)/4F,Ck+(s+t+4)/4F,Ck+(s+t+5)/4F)) for some k1, s{0,1,2}, t{0,1}, then m(T)/m(T)0.5181 and |T||T|=2.

  • (8)
    If
    S=B(CtL,L,AB(Cs/2L,C(s+1)/2L,L))
    and S=B(C(s+t1)/3F,C(s+t)/3F,C(s+t+1)/3F) for some s{1,2,3,4} and t{0,1,2}, then m(T)/m(T)0.516 and |T||T|=2.
  • (9)

    If S=B(CtL,L,AB(Cs/2L,C(s+1)/2L,L)) and S=B(L,F,Cs+tF) for some s{1,2,3,4} and t{0,1,2}, then m(T)/m(T)1.95 and |T||T|=2.

Proof

  • (1)
    Let T consist of some rooted subtree T1, S and the edge between the roots of T1 and S. Then from Lemma 4.4 we obtain
    m(T)m(T)=m0(S)m0(S)ρ(T1)+ρ(Ck/2+1L)+ρ(C(k+1)/2+1L)+1ρ(T1)+ρ(Ck0F)+ρ(Ck1F)+ρ(Ck2F)=m0(S)m0(S)(1+ρ(Ck/2+1L)+ρ(C(k+1)/2+1L)+1ρ(Ck0F)ρ(Ck1F)ρ(Ck2F)ρ(T1)+ρ(Ck0F)+ρ(Ck1F)+ρ(Ck2F))m0(S)m0(S)(1+ρ(Ck/2+1L)+ρ(C(k+1)/2+1L)+1ρ(Ck0F)ρ(Ck1F)ρ(Ck2F)31+ρ(Ck0F)+ρ(Ck1F)+ρ(Ck2F))
    where
    ρ(Ck/2+1L)+ρ(C(k+1)/2+1L)+1>3ρlim>ρ(Ck0F)+ρ(Ck1F)+ρ(Ck2F),
    cf. Lemma 6.2 and Lemma 5.24 have been used. From Lemmata 6.2 and 6.6, we get
    m(T)m(T)λ4(λλ¯)(3λ3)3(1+O(qk/3))31+2ρlim(1+O(qk/2))+131+3ρlim(1+O(qk/3))5.21101232
    with q=λ¯/λ0.088, where we replaced the explicit formulæ obtained from Lemma 6.2 by asymptotic expansions for ease of presentation; the actual computations leading to the given constant have been performed exactly—in this particular case, it even turned out that the whole expression was strictly decreasing in k. The explicit branch L of S has been taken into account exactly instead of using Lemma 6.2.

    We have |S|=1+7(k0+k1+k2)+34=13+7k and |S|=1+7(2+k/2+(k+1)/2)+3=18+7k by Lemma 6.6.

  • (2)

    Analogous.

  • (3)

    Analogous.

  • (4)

    Analogous.

  • (5)

    Analogous, but the lower bound ρ(T1)2/3 (Lemma 5.24) has to be used, as ρ(L)+ρ(F)+ρ(Ck1F)<1+2ρlim<ρ(L)+ρ(Ck/2L)+ρ(C(k+1)/2L).

  • (6)

    Analogous.

  • (7)

    Analogous.

  • (8)

    Analogous, but simpler, as this is a finite case and no limits have to be considered.

  • (9)

    Analogous. □

Remark 6.8

The precise proof of Lemma 6.7 has been carried out using Sage [20]. The program is available in [11].

6.4. CL-free optimal trees

Throughout this subsection, we assume that TS is an optimal tree which is CL-free, i.e., it does not contain a CL as a rooted subtree. Obviously, such a tree does not contain any CkL as rooted subtree for k1. We will describe all optimal trees with this property.

Lemma 6.9

Let TS be a CL-free optimal tree and v be a vertex of degree 4 in the outline graph of T which is adjacent to at least three “special leaves” T0, T1, T2 with |T0||T1||T2| . Then there is a k0 such that Tj=CkjF with kj=(k+j)/3 for j{0,1,2}.

Proof

As T is CL-free, TjCkjL for any kj>0 and j{0,1,2}. So for every j we have Tj=L or Tj=CkjF for some kj. As T is optimal, the case T0=T1=T2=L is excluded by LC6. The cases T0=T1=L, T2=Ck2F and T0=L, T1=Ck1F, T2=Ck2F are excluded by Lemma 6.3. Thus Tj=CkjF for some kj0 for all j. As v is in the outline of T, it is not adjacent to a leaf by Lemma 6.3. Thus Lemma 6.5(1) (with =3) and Lemma 6.6 prove the assertion. □

Lemma 6.10

Let TS be a CL-free optimal tree. Then there are no three distinct vertices in the outline of T such that each of them is adjacent to three “special leaves”.

Proof

Assume that there are three distinct vertices v0, v1, v2 in the outline of T such that vi is adjacent to Cki0F, Cki1F, Cki2F with kij=(ki+j)/3 for some ki0 and j{0,1,2}. By Lemma 6.9 this is the only case to consider.

Replacing the rooted subtree with root v0 and branches Ck00F, Ck01F, Ck02F by a rooted subtree with root v0 and branches Ck0/2+1L, C(k0+1)/2+1L, L, cf. Lemma 6.7(1), yields a tree T which is not necessarily optimal, but fulfils the LC.

Replacing the rooted subtree with root v1 and branches Ck10F, Ck11F, Ck12F in T by a rooted subtree with root v1 and branches Ck1/2+1L, C(k1+1)/2L, L, cf. Lemma 6.7(2), yields a tree T, which still fulfils the LC.

Replacing the rooted subtree with root v2 and branches Ck20F, Ck21F, Ck22F in T by a rooted subtree with root v2 and branches Ck2F, F, L, cf. Lemma 6.7(3), yields a tree T.

Lemma 6.7(1, 2 and 3) yields |T||T|=523=0 and

m(T)m(T)=m(T)m(T)m(T)m(T)m(T)m(T)5.2110.51540.3726>1.0007,

thus m(T)>m(T), contradiction to the optimality of T. □

Lemma 6.11

Let TS be a CL-free optimal tree of order n and u, v two distinct vertices of degree 4 in the outline graph of T which are adjacent to three special leaves.

Then n6(mod7) and T is of the shape given in Fig. 4(d) where

kj=n27+7j49

for 0j6 or (k0,k1,k2,k3,k4,k5,k6)=(1,0,0,0,0,0,0).

Proof

By Lemma 6.1, the outline of T does not contain a vertex of degree 3. By Lemma 6.10, the outline graph of T is a caterpillar tree. By Lemma 6.9, it must be of the shape shown in Fig. 25 for some s0, some non-negative integers k0,,k8 and some special leaves Tj, Tj for j{1,,s}. As T is CL-free, Tj is a leaf or a CjF and Tj is a leaf or a CjF for suitable j, j0 and j{1,,s}.

Assume that s>0. By Lemma 5.25, it is impossible that both T1=L and T1=L, as u is not adjacent to a leaf. By Lemma 6.3 we conclude that T1=C1F and T2=C1F for some 10, 10. Lemma 5.24 implies that ρ(S1)0.688 and ρ(S2)0.688.

W.l.o.g. we assume k1k3k5. We claim that

k0k1k3k5k0+1. (6.1)

If ρ(S2)>2/3=ρ(F) or k0=0, this follows from Lemma 6.5(1). So we consider the case that ρ(S2)=2/3 and k0>0. By Lemma 5.16 and the shape of T as shown in Fig. 25, this implies that S2=A24. In particular, we have T1=T1=F, i.e., 1=1=0. As 1<k0, we have 2/3=ρ(T1)ρ(S1) and k01 by Lemma 6.5(1). Thus we have S1=A14 by Lemma 5.16 and k0=1. Then T=AB(F,A14,A24) which is not optimal by (R5). This concludes the proof of (6.1).

W.l.o.g we assume 11. If ρ(S1)=2/3, then S1=A14 and k1=k3=k5=0, thus also k0=0 by (6.1). Thus k011k0+1 in this case by Lemma 6.5(1). If ρ(S1)>2/3=ρ(F), we get the same estimate k011k0+1 from Lemma 6.5(1).

Replacing S=B(T1,T1,Ck0AB(Ck1F,Ck3F,Ck5F)) in T by S as in Lemma 6.7(4) yields a tree T fulfilling the LC.

Replacing B(Ck2F,Ck4F,Ck6F) in T as in Lemma 6.7(2) yields a tree T.

By Lemma 6.7(2 and 4), we have |T|=|T| and

m(T)m(T)=m(T)m(T)m(T)m(T)0.51541.943>1.001,

contradiction to the optimality of m(T).

Thus we have shown that s=0, i.e., T is of the shape given in Fig. 4(d). We set k=k0+k1+k2+k3+k4+k5+k6 and have n=7k+64+3=27+7k and in particular n6(mod7).

We set a=k1+k3+k5 and b=k2+k4+k6. Without loss of generality we assume ab. By Lemmata 6.5(1) and 6.6, we have

k1=a3,k3=a+13,k5=a+23,k2=b3,k4=b+13,k6=b+23. (6.2)

If ba+2, we obtain k6(a+4)/3=k3+1, thus ρ(Ck6F)>ρ(Ck3F), and Lemmata 4.21(2) and 6.2 yield

ρ(Ck1F)+ρ(Ck5F)<ρ(C(a+2)/3F)+ρ(C(a+3)/3F)ρ(Cb/3F)+ρ(C(b+1)/3F)=ρ(Ck2F)+ρ(Ck4F)ρ(Ck1F)+ρ(Ck5F),

a contradiction. We conclude that aba+1. From (6.2) we immediately conclude that k1k2k3k4k5k6k1+1 holds in both cases.

From Lemmata 6.5(1) and 5.24 we see that k0k1+1 and k6k0+1.

If b=0, we therefore obtain

(k0,k1,k2,k3,k4,k5,k6){(0,0,0,0,0,0,0),(1,0,0,0,0,0,0)},

where we have T=T27 in the first case and T=T34,2 in the second case.

If b>0, then ρ(S2)>23=ρ(F) and thus k0k1 by Lemma 6.5(1). Thus we have

k0k1k2k3k4k5k6k0+1,

and therefore kj=(k+j)/7 for j{0,1,2,3,4,5,6} by Lemma 6.6. We have k1, the case k=1 corresponds to T=T34,1. Indeed, m(T34,1)=m(T34,2). □

Fig. 25.

Fig. 25

Decomposition of T for Lemma 6.11.

Proposition 6.12

Let TS be a CL-free optimal tree of order n>1 . Then n0(mod7), n3(mod7) or n6(mod7) and T has the shape described in Theorem 3.3 for these congruence classes.

Proof

Let T be the outline of T. If T has a vertex of degree 3, Lemma 6.1 yields the required result.

If T has at least two vertices of degree 4, then there are at least two vertices of degree 4 which are adjacent to at least 3 special leaves. In this case, Lemma 6.11 yields the required result.

We now consider the case that T has exactly one vertex v of degree 4. Its neighbours are special leaves T0, T1, T2, T3, where each Tj is either an L or a CkjF for 0j3. We assume that ρ(T0)ρ(T1)ρ(T2)ρ(T3). The case T0=T1=T2=T3=L corresponds to T=T5S. Then by Lemma 6.3, we cannot have a leaf, so Tj=CkjF for 0j3. From Lemmata 6.5(1) and 6.6, we obtain that kj=(k+j)/4 for 0j3 and k=k0+k1+k2+k3. We have

n=|T|=1+44+7(k0+k1+k2+k3)=17+7k.

We conclude that n3(mod7) and obtain

kj=n17+7j28

and of course, T has the shape given in Fig. 4(b).

Finally we consider the case that T has order 1. This case is covered by Lemma 6.1. Both graphs mentioned in this lemma (C(n1)/7L and C(n4)/7F) contain a CL except for T=T1. □

6.5. Optimal trees containing CL

This final subsection is devoted to optimal trees TS containing a CL as a rooted subtree. By Lemma 5.25, every vertex of type B of such a tree is adjacent to a leaf. By Lemma 6.3, T does not contain any rooted subtree of the shape CkF for k>0.

Lemma 6.13

Let TS be an optimal tree containing CL as a rooted subtree and v a vertex of degree 4 in the outline of T which is adjacent to 3 special leaves T0, T1, L with |T0||T1| . Then there is a k1 such that Tj=C(k+j)/2L for j{0,1}.

Proof

Let T0, T1, T2, L denote the rooted connected components of Tv. None of them is a CkF for k0 by Lemmata 6.3 and 5.25. Thus T0=Ck0L and T1=Ck1L for suitable k0, k10. As A14 and A24 cannot be rooted subtrees of T by Lemma 5.25, we have ρ(T2)>2/3 by Lemma 5.16. Thus Lemma 6.5(2) can be used to see that k0k1k0+1. With k=k0+k1, Lemma 6.6 and LC6, the desired result follows. □

Lemma 6.14

Let TS be an optimal tree containing CL as a rooted subtree. Then there are no three distinct vertices in the outline of T such that each of them is adjacent to three special leaves.

Proof

Assume that there are three distinct vertices v0, v1, v2 in the outline of T such that vi is adjacent to Cki0L, Cki1L, L with kij=(ki+j)/2 for some ki1 and j{0,1}. By Lemma 6.13 this is the only case to consider.

Replacing the rooted subtree with root v0 and branches Ck00L, Ck01L, L by a rooted subtree with root v0 and branches L, F, Ck01F, cf. Lemma 6.7(5), yields a tree T, which does not have to be optimal, but fulfils the LC.

Replacing the rooted subtree with root v1 and branches Ck10L, Ck11L, L in T by a rooted subtree with root v1 and branches L, F, Ck11F, cf. Lemma 6.7(5), yields a tree T, which still fulfils the LC.

Replacing the rooted subtree with root v2 and branches Ck20L, Ck21L, L in T by a rooted subtree with root v2 and branches C(k21)/3F,Ck2/3F,C(k2+1)/3F, cf. Lemma 6.7(6), yields a tree T.

Lemma 6.7(5 and 6) yields |T||T|=11+2=0 and

m(T)m(T)=m(T)m(T)m(T)m(T)m(T)m(T)0.7220.7222714>1.005,

thus m(T)>m(T), contradiction to the optimality of T. □

Next we need better bounds on the ρ-values of subtrees of type A which are visible in the outline of an optimal tree.

Lemma 6.15

Let TS be an optimal tree containing a CL, v be a vertex of degree 4 in the outline of T and L, T1, T2, T3 be the rooted connected components of Tv . We assume that ρ(T1)ρ(T2) and set =[|T1|>1] . We further set S=AB(L,T1,T2).

Then

ρ(C1+L)<ρ(S)<ρ(CL).

Proof

We prove the lemma by induction on the order of S.

By Lemma 5.24 and LC6, we have ρ(T1)+ρ(T2)1+(1+)(31).

We write Tj=CkjTj for suitable trees Tj and maximal kj0 for j{1,2}. As the outline of T does not contain a CkF by Lemmata 6.3 and 5.25, we conclude that either Tj is a leaf or we have ρ(Tj)>ρ(C2L) by the induction hypothesis. By Lemma 6.2, we have ρ(Tj)>ρlim in both cases.

We obtain

ρ(C1+L)<11+12+(1+)ρlimρ(S)=11+11+ρ(T1)+ρ(T2)11+12+(1+)(31)<ρ(CL).

 □

Lemma 6.16

Let TS be an optimal tree of order n containing a CL whose outline contains at least two vertices of degree 4. Then n2(mod7) and T is of the shape given in Fig. 4(a) with

k0=max{0,n3735},kj={n2+7j35 if n37,n9+7j35 if n30

for j{1,2,3,4}.

Proof

By Lemma 6.1, the outline of T has no vertex of degree 3. By Lemma 6.14, the outline graph of T is a caterpillar tree. By Lemmata 5.25, 6.13 and 6.3, it must be of the shape shown in Fig. 26 with 0k1k3,0k2k4,0k0,01,,0s, 0k5. By LC6, we have k3>0 and k4>0.

We claim that s=0; let us assume, to the contrary, that s1. Then by Lemma 6.15, we have

ρ(C[1>0]+1L)<ρ(S2)<ρ(C[1>0]L),
ρ(C[k1>0]+1L)<ρ(S1)<ρ(C[k1>0]L).

As σ[1>0]ρ(S2)<ρ(L) and σ[k1>0]ρ(S1)<ρ(L), Lemmata 6.5(2) and 6.13 imply

k0max{0,k1[1>0]},k1k3k0+[1>0]+1,k3k1+1,k0max{0,1[k1>0]},1k0+[k1>0]+1,If k0>0, then k12 and 12 (6.3)

(the last statement following from the two inequalities for k0) and the analogous inequalities

k5max{0,k2[s>0]},k2k4k5+[s>0]+1,k4k2+1,k5max{0,s[k2>0]},sk5+[k2>0]+1,If k5>0, then k22 and s2. (6.4)

Without loss of generality, we may assume k5+k2k0+k1.

We consider 3 cases:

  • (1)
    We assume that k0>0 and k2>0. From (6.3) and Lemma 6.6, we obtain
    k1=k0+1+s+13,k3=k0+1+s+23,1=k0+1+t
    for some s{0,1,2} and some t{0,1}.

    We replace B(L,C1L,Ck0AB(Ck1L,Ck3L,L)) as in Lemma 6.7(7) and obtain a tree T fulfilling the LC.

    Replacing B(Ck2L,Ck4L,L) in T as in Lemma 6.7(6), we obtain a tree T. We have
    m(T)m(T)0.51811.9302>1.00003|T|=|T|,
    a contradiction.
  • (2)

    We assume that k0>0 and k2=0. By (6.4), this implies k5=0 and k4=1.

    We assume first that s2 and replace B(L,C1L,Ck0AB(Ck1L,Ck3L,L)) in T as in Lemma 6.7(7) and obtain a tree T fulfilling the LC.

    We now replace B(L,CsL,AB(Ck2L,Ck4L,L)) in T as in Lemma 6.7(9) and obtain a tree T.

    We conclude that
    m(T)m(T)1.950.5181>1.01,|T|=|T|,
    a contradiction.
    Thus we have s=1. By (6.3), we have 2k1k3. As k4<k1, and thus ρ(Ck4L)>ρ(Ck1L), we obtain
    2=ρ(Ck2L)+ρ(L)ρ(Ck3L)+ρ(L)+0.1153ρ(C2L)+ρ(L)+0.1153<2
    from Lemmata 6.2 and 4.21(3). This is a contradiction.
  • (3)

    We assume that k0=0. By (6.3) this implies k1k32 and 12. Consequently, we have k5+k2k0+k12, thus k5=0, k2k42 and s2 by (6.4).

    We assume first that s2 and replace B(L,C1L,AB(Ck1L,Ck3L,L)) in T as in Lemma 6.7(8) and obtain a tree T.

    We now replace B(L,CsL,AB(Ck2L,Ck4L,L)) in T as in Lemma 6.7(9) and obtain a tree T. We conclude that
    m(T)m(T)1.950.516>1.006,|T|=|T|,
    a contradiction.

    Thus we have s=1. It follows that 27n=13+7(k1+k3+k2+k4+1)83. In each of the possible cases remaining, it turns out that m(T)<m(Tn) for the tree Tn given in Fig. 4(d) (or m(T)<m(T34,1)=m(T34,2) for n=34), contradiction.

So we have shown that s=0 and that T therefore has the shape as in Fig. 4(a). We set k=k0+k1+k2+k3+k4 and obtain n=9+7k and n2(mod7) in particular.

We set a=k1+k3 and b=k2+k4 and assume that ab. From Lemmata 6.5(2) and 6.6, we see that

k1=a/2,k3=(a+1)/2,k2=b/2,k4=(b+1)/2.

If ba+2, we have k4(a+3)/2>k3 and therefore k1=a/2<b/2=k2k1 by Lemmata 4.21(2) and 6.2, a contradiction. Thus b{a,a+1} and k1k2k3k4k1+1 in both cases.

By Lemmata 6.15 and 6.5(2), we have (in analogy to (6.3) and (6.4))

k0max{0,k1[k2>0]},k4k0+[k1>0]+1. (6.5)

If k1=0, we have k0=0 and 0=k0k1k2k3k4k0+1=1 by (6.5) and n30, i.e., kj=(n9+7j)/35 for j{0,1,2,3,4} by Lemma 6.6.

If k1>0, we have n37 and (6.5) yields

k0+1k1k2k3k4k0+2

and therefore kj+[j=0]=(k+1+j)/5 for j{0,1,2,3,4} by Lemma 6.6. □

Fig. 26.

Fig. 26

Decomposition of T for Lemma 6.16.

Proposition 6.17

Let TS be an optimal tree of order n containing a CL as a rooted subtree. Then n1(mod7), n2(mod7), n4(mod7) or n5(mod7) and T has the shape described in Theorem 3.3 for these congruence classes.

Proof

Let T be the outline of T. If T has a vertex of degree 3, Lemma 6.1 shows that T does not contain a CL as a rooted subtree.

If T has at least two vertices of degree 4, then there are at least two vertices of degree 4 which are adjacent to at least 3 special leaves. In this case, Lemma 6.16 yields the required result.

We now consider the case that T has exactly one vertex v of degree 4. Its neighbours are special leaves T0, T1, T2, T3. By Lemma 5.25, we have T3=L after suitable reordering. As v is in the outline of T, we have F{T0,T1,T2}. By Lemma 6.3, we must have Tj=CkjL for some kj0 for j{0,1,2}. Thus T is of the shape given in Fig. 4(c). This yields n=7(k0+k1+k2)+5; in particular n5(mod7).

Without loss of generality, we may assume that k0k1k2. By Lemma 6.5(2), we have

k0k1k2k0+1.

From Lemma 6.6, we conclude that

kj=k0+k1+k2+j3=n57+j3

for 0j2, as required.

Finally we consider the case that T has order 1. This case has been considered in Lemma 6.1. □

Proof of Theorem 3.3

Let T be an optimal tree of order n. If TS, then there are two possibilities: T can be CL-free or it can contain a CL as a rooted subtree. Then Propositions 6.12 and 6.17 respectively show that T has the shape given in Theorem 3.3 with the parameters as given by the theorem. For n{8,9,12,16}, the trees in the exceptional set S still have this shape. For n{6,10,13,20} or n<4, however, it is not possible for a tree of order n to have the shape shown in Fig. 4 (since n is too small). In these cases, the optimal tree has to be an element of the exceptional set S, which gives us a unique optimal tree for n6 and two optimal trees for n=6. Finally, let us remark that the asymptotic formulæ given in Theorem 1.2 follow easily from the structure of the trees by means of Lemma 6.2. □

Acknowledgements

C. Heuberger is supported by the Austrian Science Fund (FWF): S09606, that is part of the Austrian National Research Network “Analytic Combinatorics and Probabilistic Number Theory.” This paper was partly written while C. Heuberger was a visitor at Stellenbosch University.

This material is based upon work supported financially by the National Research Foundation of South Africa under grant number 70560.

Contributor Information

Clemens Heuberger, Email: clemens.heuberger@tugraz.at.

Stephan Wagner, Email: swagner@sun.ac.za.

Appendix. Tables of replacements

See Tables 2 and 3.

References

  • 1.Baxter R.J., Enting I.G., Tsang S.K. Hard-square lattice gas. J. Statist. Phys. 1980;22(4):465–489. [Google Scholar]
  • 2.Bród D., Skupień Z. Trees with extremal numbers of dominating sets. Australas. J. Combin. 2006;35:273–290. [Google Scholar]
  • 3.Bród D., Skupień Z. Recurrence among trees with most numerous efficient dominating sets. Discrete Math. Theor. Comput. Sci. 2008;10(1):43–55. [Google Scholar]
  • 4.Cvetković D.M., Doob M., Sachs H. third ed. Johann Ambrosius Barth; Heidelberg: 1995. Spectra of Graphs. [Google Scholar]
  • 5.Górska J., Skupień Z. Trees with maximum number of maximal matchings. Discrete Math. 2007;307(11–12):1367–1377. [Google Scholar]
  • 6.Graham R.L., Knuth D.E., Patashnik O. second ed. Addison-Wesley; 1994. Concrete Mathematics. A Foundation for Computer Science. [Google Scholar]
  • 7.Gutman I. Graphs with greatest number of matchings. Publ. Inst. Math. (Beograd) (N.S.) 1980;27(41):67–76. [Google Scholar]
  • 8.Gutman I. The energy of a graph: old and new results. (Gößweinstein, 1999); Berlin: Springer; 2001. pp. 196–211. [Google Scholar]
  • 9.Gutman I., Polansky O.E. Springer-Verlag; Berlin: 1986. Mathematical Concepts in Organic Chemistry. [Google Scholar]
  • 10.Heilmann O.J., Lieb E.H. Theory of monomer–dimer systems. Comm. Math. Phys. 1972;25:190–232. [Google Scholar]
  • 11.C. Heuberger, S.G. Wagner, The number of maximum matchings in a tree—online ressources, http://www.math.tugraz.at/~cheub/publications/max-card-matching/. [DOI] [PMC free article] [PubMed]
  • 12.Hosoya H. Topological index as a common tool for quantum chemistry, statistical mechanics, and graph theory. Mathematical and Computational Concepts in Chemistry; (Dubrovnik, 1985); Chichester: Horwood; 1986. pp. 110–123. [Google Scholar]
  • 13.Kirk R., Wang H. Largest number of subtrees of trees with a given maximum degree. SIAM J. Discrete Math. 2008;22(3):985–995. [Google Scholar]
  • 14.Knuth D.E. Two notes on notation. Amer. Math. Monthly. 1992;99(5):403–422. [Google Scholar]
  • 15.Lin S.B., Lin C. Trees and forests with large and small independent indices. Chinese J. Math. 1995;23(3):199–210. [Google Scholar]
  • 16.Lovász L., Plummer M.D. Vol. 121. North-Holland Publishing Co; Amsterdam: 1986. Matching theory. (North-Holland Mathematics Studies). [Google Scholar]
  • 17.Merrifield R.E., Simmons H.E. Wiley; New York: 1989. Topological Methods in Chemistry. [Google Scholar]
  • 18.Prodinger H., Tichy R.F. Fibonacci numbers of graphs. Fibonacci Quart. 1982;20(1):16–21. [Google Scholar]
  • 19.Sagan B.E. A note on independent sets in trees. SIAM J. Discrete Math. 1988;1(1):105–108. [Google Scholar]
  • 20.W.A. Stein, et al. Sage Mathematics Software (Version 4.6), The Sage Development Team, (2010), http://www.sagemath.org.
  • 21.Székely L.A., Wang H. Binary trees with the largest number of subtrees. Discrete Appl. Math. 2007;155(3):374–385. [Google Scholar]
  • 22.Wagner S., Gutman I. Maxima and minima of the Hosoya index and the Merrifield–Simmons index: a survey of results and techniques. Acta Appl. Math. 2010;112(3):323–346. [Google Scholar]
  • 23.Wilf H.S. The number of maximal independent sets in a tree. SIAM J. Algebraic Discrete Methods. 1986;7(1):125–130. [Google Scholar]

RESOURCES