Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2021 Aug 3;35(3):1367–1390. doi: 10.1007/s10959-021-01117-1

The Generalized Entropy Ergodic Theorem for Nonhomogeneous Bifurcating Markov Chains Indexed by a Binary Tree

Zhiyan Shi 1,, Zhongzhi Wang 2, Pingping Zhong 1, Yan Fan 1
PMCID: PMC8329646  PMID: 34366565

Abstract

In this paper, we study the generalized entropy ergodic theorem for nonhomogeneous bifurcating Markov chains indexed by a binary tree. Firstly, by constructing a class of random variables with a parameter and the mean value of one, we establish a strong limit theorem for delayed sums of the bivariate functions of such chains using the Borel–Cantelli lemma. Secondly, we prove the strong law of large numbers for the frequencies of occurrence of states of delayed sums and the generalized entropy ergodic theorem. As corollaries, we generalize some known results.

Keywords: Binary tree, Nonhomogeneous bifurcating Markov chains, Entropy ergodic theorem

Introduction

Let {Xn,n0} be an arbitrary information source taking values in a finite alphabet set on probability space (Ω,F,P). Set fn(ω)=-1nlnP(X0,X1,,Xn), and fn(ω) is called the relative entropy density of {X0,X1,,Xn} in information theory. The convergence of fn(ω) to a constant in a sense (L1 convergence, convergence in probability, a.e. convergence) is regarded as the entropy ergodic theorem or the asymptotic equipartiton property (AEP), or the Shannon–McMillan–Breiman theorem, which is the fundamental theorem in information theory. Shannon [22] first proved the entropy ergodic theorem for convergence in probability for stationary ergodic information sources with finite alphabet. The entropy ergodic theorem in L1 and a.e. convergence, respectively, for stationary ergodic information sources was explored by McMillan [20] and Breiman [6]. Chung [8] considered the case of countable alphabet and Billingsley [5] extended the result to stationary nonergodic sources. Gray and Kieffer [14] extended it to asymptotically stationary measure process. The entropy ergodic theorem for general stochastic processes can be found, for example, in Barron [2] and Algoet and Cover [1]. Yang and Liu [19, 30] obtained the entropy ergodic theorem of nonhomogeneous Markov information sources in finite state space. Yang and Liu [32] studied the entropy ergodic theorem for mth-order nonhomogeneous Markov information sources.

Let Sn=k=1nXk, S0=0 and for any m,nN+, define

Tm,n=Sm+n-Sm=k=m+1m+nXk,

Tm,nn is regarded as the moving average or delay sum in probability theory. Many researches have been taken on topics of moving average. Shepp [23] studied the limiting values of the averages [Sn+f(n)-Sn]/f(n) for i.i.d. random variables. Gaposhkin [12] established the law of large numbers for moving averages of independent random variables. Lanzinger [17] studied an almost sure limit theorem for moving averages of random variables between the strong law of large numbers and the Erdos–Rényi law. Lai [16] gave a review of limit theorems for moving averages and described some recent developments motivated by applications to signal detection and change point problems. Recently, Wang and Yang [27] considered the entropy ergodic theorem of the moving average form and obtained the generalized entropy ergodic theorem for nonhomogeneous Markov chains.

The tree indexed stochastic process is one of the research hotspots of stochastic structure in recent years. The tree indexed stochastic process generally includes tree indexed random wak, random tree (such as Galton–Watson tree) and tree indexed Markov chain et al. There are a lot of researches about probability limit theorems of tree indexed stochastic process, we briefly list as follows: Chen [7] studied the average properties of random walks on Galton-Watson trees. Telcs and Wormald [26] studied the strong recurrence of tree indexed random walks determined by the resistance properties of spherically symmetric graphs. Dembo et al. [10] extended the notions of shift-invariance and specific relative entropy—as typically understood for Markov fields on deterministic graphs such as Zd-to Markov fields on random trees, and also developed single-generation empirical measure large deviation principles for a more general class of random trees. Le Gall [18] considered Galton–Watson trees associated with a critical offspring distribution and condition to have exactly n vertices, and they proved that these conditioned spatial trees converge as n, moduloan appropriate rescaling, towards the conditioned brownian tree under suitable assumptions on the offspring distribution and the spatial displacements. Guyon [13] studied the law of large numbers and central limit theorems for the bifurcating Markov chains indexed by a binary tree, and applied these results to detect cellular aging in Escherichia Coli, using the data of Stewart et al. and a bifurcating autoregressive model. Yamamoto [34] established a large deviation theorem for the number of branches of each order in a random binary tree, where the rate function associated with a large deviation was given by asymptotic forms of the rate function.

The significant progress of tree indexed Markov chains is its entropy ergodic theorem. Benjamini and Peres [3] gave the definition of tree-indexed Markov chains and studied the recurrence and ray-recurrence for them. Berger and Ye [4] studied the existence of entropy rate for some stationary random fields on a homogenous tree. Ye and Berger [35, 36], by using Pemantle’s [21] result and a combinational approach, have obtained entropy ergodic theorem in probability for a PPG-invariant and ergodic random field on a homogenous tree. Yang and Liu [30] established the strong law of large numbers for frequency of state occurrence on Markov chains indexed by a homogenous tree (in fact, it is special case of tree-indexed Markov chains and PPG-invariant random field). Yang [31, 33] obtained the strong law of large numbers and the entropy ergodic theorem for tree-indexed Markov chains. Huang and Yang [15] studied the strong law of large numbers and entropy ergodic theorem for Markov chains indexed by an uniformly bounded tree. Shi and Yang [25] studied the entropy ergodic theorem for mth-order nonhomogeneous Markov chains indexed by a tree. Recently, Dang et al. [9] defined a discrete form of nonhomogeneous bifurcating Markov chains indexed by a binary tree and discuss the equivalent properties for them, meanwhile the strong law of large numbers and the entropy ergodic theorem are studied for these Markov chains with finite state space. Shi et al. [24] studied the strong law of large numbers and entropy ergodic theorem for Markov chains indexed by a Cayley tree in a Markovian environment with countable state space.

Inspired by Dang et al. [9], Wang and Yang [27], and infused with some new ideas, in this paper, we study the generalized entropy ergodic theorem for nonhomogeneous bifurcating Markov chains indexed by a binary tree. Firstly, we prove a strong limit theorem for moving average of the bivariate functions of such chains. Secondly, we prove the strong law of large numbers for the frequencies of occurrence of states of moving average and the generalized entropy ergodic theorem. As corollaries, we generalize some known results. The research innovations of this paper are embodied in generalizing the entropy ergodic theorem in the form of moving average. As the classical Doob martingale convergence theorem cannot be employed, the core technique in this paper is that we construct a class of random variables with a parameter and the mean value of one, and use Borel–Cantelli lemma to prove the existence of a.e. convergence of certain random variables.

The rest of this paper is organized as follows. Section 2 describes some preliminaries, some concepts and properties of Markov chains indexed by a tree and the entropy density are reviewed. The most significant results of this article, i.e. the strong law of large numbers for the frequencies of occurrence of states and the generalized entropy ergodic theorem for the finite nonhomogeneous bifurcating Markov chains indexed by a binary tree, will be illustrated in Sect. 3. Finally, the proofs of main results in Sect. 3 are provided in Sect. 4.

Preliminaries

A tree is a graph T which is connected and contains no circuits. Given any two vertices αβT. Let αβ¯ be the unique path connecting α and β. Define the distance d(α,β) to be the number of edges contained in the path αβ¯. Select a vertex as the root (denoted by o). For any two vertices σ and t of tree T, we write σt if σ is on the unique path from the root o to t. We denote by σt the vertex farthest from o satisfying σtt and σtσ. The set of all vertices with distance n from the root o is called the n-th level of T . We denote by Ln the set of all vertices on level n (Lo={o}). We denote by Lmn to be the set of all vertices on the mth to nth level of T, specially by T(n) to be the set of all vertices on level 0 (the root o) to level n. Let T be any tree and tT\{o}. If a vertex in this tree is on the unique path from the root o to t and is the nearest to t, we call it the predecessor of t and denote it by 1t , we also call t a successor of 1t. If the root of a tree has N neighboring vertices and other vertices have N+1 neighboring vertices, we call this type of tree a Cayley tree and denote it by TC,N. That is, for any vertex t of Cayley tree TC,N, it has N successors on the next level. In this paper, we mainly investigate the binary tree TC,2, on which each vertex has two successors on the next level. For simplicity, we denote TC,2 by T2 (see Fig. 1). For any vertex t of the binary tree T2, we denote by t1 and t2 the two successors of t, and call them the first successor and the second successor of t respectively.

Fig. 1.

Fig. 1

Binary tree TC,2

Let (Ω,F,P) be a probability space, and T be any tree, {Xt,tT} be tree-indexed stochastic processes defined on (Ω,F,P). Let A be the subgraph of T, XA={Xt,tA}. We denote by |A| the number of vertices of A, xA the realization of XA. Dang et al. [9] defined the discrete form of nonhomogeneous bifurcating Markov chains indexed by a binary tree. First we review the definition of this process.

Definition 2.1

(Dang et al. [9]) Let T2 be a binary tree, G a countable state space, {Xt,tT2} be a collection of G-valued random variables defined on probability space (Ω,F,P). Let

p={p(x),xG} 1

be a distribution on G, and

Pt=(Pt(y1,y2|x)),x,y1,y2G,tT2 2

be a collection of stochastic matrices (that is Pt(y1,y2|x)0,y1,y2,xG, and (y1,y2)G2Pt(y1,y2|x)=1,xG) on G×G2. If n1,

P(XLn=xLn|XT(n-1)=xT(n-1))=tLn-1Pt(xt1,xt2|xt), 3

and

P(Xo=x)=p(x),xG, 4

{Xt,tT2} will be called G-valued nonhomogeneous bifurcating Markov chains indexed by a binary tree T2 with the initial distribution (1) and stochastic matrices (2). If tT2,Pt=P, where P={P(y1,y2|x),x,y1,y2G} is a stochastic matrix on G×G2, {Xt,tT2} will be called G-valued homogeneous bifurcating Markov chains indexed by a binary tree.

Dang et al. [9] presented the equivalent properties for nonhomogeneous bifurcating Markov chains indexed by a binary tree as following.

Property 2.1

(Dang et al. [9]) Let T2 be a binary tree, G a countable state space, and {Xt,tT2} be a collection of G-valued random variables defined on probability space (Ω,F,P), then the three propositions below are equivalent:

  • (i)

    {Xt,tT2} is a G-valued nonhomogeneous bifurcating Markov chain indexed by a binary tree T2 with the initial distribution (1) and stochastic matrices (2) defined by Definition 2.1;

  • (ii)
    For n1 and xT(n)GT(n), we have
    P(XT(n)=xT(n))=p(xo)tT(n-1)Pt(xt1,xt2|xt); 5
  • (iii)
    For n1 and t,t1,t2,,tnT2, satisfying tit1t,tit2t,1in, we have
    P(Xt1=y1,Xt2=y2|Xt=x,Xt1=xt1,,Xtn=xtn)=Pt(y1,y2|x)=P(Xt1=y1,Xt2=y2|Xt=x),x,y1,y2G, 6
    and
    P(Xo=x)=p(x),xG.

Remark 2.1

It is a consequence of Kolmogorov extension theorem that there exists a collection of G-valued random variables {Xt,tT2} on some probability space such that (5) holds.

Remark 2.2

By (5), we can easily obtain that for m,n1,nm and xLmn=GLmn,

P(XLmn=xLmn)=P(XLm=xLm)tLmn-1P(xt1,xt2|xt). 7

Remark 2.3

If {Xt,tT2} is a G-valued nonhomogeneous bifurcating Markov chains indexed by a binary tree T2 with the stochastic matrices (2) defined by Definition 2.1. From the second equality of (6), we have that for any tT,

P(Xt1=y1,Xt2=y2|Xt=x)=Pt(y1,y2|x).

Below we will recall the definition of tree indexed nonhomogeneous Markov chains.

Definition 2.2

(Dong et al. [11]) Let T be a local finite and infinite tree, G a countable state space, {Xt,tT} be a collection of G-valued random variables defined on probability space (Ω,F,P). Let

p={p(x),xG} 8

be a distribution on G, and

Qt=(Qt(y|x)),x,yG,tT\{o} 9

be a collection of transition matrices on G2. If n1, and t,t1,t2,,tnT, satisfying tit1t,1in, we have

P(Xt=y|X1t=x,Xt1=xt1,,Xtn=xtn)=P(Xt=y|X1t=x)=Qt(y|x),x,yG, 10

and

P(Xo=x)=p(x),xG, 11

{Xt,tT} will be called G-valued nonhomogeneous Markov chains indexed by tree T with the initial distribution (8) and transition matrices (9), or called tree indexed nonhomogeneous Markov chains with state space G.

The above definition is the natural generalization of the definition of homogeneous Markov chains indexed by tree T (see Benjamini and Peres [3]). Similar to the equivalent property of nonhomogeneous bifurcating Markov chains indexed by a binary tree, by Property 2.1, we can immediately obtain the equivalent property of nonhomogeneous Markov chains indexed by a tree.

Property 2.2

(Dang et al. [9]) Let T be a local finite and infinite tree, G a countable state space, and {Xt,tT} be a collection of G-valued random variables defined on probability space (Ω,F,P). Then {Xt,tT} is a tree indexed nonhomogeneous Markov chain taking values in G defined by Definition 2.2 if and only if n1 and xT(n)GT(n),

P(XT(n)=xT(n))=p(xo)tT(n)\{o}Qt(xt|x1t). 12

Remark 2.4

From Property 2.2, we know that {Xt,tT2} is a tree indexed nonhomogeneous Markov chain if and only if, n1 and xT(n)GT(n),

P(XT(n)=xT(n))=p(xo)tT(n-1)Qt1(xt1|xt)Qt2(xt2|xt). 13

Thus a nonhomogeneous bifurcating Markov chain indexed by a binary tree is the nonhomogeneous Markov chain indexed by a binary tree if and only if, for tT2 and x,y1,y2G,

Pt(y1,y2|x)=Qt1(y1|x)Qt2(y2|x), 14

that is tT2,

P(Xt1=y1,Xt2=y2|Xt=x)=P(Xt1=y1|Xt=x)P(Xt2=y2|Xt=x).

The above equality means that a nonhomogeneous bifurcating Markov chain indexed by a binary tree is the nonhomogeneous Markov chain indexed by a binary tree if and only if for any tT2, their two successors of the same predecessor of t are conditionally independent.

Let T be a tree, {Xt,tT} be a stochastic process indexed by tree T taking values in countable state space G. Denote P(xLmn)=P(XLmn=xLmn). Let {an,n0} and {ϕ(n),n0} be two sequences of nonnegative integers such that limnϕ(n)=. Define

fan,ϕ(n)(ω)=-1|Lanan+ϕ(n)|lnP(XLanan+ϕ(n)), 15

fan,ϕ(n)(ω) will be called the generalized entropy density of XLanan+ϕ(n). Particularly, if an0 and ϕ(n)=n, fan,ϕ(n)(ω) will become the classical entropy density of XT(n) defined as follows

fn(ω)f0,n(ω)=-1|T(n)|lnP(XT(n)). 16

Obviously, if {Xt,tT} is a nonhomogeneous bifurcating Markov chains indexed by a binary tree defined by Definition 2.1, it follows from (7) that

fan,ϕ(n)(ω)=-1|Lanan+ϕ(n)|[lnP(XLan)+tLanan+ϕ(n)-1lnPt(Xt1,Xt2|Xt)], 17

and

fn(ω)=-1T(n)[lnP(Xo)+tT(n-1)lnPt(Xt1,Xt2|Xt)]. 18

Property 2.3

(Yang and Yang [28]) Let T2 be a binary tree, G={0,1,,b-1} a finite state space and {Xt,tT2} a tree-indexed stochastic process taking values in G. Let fan,ϕ(n)(ω) be defined by (15). Then fan,ϕ(n)(ω) are uniformly integrable.

Main Results

Let G={0,1,,b-1} be a finite state space, {Xt,tT2} be a G-valued nonhomogeneous bifurcating Markov chain indexed by a binary tree defined as before. Let Sk(Lanan+ϕ(n))(kG) be the number of k in set of random variables {Xt,tLanan+ϕ(n)}, and Sk(Lan)(kG) be the number of k in set of random variables {Xt,tLan},Ski(Lanan+ϕ(n)-1)(kG) be the number of k in set of random variables {Xti=k,tLanan+ϕ(n)-1},i=1,2, which are defined as,

Sk(Lanan+ϕ(n))=|{tLanan+ϕ(n):Xt=k}|;Sk(Lan)=|{tLan:Xt=k}|;

and

Ski(Lanan+ϕ(n)-1)=|{tLanan+ϕ(n)-1:Xti=k}|,i=1,2.

It follows that

Sk(Lanan+ϕ(n))=tLanan+ϕ(n)Ik(Xt); 19
Sk(Lan)=tLanIk(Xt); 20
Sk1(Lanan+ϕ(n)-1)=tLanan+ϕ(n)-1Ik(Xt1); 21
Sk2(Lanan+ϕ(n)-1)=tLanan+ϕ(n)-1Ik(Xt2); 22

and

Sk(Lanan+ϕ(n))=Sk1(Lanan+ϕ(n)-1)+Sk2(Lanan+ϕ(n)-1)+Sk(Lan), 23

where

Ik(i)=1,i=k,0,ik.

In this section, we will establish the strong law of large numbers for the frequencies of occurrence of states and the generalized entropy ergodic theorem for the finite nonhomogeneous bifurcating Markov chains indexed by a binary tree. Firstly, we will give the strong law of large numbers for the frequencies of occurrence of states for this chains with finite state space.

Theorem 3.1

Let G={0,1,,b-1} be a finite state space, and {Xt,tT2} be a G-valued nonhomogeneous bifurcating Markov chain indexed by a binary tree T2 with stochastic matrices {Pt,tT2} defined by Definition 2.1, Sk(Lanan+ϕ(n)) be defined by (19). Let P=(P(y1,y2|x)),x,y1,y2G be another stochastic matrix, and let P1(y1|x)=y2GP(y1,y2|x),P2(y2|x)=y1GP(y1,y2|x),P1=(P1(y|x)),P2=(P2(y|x)). Let Q=12(P1+P2), and assume that the transition matrix Q is ergodic. Let {an,n0} and {ϕ(n),n0} be two nonnegative integer sequences such that for any positive integers nm

ϕ(m+n)-ϕ(n)m. 24

If x,y1,y2G,

limn1|T(n)|tT(n-1)|Pt(y1,y2|x)-P(y1,y2|x)|=0, 25

then

limnSk(Lanan+ϕ(n))|Lanan+ϕ(n)|=π(k)a.e.kG, 26

where π={π(0),π(1),,π(b-1)} is the unique stationary distribution determined by the transition matrix Q.

The proof of the above theorem will be given in Sect. 4.

In the following, we will study the generalized entropy ergodic theorem for nonhomogeneous bifurcating Markov chains indexed by a binary tree with finite state space G={0,1,,b-1}.

Theorem 3.2

Under the conditions of Theorem 3.1, let fan,ϕn(ω) be as defined in (17) and {an,n0} be a sequence of bounded nonnegative numbers, then

limnfan,ϕn(ω)=-12l=0b-1π(l)k1=0b-1k2=0b-1P(k1,k2|l)lnP(k1,k2|l)a.e.. 27

The proof of the above theorem will be presented in Sect. 4.

Remark 3.1

Let an0,ϕ(n)=n in Theorem 3.2, we can immediately get the entropy ergodic theorem for nonhomogeneous bifurcating Markov chains indexed by a binary tree with finite state space G (see Dang et al. [9]).

Remark 3.2

From Property 2.3, we know that fan,ϕ(n)(ω) are uniformly integrable. Thus (27) also holds with L1 convergence.

We denote by gan,ϕ(n)(ω) the generalized entropy density of nonhomogeneous Markov chains indexed by a tree with the initial distribution (8) and transition matrices (9). From (12), it is easy to see that

gan,ϕ(n)(ω)=-1|Lanan+ϕ(n)|[lnP(XLan)+tLan+1an+ϕ(n)lnQt(Xt|X1t)]. 28

By Theorem 3.2, we can establish the generalized entropy ergodic theorem for nonhomogeneous Markov chains indexed by a binary tree.

Corollary 3.1

Let T2 be a binary tree, G={0,1,2,,b-1} be a finite state space, {Xt,tT2} be a G-valued nonhomogeneous Markov chain indexed by T2 with the transition matrices (9) defined by Definition 2.2. Let Q=(Q(k|l)),k,lG be another transition matrix, and assume that Q is ergodic. Let {an,n0} be a sequence of bounded nonnegative integers and {ϕ(n),n0} be a nonnegative integer sequences such that for any positive integers nm,

ϕ(m+n)-ϕ(n)m.

If

limn1|T(n)|T(n)\{o}|Qt(k|l)-Q(k|l)|=0,k,lG, 29

then

limngan,ϕ(n)(ω)=-l=0b-1k=0b-1π(l)Q(k|l)lnQ(k|l)a.e., 30

where π={π(0),,π(b-1)} is the unique stationary distribution determined by the transition matrix Q.

The proof of the above corollary will be given in Sect. 4.

Remark 3.3

Take an0,ϕ(n)=n in Corollary 3.1, it is straightforward to obtain the entropy ergodic theorem for nonhomogeneous Markov chains indexed by a Cayley tree TC,2 with finite state space G. The result is a special case of Dong, Yang and Bai [11] for N=2.

If there is only one son for each vertex of the tree, nonhomogeneous Markov chains indexed by a binary tree will degenerate into nonhomogeneous Markov chains. Similarly, we denote by han,ϕ(n)(ω) the generalized entropy density of nonhomogeneous Markov chain with the initial distribution {μ0(0),,μ0(b-1)} and transition matrices Pn=(pn(i,j)),i,jG. It easily follows that

han,ϕ(n)(ω)=-1ϕ(n){logμan(Xan)+k=an+1an+ϕ(n)logpk(Xk-1,Xk)}, 31

where μan(x) is the distribution of Xan. Thus we can get the generalized entropy ergodic theorem for nonhomogeneous Markov chains.

Corollary 3.2

Suppose {Xn,n0} is a nonhomogeneous Markov chain taking values from a finite state space G={0,1,,b-1} with the initial distribution {μ0(0),,μ0(b-1)} and the transition matrices {Pn=(pn(i,j)),i,jG,n=1,2,}, where pn(i,j)=P(Xn=j|Xn-1=i). Let {an,n0} be a sequence of bounded nonnegative integer and {ϕ(n),n0} be a nonnegative integer sequences such that for any positive integers nm

ϕ(m+n)-ϕ(n)m.

Let P=(p(i,j)) be another transition matrix, and assume that P is irreducible. If

limn1nk=1n|pk(i,j)-p(i,j)|=0, 32

then

limnhan,ϕ(n)(ω)=-i=0b-1j=0b-1πip(i,j)logp(i,j)a.e.. 33

Proof

The corollary is a special case of Corollary 3.1, where T2 is the set of nonnegative integers N.

Remark 3.4

Note that

1ϕ(n)k=an+1an+ϕ(n)|pk(i,j)-p(i,j)|(1+anϕ(n))1an+ϕ(n)k=1an+ϕ(n)|pk(i,j)-p(i,j)|,

and {an} is bounded, by (32), we have that

limn1ϕ(n)k=an+1an+ϕ(n)|pk(i,j)-p(i,j)|=0.

Thus, we can immediately obtain the results of Wang and Yang [27] on the generalized entropy ergodic theorem for delayed sums of nonhomogeneous Markov chains.

Remark 3.5

If an0,ϕ(n)=n in Corollary 3.2, we can get the entropy ergodic theorem of nonhomogeneous Markov chains (see Yang, [29]).

The Proofs

Before providing the proofs of the main results in Sect 3, we begin with some lemmas.

Lemma 4.1

Let T2 be a binary tree, and G be a countable state space. Assuming that {Xt,tT2} be a G-valued nonhomogeneous bifurcating Markov chain indexed by a binary tree T2 defined by Definition 2.1, and {gt(x,y1,y2),tT2} be a collection of functions defined on G3. Suppose that α>0, s.t. E[eα|gt(Xt,Xt1,Xt2)|]<,tT2. Let {an,n0} and {ϕ(n),n0} be two sequences of nonnegative integers such that ϕ(n) converges to infinity as n. Assume that for ε>0,

n=1exp(-|Lanan+ϕ(n)|ε)<. 34

Let

Han,ϕ(n)(ω)=tLanan+ϕ(n)-1gt(Xt,Xt1,Xt2), 35

and

Gan,ϕ(n)(ω)=tLanan+ϕ(n)-1E[gt(Xt,Xt1,Xt2)|Xt]. 36

Let α>0, and set

D(α)={ω:lim supn1|Lanan+ϕ(n)|tLan+1an+ϕ(n)E[gt2(Xt,Xt1,Xt2)eα|gt(Xt,Xt1,Xt2)||Xt]=M(α;ω)<}. 37

Then we have

limnHan,ϕ(n)(ω)-Gan,ϕ(n)(ω)|Lanan+ϕ(n)|=0a.e.ωD(α). 38

Remark 4.1

It is easy to see that if {gt(x,y1,y2),tT2} is a collection of uniformly bounded functions, then for any α>0,D(α)=Ω, thus we can get

limnHan,ϕ(n)(ω)-Gan,ϕ(n)(ω)|Lanan+ϕ(n)|=0a.e..

Remark 4.2

Let an=0 and ϕ(n)=[log2nα](α>0). Since T2 is a binary tree, we have

|Lanan+ϕ(n)|=2[log2nα]+1-12log2nα-1+1-1=nα-1,

where [·] is the usual greatest integer function. In this case (34) holds.

Proof

Let λ be a nonzero real number, for fixed n, define

tan,m(λ,ω)=eλtLanan+m-1gt(Xt,Xt1,Xt2)tLanan+m-1E[eλgt(Xt,Xt1,Xt2)|Xt],m=1,2,,ϕ(n). 39

Noticing that

E[eλtLan+ϕ(n)-1gt(Xt,Xt1,Xt2)|XT(an+ϕ(n)-1)]=tLan+ϕ(n)-1,(xt1,xt2)G2eλtLan+ϕ(n)-1gt(Xt,xt1,xt2)·P(XLan+ϕ(n)=xLan+ϕ(n)|XT(an+φ(n)-1))=tLan+ϕ(n)-1,(xt1,xt2)G2eλtLan+ϕ(n)-1gt(Xt,xt1,xt2)·tLan+ϕ(n)-1Pt(xt1,xt2|Xt)=tLan+ϕ(n)-1(xt1,xt2)G2eλgt(Xt,xt1,xt2)Pt(xt1,xt2|Xt)=tLan+ϕ(n)-1E[eλgt(Xt,Xt1,Xt2)|Xt]. 40

It is easy to see that E[tan,1(λ,ω)]=1. Hence by (40),

E[tan,ϕ(n)(λ,ω)]=EE[tan,ϕ(n)(λ,ω)|XT(an+ϕ(n)-1)]=E[E[tan,ϕ(n)-1(λ,ω)eλtLan+ϕ(n)-1gt(Xt,Xt1,Xt2)tLan+ϕ(n)-1E[eλgt(Xt,Xt1,Xt2)|Xt]|XT(an+ϕ(n)-1)]]=E[tan,ϕ(n)-1(λ,ω)·E[eλtLan+ϕ(n)-1gt(Xt,Xt1,Xt2)XT(an+ϕ(n)-1)]tLan+ϕ(n)-1E[eλgt(Xt,Xt1,Xt2)|X1t]]=E[tan,ϕ(n)-1(λ,ω)]==E[tan,1(λ,ω)]=1. 41

By Markov inequality, (34) and (41), for any ε>0, we have

n=1P[1|Lanan+ϕ(n)|lntan,ϕ(n)(λ,ω)ε]=n=1Ptan,ϕ(n)(λ,ω)exp(|Lanan+ϕ(n)|·ε)n=1exp(-|Lanan+ϕ(n)|·ε)<. 42

According to Borel–Cantelli Lemma and arbitrariness of ε, we have

lim supn1|Lanan+ϕ(n)|lntan,ϕ(n)(λ,ω)0a.e.. 43

Noticing that

lntan,ϕ(n)(λ,ω)|Lanan+ϕ(n)|=1|Lanan+ϕ(n)|tLanan+ϕ(n)-1{λgt(Xt,Xt1,Xt2)-lnE[eλgt(Xt,Xt1,Xt2)|Xt]}. 44

by (43) and (44), we have

lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1{λgt(Xt,Xt1,Xt2)-lnE[eλgt(Xt,Xt1,Xt2)|Xt]}0a.e.. 45

Let 0<λα, dividing both sides of (45) by λ, we have

lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1{gt(Xt,Xt1,Xt2)-lnE[eλgt(Xt,Xt1,Xt2)|Xt]λ}0a.e.. 46

By (37), (46), and inequalities lnxx-1(x>0) and 0ex-1-x12x2e|x|, we get

lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1{gt(Xt,Xt1,Xt2)-E[gt(Xt,Xt1,Xt2)|Xt]}lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1{lnE[eλgt(Xt,Xt1,Xt2)|Xt]λ-E[gt(Xt,Xt1,Xt2)|Xt]}lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1{E[eλgt(Xt,Xt1,Xt2)|Xt]-1λ-E[λgt(Xt,Xt1,Xt2)|Xt]λ}=lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1E[eλgt(Xt,Xt1,Xt2)-1-λgt(Xt,Xt1,Xt2)|Xt]λλ2lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1E[gt2(Xt,Xt1,Xt2)e|λ|·|gt(Xt,Xt1,Xt2)||Xt]λ2lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1E[gt2(Xt,Xt1,Xt2)e|α|·|gt(Xt,Xt1,Xt2)||Xt]=λ2M(α;ω)a.e.ωD(α). 47

Letting λ0+ in (47) we have

lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1{gt(Xt,Xt1,Xt2)-E[gt(Xt,Xt1,Xt2)|Xt]}0a.e.ωD(α). 48

Let -αλ<0, we similarly get

lim infn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1{gt(Xt,Xt1,Xt2)-E[gt(Xt,Xt1,Xt2)|Xt]}0a.e.ωD(α). 49

Combining (48) and (49), we obtain (38) directly.

Lemma 4.2

Let T2 be a binary tree, {an,n0} and {ϕ(n),n0} defined as in Lemma 4.1. Let {at,tT} be a collection of real numbers, and a be a real number. If

limn1|T(n)|tT(n-1)|at-a|=0, 50

then

limn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1|at-a|=0. 51

Proof

Noticing that

1|Lanan+ϕ(n)|tLanan+ϕ(n)-1|at-a||T(an+ϕ(n))||Lanan+ϕ(n)|1|T(an+ϕ(n))|tT(an+ϕ(n)-1)|at-a|. 52

Since T2 is a binary tree, we have

limn|T(an+ϕ(n))||Lanan+ϕ(n)|=limn2an+ϕ(n)+1-12an(2ϕ(n)+1-1)=1. 53

Equation (51) immediately follows from (50), (52) and (53).

Now, we present the proof of Theorem 3.1 as follows.

Proof of Theorem 3.1

It is easy to see from (24) that limnϕ(n)= and (34) is satisfied. By (25) and Lemma 4.2, we have

limn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1|Pt(y1,y2|x)-P(y1,y2|x)|=0. 54

Let gt(x,y1,y2)=Ik(y1) in Lemma 4.1. Obviously, {gt(x,y1,y2),tT2} are uniformly bounded. Since

Han,ϕ(n)(ω)=tLanan+ϕ(n)-1Ik(Xt1)=Sk1(Lanan+ϕ(n)-1), 55

and

Gan,ϕ(n)(ω)=tLanan+ϕ(n)-1E[gt(Xt,Xt1,Xt2)|Xt]=tLanan+ϕ(n)-1(xt1,xt2)G2gt(Xt,xt1,xt2)·Pt(xt1,xt2|Xt)=tLanan+ϕ(n)-1(xt1,xt2)G2Ik(xt1)·Pt(xt1,xt2|Xt). 56

From Lemma 4.1, we have

limn1|Lanan+ϕ(n)|{Sk1(Lanan+ϕ(n)-1)-tLanan+ϕ(n)-1(xt1,xt2)G2Ik(xt1)Pt(xt1,xt2|Xt)}=0a.e.. 57

From (54), it can be easily verified that

limn1|Lanan+ϕ(n)|{tLanan+ϕ(n)-1(xt1,xt2)G2Ik(xt1)Pt(xt1,xt2|Xt)-tLanan+ϕ(n)-1(xt1,xt2)G2Ik(xt1)P(xt1,xt2|Xt)}=0. 58

Since xt2GP(xt1,xt2|Xt)=P1(xt1|Xt), so

tLanan+ϕ(n)-1(xt1,xt2)G2Ik(xt1)P(xt1,xt2|Xt)=tLanan+ϕ(n)-1P1(k|Xt)=tLanan+ϕ(n)-1l=0b-1Il(Xt)P1(k|l)=l=0b-1P1(k|l)Sl(Lanan+ϕ(n)-1). 59

By (57)–(59), we have

limn1|Lanan+ϕ(n)|{Sk1(Lanan+ϕ(n)-1)-l=0b-1P1(k|l)Sl(Lanan+ϕ(n)-1)}=0a.e.. 60

Let gt(x,y1,y2)=Ik(y2) in Lemma 4.1, similarly, we obtain that

limn1|Lanan+ϕ(n)|{Sk2(Lanan+ϕ(n)-1)-l=0b-1P2(k|l)Sl(Lanan+ϕ(n)-1)}=0a.e.. 61

Adding (60) and (61), and noticing that

0limnSk(Lan)|Lanan+ϕ(n)|limn|Lan||Lanan+ϕ(n)|=limn2an2an(2ϕ(n)-1)=0,

limn|Lanan+ϕn)||Lanan+ϕ(n)-1|=2, and Q=12(P1+P2). By (23), we have

limn{Sk(Lanan+ϕ(n))|Lanan+ϕ(n)|-l=0b-1Q(k|l)Sl(Lanan+ϕ(n)-1)|Lanan+ϕ(n)-1|}=0a.e. 62

Letting ϕ(n)=ϕ(n)-1, it is easy to see that {ϕ(n),n0} also satisfies (34). Using the same argument as that used to derive (62), we can prove that

limn{Sk(Lanan+ϕ(n)-1)|Lanan+ϕ(n)-1|-l=0b-1Q(k|l)Sl(Lanan+ϕ(n)-2)|Lanan+ϕ(n)-2|}=0a.e.. 63

Multiplying the k-th equality of (63) by Q(j|k), adding them together and using (62), we have

0=limnk=0b-1Sk(Lanan+ϕ(n)-1)|Lanan+ϕ(n)-1|Q(j|k)-k=0b-1l=0b-1Sl(Lanan+ϕ(n)-2)|Lanan+ϕ(n)-2|Q(k|l)Q(j|k)=limn{k=0b-1Sk(Lanan+ϕ(n)-1)|Lanan+ϕ(n)-1|Q(j|k)-Sj(Lanan+ϕ(n))|Lanan+ϕ(n)|+Sj(Lanan+ϕ(n))|Lanan+ϕ(n)|-k=0b-1l=0b-1Sl(Lanan+ϕ(n)-2)|Lanan+ϕ(n)-2|Q(k|l)Q(j|k)}=limnSj(Lanan+ϕ(n))|Lanan+ϕ(n)|-l=0b-1Sl(Lanan+ϕ(n)-2)|Lanan+ϕ(n)-2|Q(2)(j|l)a.e.. 64

where Q(N)(j|l) is the N-step transition probability determined by Q. By induction, we have

limnSj(Lanan+ϕ(n))|Lanan+ϕ(n)|-l=0b-1Sl(Lanan+ϕ(n)-N)|Lanan+ϕ(n)-N|Q(N)(j|l)=0a.e.. 65

Noticing that

1|Lanan+ϕ(n)-N|l=0b-1Sl(Lanan+ϕ(n)-N)=1, 66

and

limNQ(N)(j|l)=π(j),jG. 67

(26) follows from (65), (66) and (67). This completes the proof of the Theorem 3.1.

Before presenting the proof of Theorem 3.2, we cite a lemma which will be used.

Lemma 4.3

(Dong et al. [11]) Let T2 be a binary tree, φ(x) be a bounded function defined on interval , and φ be continuous at x=b(b). Let {bt,tT2} be a collection of real numbers. If

limn1|T(n)|tT(n-1)|bt-b|=0, 68

then

limn1|T(n)|tT(n-1)|φ(bt)-φ(b)|=0. 69

Proof of Theorem 3.2

Since {an,n0} is bounded, then there exists M0 such that |an|M for all n0. Since

E[e|lnP(XLan)|]=xLane-lnP(XLan=xLan)P(XLan=xLan)b|Lan|.

It is easy to see from (24) that {ϕ(n),n0} satisfies (34). By Markov inequality and (34), we have for every ε>0,

n=1P1|Lanan+ϕ(n)|lnP(XLan)εb2Mn=1exp{-ε|Lanan+ϕ(n)|}<. 70

By Borel–Cantelli lemma, we get

limn1|Lanan+ϕ(n)|lnP(XLan)=0a.e.. 71

Let φ(x)=xlogx(φ(0)=0). It is easy to see that φ(x) is a continuous function on the interval [0, 1]. By Lemmas 4.2, 4.3 and (25), we have k1,k2,lG

limn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1Pt(k1,k2|l)lnPt(k1,k2|l)-P(k1,k2|l)lnP(k1,k2|l)=0. 72

Let gt(x,y1,y2)=lnPt(y1,y2|x) for all tT2 in Lemma 4.1. By (35) and (36), we have

Han,ϕ(n)(ω)=tLanan+ϕ(n)-1lnPt(Xt1,Xt2|Xt), 73
Gan,ϕ(n)(ω)=tLanan+ϕ(n)-1(xt1,xt2)G2Pt(xt1,xt2|Xt)lnPt(xt1,xt2|Xt). 74

Letting α=12, noticing that for any tT2, we have

Egt2(Xt,Xt1,Xt2)eα|gt(Xt,Xt1,Xt2)||Xt=(xt1,xt2)G2ln2Pt(xt1,xt2|Xt)·e-12lnPt(xt1,xt2|Xt)Pt(xt1,xt2|Xt)=(xt1,xt2)G2ln2Pt(xt1,xt2|Xt)[Pt(xt1,xt2|Xt)]1216b2e-2.

and tT2,

E[e12|gt(Xt,Xt1,Xt2)||Xt]<. 75

Thus

lim supn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1E[gt2(Xt,Xt1,Xt2)·e12|gt(Xt,Xt1,Xt2)||Xt]16b2e-2. 76

By (73)–(76) and Lemma 4.1, we have

limn1|Lanan+ϕ(n)|{tLanan+ϕ(n)-1lnPt(Xt1,Xt2|Xt)-tLanan+ϕ(n)-1(xt1,xt2)G2Pt(xt1,xt2|Xt)·lnPt(xt1,xt2|Xt)}=0a.e.. 77

Now, we have

|1|Lanan+ϕ(n)|tLanan+ϕ(n)-1(xt1,xt2)G2Pt(xt1,xt2|Xt)·lnPt(xt1,xt2|Xt)-12l=0b-1π(l)k1=0b-1k2=0b-1P(k1,k2|l)lnP(k1,k2|l)||1|Lanan+ϕ(n)|tLanan+ϕ(n)-1l=0b-1k1=0b-1k2=0b-1Il(Xt)Pt(k1,k2|l)·lnPt(k1,k2|l)-1|Lanan+ϕ(n)|tLanan+ϕ(n)-1l=0b-1k1=0b-1k2=0b-1Il(Xt)P(k1,k2|l)·lnP(k1,k2|l)|+|1|Lanan+ϕ(n)|tLanan+ϕ(n)-1l=0b-1k1=0b-1k2=0b-1Il(Xt)P(k1,k2|l)·lnP(k1,k2|l)-12l=0b-1π(l)k1=0b-1k2=0b-1P(k1,k2|l)lnP(k1,k2|l)|l=0b-1k1=0b-1k2=0b-11|Lanan+ϕ(n)|tLanan+ϕ(n)-1|Pt(k1,k2|l)·lnPt(k1,k2|l)-P(k1,k2|l)·lnP(k1,k2|l)|+l=0b-1k1=0b-1k2=0b-1P(k1,k2|l)·lnP(k1,k2|l)1|Lanan+ϕ(n)|tLanan+ϕ(n)-1Il(Xt)-12π(l)l=0b-1k1=0b-1k2=0b-11|Lanan+ϕ(n)|tLanan+ϕ(n)-1|Pt(k1,k2|l)·lnPt(k1,k2|l)-P(k1,k2|l)·lnP(k1,k2|l)|+l=0b-1k1=0b-1k2=0b-1P(k1,k2|l)·lnP(k1,k2|l)1|Lanan+ϕ(n)|Sl(Lanan+ϕ(n)-1)-12π(l)a.e.. 78

By Theorem 3.1, (72), (77) and (78), and noticing that limn|Lanan+ϕ(n)-1||Lanan+ϕ(n)|=12. We have

limn1|Lanan+ϕ(n)|tLanan+ϕ(n)-1lnPt(Xt1,Xt2|Xt)=12l=0b-1π(l)k1=0b-1k2=0b-1P(k1,k2|l)lnP(k1,k2|l)a.e.. 79

(27) can be obtained from (17), (71) and (79), which completes the proof of the theorem 3.2.

Proof of Corollary 3.1

Let tT2 and x,y1,y2G,Pt(y1,y2|x)=Qt1(y1|x)Qt2(y2|x). From Remark 2.4 we know that nonhomogeneous Markov chain indexed by a binary tree given in this corollary is a nonhomogeneous bifurcating Markov chain indexed by a binary tree with the stochastic matrices {Pt=(Pt(y1,y2|x)),tT2}, and

gan,ϕ(n)(ω)=1|Lanan+ϕ(n)|[lnP(XLan)+tLan+1an+ϕ(n)lnQt(Xt|X1t)]=-1|Lanan+ϕ(n)|[lnP(XLan)+tLanan+ϕ(n)-1lnQt1(Xt1|Xt)+tLanan+ϕ(n)-1lnQt2(Xt2|Xt)]=-1|Lanan+ϕ(n)|[lnP(XLan)+tLanan+ϕ(n)-1lnPt(Xt1,Xt2|Xt)]=fan,ϕ(n)(ω). 80

Let P(y1,y2|x)=Q(y1|x)Q(y2|x). It is easy to see that P1=Q,P2=Q,12(P1+P2)=Q, and Q is ergodic. Since

|Pt(k1,k2|l)-P(k1,k2|l)|=|Qt1(k1|l)Qt2(k2|l)-Q(k1|l)Q(k2|l)||Qt1(k1|l)Qt2(k2|l)-Q(k1|l)Qt2(k2|l)|+|Q(k1|l)Qt2(k2|l)-Q(k1|l)Q(k2|l)||Qt1(k1|l)-Q(k1|l)|+|Qt2(k2|l)-Q(k2|l)|, 81

and by (29), for i=1,2,

limn1|T(n)|tT(n-1)\{o}|Qti(k1|l)-Q(k1|l)|limn1|T(n)|tT(n)\{o}|Qt(k1|l)-Q(k1|l)|=0, 82

Thus (25) follows from (81), (82). By Theorem 3.2 and (80),

limngan,ϕ(n)(ω)=limnfan,ϕ(n)(ω)=-12l=0b-1π(l)k1=0b-1k2=0b-1P(k1,k2|l)lnP(k1,k2|l)=-12l=0b-1π(l)k1=0b-1k2=0b-1Q(k1|l)Q(k2|l)·[lnQ(k1|l)+lnQ(k2|l)]=-l=0b-1k=0b-1π(l)Q(k|l)lnQ(k|l)a.e.. 83

Thus, (30) holds.

Acknowledgements

The authors sincerely thank the editor and reviewers for their helpful and important comments, especially during the time with COVID-19 pandemic. The authors are also very thankful to Professor Keyue Ding who helped us to improve the English of this paper greatly. This work is supported by the National Natural Science Foundation of China (11971197, 11601191).

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Algoet PH, Cover TM. A sandwich proof of the Shannon–McMillan–Breiman theorem. Ann. Probab. 1988;16(2):899–909. doi: 10.1214/aop/1176991794. [DOI] [Google Scholar]
  • 2.Barron AR. The strong ergodic theorem for densities: generalized Shannon–McMillan–Breiman theorem. Ann. Probab. 1985;13(4):1292–1303. doi: 10.1214/aop/1176992813. [DOI] [Google Scholar]
  • 3.Benjamini I, Peres Y. Markov chains indexed by trees. Ann. Probab. 1994;22:219–243. doi: 10.1214/aop/1176988857. [DOI] [Google Scholar]
  • 4.Berger T, Ye Z. Entropic aspects of random fields on trees. IEEE Trans. Inform. Theory. 1990;36:1006–1018. doi: 10.1109/18.57200. [DOI] [Google Scholar]
  • 5.Billingsley P. Ergodic Theory and Information. New York: Wiley; 1965. [Google Scholar]
  • 6.Breiman L. The individual ergodic theorem of information theory. Ann. Math. Stat. 1957;28(3):809–811. doi: 10.1214/aoms/1177706899. [DOI] [Google Scholar]
  • 7.Chen, D.Y.: Average properties of random walks on Galton-Watson trees. Ann. Inst. Henri Poincare 33(3), 359-369 (1997)
  • 8.Chung KL. The ergodic theorem of information theory. Ann. Math. Stat. 1961;32(2):612–614. doi: 10.1214/aoms/1177705069. [DOI] [Google Scholar]
  • 9.Dang H, Yang WG, Shi ZY. The strong law of large numbers and the entropy ergodic theorem for nonhomogeneous bifurcating Markov chains indexed by a binary tree. IEEE Trans. Inf. Theory. 2015;61(4):1640–164. doi: 10.1109/TIT.2015.2404310. [DOI] [Google Scholar]
  • 10.Dembo A, Mörter P, Sheffied S. Large deviations of Markov chains indexed by random trees. Ann. I. H. Poincare-Fr. 2005;41(6):971–996. doi: 10.1016/j.anihpb.2004.09.005. [DOI] [Google Scholar]
  • 11.Dong Y, Yang WG, Bai JF. The strong law of large numbers and the Shannon-McMillan theorem for nonhomogeneous Markov chains indexed by a Cayley tree. Stat. Probab. Lett. 2011;81(12):1883–1890. doi: 10.1016/j.spl.2011.06.021. [DOI] [Google Scholar]
  • 12.Gaposhkin VF. The law of large numbers for moving averages of independent random variables. Mathematical notes of the Academy of Sciences of the USSR. 1987;42:579–583. [Google Scholar]
  • 13.Guyon, J.: Limit theorems for bifurcating Markov chains. Application to the detection of cellular aging. Ann. Appl. Probab. 17(5-6):1538–1569 (2007)
  • 14.Gray RM, Kieffer JC. Asymptotically mean stationary measures. Ann. Probab. 1980;8:962–973. doi: 10.1214/aop/1176994624. [DOI] [Google Scholar]
  • 15.Huang HL, Yang WG. Strong law of large number for Markov chains indexed by an infinite tree with uniformly bounded degree. Sci. China Ser. A. 2008;51(2):195–202. doi: 10.1007/s11425-008-0015-1. [DOI] [Google Scholar]
  • 16.Lai, T. L.: Limit theorems for moving averages. Probab. Finance Insur 25(4), 1-14 (2004)
  • 17.Lanzinger H. An almost sure limit theorem for moving averages of random variables between the strong law of large numbers and the Erdös–Rényi law. ESAIM-Probab. Stat. 1998;2:163–183. doi: 10.1051/ps:1998106. [DOI] [Google Scholar]
  • 18.Le Gall JF. A conditional limit theorem for tree-indexedrandom walk. Stoch. Proc. Appl. 2006;116:539–567. doi: 10.1016/j.spa.2005.11.008. [DOI] [Google Scholar]
  • 19.Liu W, Yang WG. A extension of Shannon–McMillan theorem and some limit properties for nonhomogeneous Markov chains. Stoch. Proc. Appl. 1996;61:129–145. doi: 10.1016/0304-4149(95)00068-2. [DOI] [Google Scholar]
  • 20.McMillan B. The basic theorems of information theory. Ann. Math. Statist. 1953;24:196–219. doi: 10.1214/aoms/1177729028. [DOI] [Google Scholar]
  • 21.Pemantle R. Antomorphism invariant measure on trees. Ann. Probab. 1992;20:1549–1566. doi: 10.1214/aop/1176989706. [DOI] [Google Scholar]
  • 22.Shannon C. A mathematical theory of communication. Bell. Syst. Tech. J. 1948;27(379–423):623–656. doi: 10.1002/j.1538-7305.1948.tb00917.x. [DOI] [Google Scholar]
  • 23.Shepp LA. A limit law concerning moving averages. Ann. Math. Statist. 1964;35(1):424–428. doi: 10.1214/aoms/1177703767. [DOI] [Google Scholar]
  • 24.Shi ZY, Zhong PP, Fan Y. The Shannon–McMillan theorem for Markov chains indexed by a Cayley tree in random environment. Probab. Eng. Inform. Sc. 2018;32(4):626–639. doi: 10.1017/S0269964817000444. [DOI] [Google Scholar]
  • 25.Shi ZY, Yang WG. Some limit properties for the mth-order nonhomogeneous Markov chains indexed by an rooted Cayley tree. Stat. Probab. Lett. 2010;80:1223–1233. doi: 10.1016/j.spl.2010.03.020. [DOI] [Google Scholar]
  • 26.Telcs A, Wormald N. Branching and tree indexed random walks on fractals. J. Appl. Prob. 1999;36:999–1011. doi: 10.1239/jap/1032374750. [DOI] [Google Scholar]
  • 27.Wang ZZ, Yang WG. The generalized entropy ergodic theorem for nonhomogeneous Markov chains. J. Theor. Probab. 2016;29:761–775. doi: 10.1007/s10959-015-0597-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Yang, J., Yang, W.G.: The generalized entropy ergodic theorem for nonhomogeneous Markov chains indexed by a Cayley tree. Chin. Ann. Math. A 41(1): 99-114 (2020). (In Chinese)
  • 29.Yang WG. The asymptotic equipartition property for nonhomogeneous Markov information sources. Probab. Eng. Inform. Sci. 1998;12:509–518. doi: 10.1017/S0269964800005350. [DOI] [Google Scholar]
  • 30.Yang WG, Liu W. Strong law of large numbers for Markov chains fields on a Bethe tree. Statist. Probab. Lett. 2000;49:245–250. doi: 10.1016/S0167-7152(00)00053-5. [DOI] [Google Scholar]
  • 31.Yang WG. Some limit properties for Markov chains indexed by a homogeneous tree. Statist. Probab. Lett. 2003;65:241–250. doi: 10.1016/j.spl.2003.04.001. [DOI] [Google Scholar]
  • 32.Yang WG, Liu W. The asymptotic equipartition property for mth-order nonhomogeneous Markov information sources. IEEE Trans. Inform. Theory. 2004;50(12):3326–3330. doi: 10.1109/TIT.2004.838339. [DOI] [Google Scholar]
  • 33.Yang WG, Ye Z. The asymptotic equipartition property for Markov chains indexed by a Homogeneous tree. IEEE Trans. Inf. Theory. 2007;53(9):3275–3280. doi: 10.1109/TIT.2007.903134. [DOI] [Google Scholar]
  • 34.Yamamoto K. Large deviation theorem for branches of the random binary tree in the Horton–Strahler analysis. SIAM J. Discrete Math. 2020;34(1):938–949. doi: 10.1137/18M1192810. [DOI] [Google Scholar]
  • 35.Ye Z, Berger T. Ergodic, regulary and asymptotic equipartition property of random fields on trees. J Combin. Inform. System Sci. 1996;21:157–184. [Google Scholar]
  • 36.Ye Z, Berger T. Information measures for discrete random fields. Beijing: Science; 1998. [Google Scholar]

Articles from Journal of Theoretical Probability are provided here courtesy of Nature Publishing Group

RESOURCES