Abstract
In this paper the asymptotic behavior of multitype Markov branching processes with discrete or continuous time is investigated in the positive regular and nonsingular case when both the initial number of ancestors and the time tend to infinity. Some limiting distributions are obtained as well as multivariate asymptotic normality is proved. The paper considers also the relative frequencies of distinct types of individuals which is motivated by applications in the field of cell biology. We obtained non-random limits for the frequencies and multivariate asymptotic normality when the initial number of ancestors is large and the time of observation increases to infinity. In fact this paper continues the investigations of Yakovlev and Yanev [32] where the time was fixed. The new obtained limiting results are of special interest for cell kinetics studies where the relative frequencies but not the absolute cell counts are accessible to measurement.
Keywords: multitype branching processes, relative frequencies, limit theorems, multivariate asymptotic normality
1. Introduction
This paper deals with multitype Markov branching processes with discrete or continuous time assuming a large number of ancestors. The asymptotic behavior in the positive regular and nonsingular case is investigated when both the number of ancestors N and the time t tend to infinity.
Recall that the terminology “branching processes” was first introduced by Kolmogorov and his coauthors [20, 21] considering multitype branching processes in the Markov case, which have received much attention in the literature on stochastic processes. For a further development of the theory of branching processes and their applications in biology we refer the reader to several books [1, 10, 12, 16, 18, 25, 28, 29]. Remember that the main problems of the theory of branching processes are focused on the asymptotic behavior of the probabilistic characteristics of the processes when t → ∞ assuming that the process begins from one ancestor.
Note that the multitype branching processes are considered in the present paper also from a new perspective: modeling of the relative frequencies of different types of individuals (instead of the usual counts of individuals) as functions of time. A short biological background and motivation is provided in Section 2 and the utility of relative frequencies (fractions, proportions) of distinct cell types in the analysis of biological studies of proliferation and differentiation of cells is considered. The need for such characteristics of cell kinetics arises in experimental situations where absolute cell counts are not accessible to measurement. In fact, the present article continue the investigations in this direction started in [32], where we considered the case with a large number of ancestors but the time was fixed. In this situation a limiting multivariate normal distribution was obtained (see Theorem A in Section 5) and a relevant statistical inference was developed. Now a new problem is investigated when both N and t increase to infinity and we would like to point out that the developed methods and the obtained results are quite different.
Feller [7] was probably the first who considered branching process with a large number of ancestors. For a classical Bienaymé-Galtton-Watson (BGW) process he showed a diffusion approximation in the near-critical case. Further results in this direction were obtained by Lamperti [22], Jiřina [17], Lindvall [23, 24], and Grimvall [9]. Lamperti [22] derived also some interesting limiting distributions. These results were summarized and discussed by Jagers [16].
Statistical inference for BGW processes with an increasing number of ancestors as well as limiting distributions when N and t tend to infinity were developed by Yanev [33] and Dion and Yanev [3, 4, 5] (see also a review chapter by Yanev [36]).
In the present paper some of the obtained limiting results by Lamperti [22], Yanev [33] and Dion and Yanev [3, 4, 5] are generalized for multitype Markov branching processes not only with discrete but also with continuous time. Note that usually the last two cases are investigated separately while in the present paper they are considered simultaneously. On the other hand, as usually in the theory of branching processes, the multitype generalization is quite different from the one dimensional case.
The paper is organized as follows. Section 3 introduces the basic notions and preliminary results for the further investigations. In Section 4, the probability for extinction are considered (Theorem 1) and some limit results are obtained for the relative frequencies (Theorem 2) and for the processes (Theorem 3). Section 5 deals with multivariate asymptotic normality of the processes (Theorem 4) as well as for the relative frequencies (Theorem 5). As a contrast of these results it is given Theorem A, where the obtained results in [32] are summarize. Note that in [32] we dealt with sequences of branching vectors while in the present paper we consider branching fields. In fact Theorem 2 can be interpreted as an analogue of a LLN in the case of branching arrays and Theorem 4 and Theorem 5 as analogues of a CLT in the same case. Finally in Section 6 it is shown that the obtained asymptotic results can be used to develop new statistical approach for estimation the basic parameters in branching processes with possible applications in the cell proliferation kinetics.
2. Biological Background and Motivation
The theory of branching processes has a long history of biological applications. It is worth to point out that the first asymptotic result for branching processes was obtained by Kolmogorov [19] considering some biological problems. Note that the branching processes have been proven especially useful in cell proliferation kinetics with multiple types of cells (see e.g. [29], [31], [35]).
Consider briefly the following example which is described with most details in [32]:
Oligodendrocyte type-2 astrocyte progenitor cells (referred to as O-2A progenitor cells), are known to be precursors of oligodendrocytes in the developing central nervous system. These cells grow in clones giving rise to oligodendrocytes, when they are plated in vitro and stimulated to divide by purified cortical astrocytes or by platelet-derived growth factor. Every O-2A progenitor cell can go to differentiation into an oligodendrocyte (with probability p) or it is able to proliferate (with probability 1 − p).
The first stochastic model of oligodendrocyte development in cell culture was proposed by Yakovlev et al. [30] and developed in [13, 14, 37]. The model structure was defined following a set of assumptions that specified it as a special case of the Bellman-Harris branching process with two types of cells similar to that studied by Jagers [15].
In the above-described example the scheme of observation allows to count the numbers of O-2A progenitor cells and oligodendrocytes in every of N clones but it may not be possible in many other experimental situation. As it is noted in [32], it is technically impossible to count the number of cells of a given type in the blood or bone marrow in animal experiments. Similarly in the case when studying suspension cell cultures which consist of those cell types for which no specific antibodies are available. Note that in these cases it is possible to observe only the proportions of different types of cells. This is routinely practiced in experimental and clinical laboratories but the mathematical models of cell population kinetics are traditionally formulated in terms of cell counts. That is why it is very important to investigate also the relative frequencies in multitype branching stochastic models, which is one of the motivations for the present work.
The asymptotic results obtained in this paper and in [32] give a new direction toward statistical inference and applications of branching processes in cell proliferation kinetics.
3. Multitype Markov Branching Processes with Discrete or Continuous Time
Usually branching processes with discrete or continuous time are investigated separately but in this paper we will try to treat them together. Throughout of this paper we will consider a multitype Markov branching process Z(t) = (Z1(t), Z2(t), …, Zd(t)), where Zk(t) denotes the number of cells (particles, individuals) of type k (k = 1, 2, …, d) at time t ∊ T. The time may be discrete T = N0 = {0, 1, 2, …} or continuous T = R+ = [0, ∞), but the process is assumed to be positive regular and nonsingular. For most details we refer to [1, 12, 25, 28].
Further on we will use the following notions:
Introduce the probability generating functions (p.g.f.)
Fi(t; s) = IE{sZ(t) | Z(0) = δi}, i = 1, 2, …, d; F(t; s) = (F1(t; s), …, Fd(t; s)).
Remember that in the Markov case
| (1) |
which has the following stochastic analog
where the vectors {Z(i)(τ; j)}j≥1 are i.i.d. as Z(i)(τ) and Z(i)(τ) means Z(τ) with Z(0) =δi.
In the discrete-time case we will use the offspring p.g.f.
Then by (1) one has
| (2) |
which means that F(t; s) is the t-th iterate of h(s).
In the continuous-time case as t ↓ 0
F(t; s) =s + tf(s) + o(t) uniformly for |s| ≤ 1,
where f(s) = (f1(s), …,fd(s)) is the vector of the infinitesimal generating functions
Then instead (2) the Kolmogorov backward and forward equations hold
It is well-known that all characteristics of the process can be obtained from the p.g.f. In particular, the following notation will be used:
Introduce the matrices A(t) = ‖Aij(t)‖, A = ‖Aij|| and a =‖aij‖. Then
Similarly one can obtain
Further on we will consider the positive regular and nonsingular case and then we will denote the Perron-Frobenius eigenvalue by R for the matrix A and by r for the matrix a. In both cases we will use one and same notion for the corresponding right u =(u1, …, ud) and left v = (υ1, …, υd) eigenvectors. It is well-known that they can be chosen positive and normalized such that Then it is well-known that as t → ∞
| (3) |
Note that the moment structure of multitype Galton-Watson process is investigated in Quine [27] where some linear recurrence relations are derived.
We will need also the following notation
| (4) |
The main purpose of this paper is the investigation of the asymptotic behavior of the processes with a large number of ancestors. For simplicity (and without any essential restriction) we will assume that Z(0) = (N, 0, …, 0) and Z1(0) = N → ∞. In this case we will use the notation Z(t; N) = (Z1(t; N), Z2(t; N), …, Zd(t; N)), t ∊ T, and we will consider the situation when both N and t go to infinity simultaneously. In general, N and t are free parameters, but further on one can assume that N = N(t) and N(t) ↑ ∞ as t → ∞.
By the branching property it follows that
| (5) |
where are i.i.d. copies of the branching process Zi(t) with Z(0) = δ1.
In fact, (5) is equivalent to the following relation
| (6) |
where
Introduce the total number of cells at the moment t as
| (7) |
Then by (5) and (7) it follows that
| (8) |
where are i.i.d. as U(t).
The relative frequencies (fractions, proportions) of types can be defined on the non-extinction set {U (t; N) > 0} as follows:
| (9) |
with the obvious condition
| (10) |
The investigation of the relative frequencies is very important for the applications (especially in the cell biology) because there are a lot of situations when it is not possible the observe the numbers of individuals but only their relative proportions.
In what follows, we will also need the following deterministic proportions:
| (11) |
where
| (12) |
4. Probability of Extinction and Limiting Distributions
In this section we will consider critical processes with R = 1, 0 < B < ∞, or r = 0, 0 < b < ∞, and non-critical processes (R ≠ 1 or r ≠ 0) with the classical” X log X condition”.
Theorem 1
Let q(t; N) = Pr{U(t; N) = 0} and N, t → ∞.
If {R > 1 or r > 0} ∨ {R = 1 or r = 0 and N/t → ∞} ∨ {R < 1, N Rt → ∞ or r < 0, Nert → ∞} then q(t; N) → 0 (non-extinction);
If {R = 1 or r = 0 and N/t → 0} ∨ {R < 1, NRt → 0 or r < 0, Nert → 0 then q(t; N) → 1 (extinction);
If 0 < C < ∞ and {R = 1 or r = 0 and N/t → C} ∨ {R < 1, N Rt → C or r < 0, Nert → C} then q(t; N) → q, 0 < q < 1 (extinction with probability q or non-extinction with probability 1 − q).
Remark 1
The constants in the critical and subcritical cases as well as for the discrete or continuous processes can be different but in (iii) and further on we will use for simplicity one and the same letter (without any indices).
Proof
Note that by (6) one has q(t; N) = FN(t; 0) = qN(t), where
q(t) = Pr{U(t) = 0} = F(t; 0), 0 =(0, 0, …, 0),
is the extinction probability and q(t) < 1 for every fixed t.
Therefore logq(t; N) = N log[1 − Q(t)], where Q(t) = 1 − q(t).
1) Let R < 1 or r < 0. Then Q(t) ~ Ku1Rt or Q(t) ~ Ku1ert as t → ∞ where K are some positive constants.
Therefore
Hence
2) Let R = 1 or r = 0. In this case Q(t) ~2u1/Bt or Q(t) ~ 2u1/bt as t → ∞.
Then
Therefore one obtains
3) Let R > 1 or r > 0. In this case q(t) ↑ q as t → ∞. Therefore q(t; N) = qN(t) → 0 as N → ∞ uniformly for 0 < t ≤ ∞.
Theorem 2
Let pk = υk/V, k = 1, …, d, and N, t → ∞.
If {R < 1, NRt → ∞ or r < 0, Nert → ∞} ∨ {R = 1 or r = 0 and N/t → ∞} then Δk(t; N) → pk in probability, k = 1, 2, …, d;
If {R > 1 or r > 0} then Δk(t; N) → pk a.s., k = 1, 2, …, d.
Proof
Remember that by (6)
| (13) |
where F (k) (t; sk) = F (t ; 1, …, 1, sk, 1, …, 1).
1) Let first consider the subcritical case R < 1 with NRt → ∞ (t N0) or r < 0 with Nert → ∞ (t R+).
Denote by β(k) (t, N ; λ), λ > 0, anyone of the following Laplace transforms:
Consider first the discrete-time case for which it is known that (see Athreya and Ney [1972, Ch. V. 3], Mode [1971,Ch. 1.11] or Sevastyanov [1971, Ch. VI. 2])
| (14) |
Since
log β(k)(t, N; λ) = N logF(k)(t; e−λ/NRt) ~ −N{1 − F(k)(t; e−λ/NRt)}
then applying (14) one obtains
| (15) |
On the other hand
Therefore by (15) one has log β(k)(t, N; λ) ~ − λu1υk and by the continuity theorem for Laplace transforms as NRt → ∞ one obtains
| (16) |
Then from (8) and (16) it follows
| (17) |
Finally from (9), (16) and (17) it is not difficult to see that in the case R < 1, NRt → ∞
| (18) |
In the continuous-time case the following relations hold (similarly to (14))
| (19) |
By differentiating the last relation in (19) and putting s = 1one obtains
Therefore by the Frobenius Theorem it follows that for some constant C1. Now using a similar arguments as in the proof of the discrete case (see Theorem 6.2.4 in Sevastyanov, 1971) it is not difficult to obtain that in fact C1 = 1/ K. On the other hand, by (3) and the first relation from (19) onehas
Hence applying (19) one obtains
and similarly to the discrete-time case
| (20) |
where
Finally from (20) one obtains that log β(k)(t, N; λ) ~ − λu1υk and by the continuity theorem as Nert → ∞ one has
| (21) |
Hence by (8) and (21) it follows
| (22) |
Then from (9), (21) and (22) it is not difficult to see that in the case r < 0, Nert → ∞
| (23) |
2) Consider now the critical case (R = 1 or r = 0) with N/t → ∞.
Using (13) one can introduce the following Laplace transform
| (24) |
Then
As t → ∞ in the discrete-time case one has (see Athreya and Ney [1972, Ch. V. 5], Mode [1971,Ch. 1.10] and Sevastyanov [1971, Ch. VI. 3])
| (25) |
uniformly for 0 ≤ s ≤ 1, s ≠ 1, where B is defined in (4).
Therefore by (13) and (23) one obtains
| (26) |
Since γ(k)(t, N; λ) → e−λu1υk as N/t → ∞ then by the continuity theorem it follows that
| (27) |
Hence by (8) and (27) one obtains
| (28) |
Finally by (9), (27) and (28) it follows that in the case R < 1, NRt → ∞
| (29) |
Note that some results obtained for discrete-time Markov branching processes can betransferred to the continuous-time case using the method of embedded branching processes. In fact,for F(t; s), t ∊R+, and εn = 2−n, n = 1, 2, …, one canconsider the p.g.f. of the embedded process where t ∊ { k∊n, k = 0, 1, 2, …}. In thiscase and for the critical processes B(εn) = bεn (1+ o(1)) as n → ∞. Therefore the asymptotic relations (25) and (26) will be valid also in the continuous-time case where B can be replaced by b from (4). Hence (27) – (29) hold.
3) In the supercritical case (R > 1 or r > 0) it is well-known (see Athreya and Ney [1972, Ch. V. 6 & 7], Mode [1971,Ch. 1.8 and 1.9] or Sevastyanov [1971, Ch. VI. 4]) that for Wk(t) = Zk(t)/Rt, t ∊ N0, or Wk(t) = Zk(t)/ert, t ∊ R+, the following relations hold as t → ∞
| (30) |
Let ηk (t; N)= Zk (t; N)/NRt, t ∊ N0, or ηk (t; N)= Zk (t; N)/Nert, t ∊ R+. Then by (5) one has where are i.i.d. as Wk(t), k = 1, …, d.
On the other hand, as N, t → ∞
where ζ k,j(t) =Wk,j (t)−υkW and by (30) one has that
Therefore by (8) one obtains that U(t; N)/NRt, t ∊ N0, or U(t; N)/Nert, t ∊ R+, converges a.s. to VW as N, t → ∞, Then the statement (ii) follows.
Remark 2
In other words, the fractions Δi(t; N), i = 1, …, d, are consistent estimators for pi under the conditions of Theorem 2. By virtue of the fact that 0 ≤ Δi (t; N) ≤ 1 and the dominated convergence theorem (DCT), it follows that IE{Δi (t; N) } converges to pi, implying that Δi (t; N) is an asymptotically unbiased estimator for pi, i = 1, 2, …, d. Similarly, one has that by the DCT and therefore Var {Δi (t; N) } → 0
Note also that for each i the defined in (11) quantity pi (t) may be interpreted as a probability for a randomly chosen cell attime t to be of the type i and by (3), (11) and (12) it is not difficult to see that pi (t) → pi as
On the other hand, if t ∊ T is fixed and N → ∞ then from (5) and (8) applying the law of large numbers (LLN) one obtains
Put it another way, the frequencies Δi(t; N), i = 1, …, d, are strongly consistent estimators for pi(t) when considered as functions of the initial number of ancestors N (but the time t is fixed). Moreover, in this case the frequencies are also asymptotically normal (see Theorem A in the next section and Remark 3).
Theorem 3
Let N, t → ∞.
(i)If {R < 1, NRt → 0 or r < 0, Nert → 0} ∨ {R = 1 or r = 0 and N/t → 0} then Zk(t; N) → 0 in probability, k = 1, 2, …, d.
(ii) If { R < 1, NRt → C or r < 0, Nert → C and 0 < C < ∞ } then , k = 1, 2, …, d, and where
The p.g.f. and F*(s)are well defined in the discrete-time case by (14) or by (19) in the continuous-time case.
(iii)If {R = 1 or r = 0 and N/t → C, 0 < C < ∞ } then Zk(t; N)/N ϱk, k = 1, 2, …, d, and U(t; N)/N ϱ, where
Proof
(i) In this case for the subcritical processes by (15) and (20) it is not difficult to obtain that log β(k)(t, N; λ) →0 and for the critical processes log γ(k)(t, N; λ) → 0 by (26), which means that IE{e−λZk(t;N)} → 1, showing the statement by the continuity theorem.
(ii) This case follows immediately from (15) and (20). Note that limiting distributions are Compound Poisson, i.e. where η has a Poisson distribution with parameterα = u1CK and ξ k (j), j ≥ 1, are i.i.d. with a Laplace transform Similarly for χ.
(iii) The statement follows directly from (26) (with a remark that in the continuous-time case one can use b instead B). The limiting distributions are also Compound Poisson but now with η ∊ Po(α), α = 2u1/C, and {ξk(j)} and i.i.d. and exponentially distributed with parameter βk = 2/CBυk or βk = 2/Cbυk. Similarly for where {ξk} are i.i.d. and ξk ∊ Exp(β), β = 2/BKV or β =2/bKV,
5. Multivariate Asymptotic Normality
In this section we will assume that the processes have finite second moments.
Consider X(t; N) = (X1(t; N), …, Xd(t; N)), t ∊ T, where for some norming functions Dk(t)
| (31) |
because of (5) and the fact that A1k(t; N) = IE{ Zk(t; N) } = N A1k (t) , k = 1, …, d.
Theorem 4
Assume N, t → ∞.
(i) If {R < 1, NRt→ ∞, or r < 0,Nert → ∞}and for k = 1, …, d,then
| (32) |
and ξ = (ξ1, …, ξd) has a multivariate normal distribution with IEζi = 0 and a covariance matrix C =‖Cjk‖, where
(ii) If {R = 1or r = 0 with N/t → ∞}and , k = 1, …, d,then
| (33) |
where η1 = … = ηd a.s., η1 ∊ N(0, 1).
(iii)If R > 1 or r > 0 and Dk(t) ~ {υkRtor υkert}, k = 1, …, d, then
| (34) |
where ζ1 = … = ζd a.s., ζ1 ∊ N (0, τ2) and τ2 = IE{W2 | Z(0) = δ1} is defined in (30).
Proof
Using (31) one can introduce the multivariate characteristic functions
where
Further on we will use the following representation
where F (t; s) is introduced in (6).
Hence,
| (35) |
(i) Note that in this case Therefore
| (36) |
where
| (37) |
Hence developing (37) it is not difficult to see that
Then the following asymptotic result hold
| (38) |
It is well-known that in the subcritical case
| (39) |
Further on we will use the notation and the corresponding matrix C= ‖Cjk‖.
Therefore from (36) – (39) one obtains log which proves (32) by the continuity theorem.
(ii) Note that in this case (or ) and therefore (36) and (37) hold. Further on we will consider only the discrete-time case because the continuous-time case is similar as it was pointed out in the proof of Theorem 2. Then by (25) and (37) it is not difficult to obtain that
From here using (36) and the fact that A1k(t) ~ u1υk one has
Since then
Finally which proves (33) by the continuity theorem.
(iii) In this case, using (30) and the references cited there, one has as
Therefore in the discrete-time case as t → ∞ the following convergence holds
Note that a similar relation is fulfilled also in the continuous-time case where only Rt has to be replaced by ert.
On the other hand, for the discrete-time processes the following equations hold
| (40) |
where
Therefore as x → 0
| (41) |
where τ2 is defined in (34).
Note that in this case the relations (35) – (37) hold and by (37) and (41) one has
Sine A1k(t)/Dk(t) → u1 then by (36) we obtain
which proves (34) in the discrete-time case (applying the continuity theorem).
In the continuous-time case instead (40) the following system of differential equations holds
| (42) |
From (42) it is not difficult to obtain that
Therefore by the Frobenius Theorem it follows that for some constant C and it is not difficult to prove that C = 1. Indeed, since Aik(t) ~ uiυkert then
Hence (41) holds and the rest of the proof is similar as in the discrete-time case.
As it was pointed out in Remark 2, the relative frequencies Δi(t; N), i = 1, 2, …, d, introduced by (9), are strongly consistent and asymptotically unbiased estimators for pi(t),i=1, 2, …,d, (defined by (11) and (12)) as N → ∞ for any fixed t. As one can see by the following theorem (proved in [32]), in this case the frequencies have also asymptotic multivariate normal distributions. Note that the theorem is valid for any kind of branching processes with discrete or continuous time (Markov or non Markov, reducible or irreducible) assuming only the usual independence of the individual evolutions.
Introduce the following notions:
Theorem A
([32]). Let t be fixed and N → ∞. Then for the r.υ.
i = 1, 2, …, d, the following statements are valid:
(i)
where the random vector X(d)(t)=(X1(t), …, Xd(t)) has a joint normal distribution with
(ii) where Yi(t) is a normally distributed r.υ. with
(iii) For every k = 2, 3, …, d – 1
and Y(k) (t) = (Y1 (t), …, Yk (t)) has a multivariate normal distribution with a covariance matrix which can be calculated as follows:
where [Cd×k(t)]T = ‖cji(t)‖, j = 1, 2, …, k; i = 1, 2, … d, is the corresponding transposed matrix of [k × d] dimensions.
Remark 3
While condition (10) implies that the fractions Δi(t; N), i = 1, 2, …, d, are linear dependent, there exist d − 1 joint normal distributions of lower dimensions that are asymptotically non-degenerate.
In a contrast of Theorem A the following results hold as both the number of the ancestors and the time increase to infinity.
Theorem 5
Assume N, t → ∞ and Condition (i) of Theorem 4. Then
(i) For every k = 1, 2, …, d
where Yk is a normally distributed r.υ. with IE{Yk }= 0 and
(ii) For every k = 2, 3, …, d − 1 and every subset (n1, n2,…, nk) with nonrecurring elements from the set {1, 2, …, d} the following joint distributions hold
where the random variables (Yn1, …, Ynk) have a multivariate normal distribution with
| (43) |
Proof
Using (31) it is not difficult to obtain the following relations
| (44) |
Note first that from (17) and (22) one obtains in probability. On the other hand, by (3), (11) and (12) it is not difficult to see that pk(t) →pk, k = 1, 2, …, d.Finally applying Theorem 4, (32), one has where (ξ1,…,ξd) have a multivariate normal distribution withzero means and a covariance matrix C =‖Cjk‖. Therefore by (44) it follows that
| (45) |
which obviously proves the statement (i).
The statement (ii) follows immediately by (44) and (45) applying Theorem 4, (i), where one have to check only that IE{YiYj)} is well presented by (43).
Remark 4
Surprisingly in the critical and supercritical cases there are not analogs of Theorem 5. We will consider only the discrete-time case, because the continuous-time case is quite similar.
Indeed, under the conditions of Theorem 4, (ii), it is not difficult to see that
where η1 = … = ηd a.s., η1 ∊ N (0,1). Since then Yk = 0 a.s., k = 1, 2, …, d.
Similarly, under the conditions of Theorem 4, (iii),
where
Therefore Yk = 0 a.s., k = 1, 2, …, d.
Remark 5
It is interesting to compare Theorem A with Theorem 4 and 5 (see also Remark 4). While Theorem A is valid for any kind of branching processes and does not depend of the criticality, then Theorem 4 and 5 are proved only in the Markov case and depend essentially by the usual trichotomy of the processes as well as by the relationship between N and t. Theorem 4 and 5 complete Theorem A in the case when t is large enough (but the obtained results are quite different).
6. Concluding Remarks
As it was pointed out in the Introduction and in Section 2, there are experimental situations in the cell biology where analyzing the relative frequencies, Δi(t; N), of cell types rather than the total cell counts Zi(t; N), i = 1, 2, …, d, may be quite advantageous. Should this be the case, the property of asymptotic normality given by Theorem A ((ii) and (iii)) and Theorem 5 could be useful in developing the needed statistical inference of model parameters from experimental data. On the other hand, if the process can be observed directly then one can apply Theorem 4 or Theorem A, (i).
In particular, the following observation process is directly relevant to quantitative studies of proliferation, differentiation, and death of cells. Suppose that the process under study begins with cells of type T1 and the values of Nk are all large, i.e., N0 = min{N1,N2,…, Nn} → ∞ The descendants of the first N1 ancestors are examined only once at time t1 to determine the observations of Zi(t1; N1) or Δi(t1; N1) = 1, 2, …, d, whereupon the observation process is discontinued (i.e. the cells under examination are destroyed). At the next moment t2 t1, the process Zi(t2; N2) or the fractions Δi(t2; N2), i = 1, 2, …, d, related to the descendants of the second N2 ancestors are observed, and so on. This procedure results in n independent observations of the form:
ζk = Z(tk; Nk) = (Z1(tk; Nk), …, Zd(tk; Nk)) or ζk = Δ(tk;Nk) = (Δ1(tk; Nk), …, Δd(tk; Nk)), k = 1, 2, …, n; t1 ≤ t2 ≤ … ≤ tn,
where each vector ζk is asymptotically normal in accordance with Theorems 4, 5 or A.
Denoting the corresponding contribution to the log-likelihood function by Lk(ζk; tk, Nk), the overall log-likelihood is given by
| (46) |
The log-likelihood (45) depends on the offspring parameters only, which are of primary interest in applications and especially in cell kinetics studies. Finally the parameters can be estimated from the data on the process or the relative frequencies by maximizing the log-likelihood (46). Some applications are given in [32].
Lamperti [22] obtained interesting limiting distributions for supercritical BGW processes with infinite offspring variance. It is an open problem to consider multitype processes with infinite variance when N and t tend to infinity.
Jagers [15] was probably the first to consider relative frequencies (proportions, fractions) of cells within the framework of multitype branching processes. He studied asymptotic (as t → ∞) properties of a reducible age-dependent branching process with two types of cells and proved convergence of their relative frequencies to non-random limits in mean square and almost surely on the non-extinction set. The usefulness of such frequencies in cell cycle analysis was further demonstrated by Mode [26] considering a four-type irreducible age-dependent branching process. Mode built his cell cycle analysis on a model of multitype positively regular age-dependent branching process. In the supercritical case, he proved that lim Δk(t) = δk a.s. as t → ∞, providing the population does not become extinct. It should be noted that the constants δk, k = 1, 2, …,d, depend only on the offspring characteristics. In fact, , where is a left eigenvector of the matrix with a Malthusian parameter α, while is the Laplace-Stieltjes transform of the life-span distribution Gk(t) for the type k(k= 1, 2, …, d). In his monograph, Mode [25] also considered the utility of fractions and reported a similar result for the BGW process.
The results obtained by Jagers and Mode for some age-dependent branching processes suggest that it would be interesting to investigate the asymptotic behavior of the fractions Δ(t; N) as well as the processes Z(t; N) in the non Markov case when both parameters N and t tend to infinity simultaneously. It is anticipated that such asymptotic properties will depend on a specific branching model and its reducibility. Another problem is the investigation (in the same situation) of the the most complicated models of branching processes such as considered in [2, 6, 34].
Acknowledgments
This paper was prepared while N. Yanev was a visiting professor in the Department of Bio-statistics and Computational Biology, University of Rochester, and he is grateful for hospitality and inspiring communications with his colleagues.
Footnotes
Mathematics subject classifications. Primary: 60J80, 60J85; secondary: 62P10, 92D25.
References
- 1.Athreya KB, Ney PE. Branching Processes. Springer; Berlin: 1972. [Google Scholar]
- 2.Del Puerto IM, Yanev NM. Stationary distributions for branching processes with multi-type random control functions. J Applied Statistical Science. 2008;16(1):91–102. [Google Scholar]
- 3.Dion JP, Yanev NM. Statistical inference for branching processes with an increasing random number of ancestors. J Statistical Planing and Inference. 1994;39:329–359. [Google Scholar]
- 4.Dion JP, Yanev NM. CLT for martingales in BGWR branching processes with some statistical applications. Mathematical Methods in Statistics. 1995;4:344–358. [Google Scholar]
- 5.Dion JP, Yanev NM. Limit theorems and estimation theory for branching processes with an increasing random number of ancestors. J Applied Probability. 1997;34:309–327. [Google Scholar]
- 6.Drmota M, Louchard G, Yanev NM. Analysis of a Recurrence Related to Critical Nonhomogeneous Branching Processes. Stochastic Analysis and Applications. 2006;24(1):37–59. [Google Scholar]
- 7.Feller W. Diffusion processes in genetics. Proceedings Second Berkeley Symp Math Stat Prob; Univ of California Press; 1951. pp. 227–246. [Google Scholar]
- 8.Feller W. An Introduction to Probability and Its Applications. 2. Vol. 2. Wiley; New York: 1971. [Google Scholar]
- 9.Grimvall A. On the convergence of sequences of branching processes. Annals of Probability. 1974;2:1027–1045. [Google Scholar]
- 10.Guttorp P. Statistical inference for Branching Processes. Wiley; New York: 1991. [Google Scholar]
- 11.Haccou P, Jagers P, Vatutin VA. Branching Processes: Variation, Growth and Extinction of Populations. Cambridge University Press; Cambridge: 2005. [Google Scholar]
- 12.Harris TE. Branching Processes. Springer; New York: 1963. [Google Scholar]
- 13.Hyrien O, Mayer-Proschel M, Noble M, Yakovlev AY. Estimating the life-span of oligodendrocytes from clonal data on their development in cell culture. Mathematical Biosciences. 2005;193:255–274. doi: 10.1016/j.mbs.2004.07.003. [DOI] [PubMed] [Google Scholar]
- 14.Hyrien O, Mayer-Proschel M, Noble M, Yakovlev AY. A stochastic model to analyze clonal data on multi-type cell populations. Biometrics. 2005;61:199–207. doi: 10.1111/j.0006-341X.2005.031210.x. [DOI] [PubMed] [Google Scholar]
- 15.Jagers P. The proportions of individuals of different kinds in two-type populations. A branching process problem arising in biology. J Applied Probability. 1969;6:249–260. [Google Scholar]
- 16.Jagers P. Branching Processes with Biological Applications. Wiley; London: 1975. [Google Scholar]
- 17.Jiřina P. On Feller’s branching diffusion processes. Ĝasopis PěSt Math. 1969;94:84–90. [Google Scholar]
- 18.Kimmel M, Axelrod DE. Branching Processes in Biology. Springer; New York: 2002. [Google Scholar]
- 19.Kolmogorov AN. Zur Lösung einer biologischen Aufgabe. Proceedings Tomsk State Univ. 1938;2:1–6. [Google Scholar]
- 20.Kolmogorov AN, Dmitriev NA. Branching random processes. Dokl Akad Nauk (Proceedings Acad Sci USSR) 1947;56:7–10. (in Russian) [Google Scholar]
- 21.Kolmogorov AN, Sevastyanov BA. Calculation of final probabilities of branching random processes. Dokl Akad Nauk (Proceedings Acad Sci USSR) 1947;56:783–786. (in Russian) [Google Scholar]
- 22.Lamperti J. Limiting distributions for branching processes. 1967; Proceedings Fifth Berkeley Symp Math Stat Prob; Univ of California Press; pp. 225–241. [Google Scholar]
- 23.Lindvall T. Weak convergence of probability measures and random functions in the function space D[0,∞) J Applied Probability. 1973;10:109–121. [Google Scholar]
- 24.Lindvall T. Limit theorems for some functionals of certain Galton-Watson branching processes. Advances of Applied Probability. 1974;6:309–321. [Google Scholar]
- 25.Mode CJ. Multitype Branching Processes: Theory and Applications. Elsevier; New York: 1971. [Google Scholar]
- 26.Mode CJ. Multitype age-dependent branching processes and cell cycle analysis. Mathamatical Biosciences. 1971;10:177–190. [Google Scholar]
- 27.Quine MP. A note of the moment structure of the multitype Galton-Watson process. Biometrika. 1970;57:219–222. [Google Scholar]
- 28.Sevastyanov BA. Branching Processes. Nauka; Moscow: 1971. (in Russian) [Google Scholar]
- 29.Yakovlev AY, Yanev NM. Transient Processes in Cell Proliferation Kinetics, Lecture Notes in Biomathematics. Vol. 82. Springer; Berlin: 1989. [Google Scholar]
- 30.Yakovlev AY, Boucher K, Mayer-Proschel M, Noble M. Quantative insight into proliferation and differentiation of oligodendroctyte type 2 astrocyte progenitor cells in vitro. Proceeding National Acad Sci USA. 1998;95:14164–14167. doi: 10.1073/pnas.95.24.14164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Yakovlev AY, Yanev NM. Branching stochastic processes with immigration in analysis of renewing cell populations. Mathematical Biosciences. 2006;203:37–63. doi: 10.1016/j.mbs.2006.06.001. [DOI] [PubMed] [Google Scholar]
- 32.Yakovlev AY, Yanev NM. Relative frequencies in multitype branching processes. Annals of Applied Probability. 2009;19(1):1–14. [Google Scholar]
- 33.Yanev NM. On the statistics of branching processes. Theory of Probability and Applications. 1975;20:612–622. [Google Scholar]
- 34.Yanev NM, Mitov KV. Critical branching processes with nonhomogeneous migration. Annals of Probability. 1985;13:923–933. [Google Scholar]
- 35.Yanev N, Jordan CT, Catlin S, Yakovlev A. Two-type Markov branching processes with immigration as a model of leukemia cell kinetics. Proceedings Bulg Acad Sci. 2005;58:1025–1032. [Google Scholar]
- 36.Yanev NM. Statistical Inference for Branching Processes. In: Ahsanullah M, Yanev GP, editors. Records and Branching Processes. NOVA Science Publishers; 2008. pp. 143–168. [Google Scholar]
- 37.Zorin AV, Yakovlev AY, Mayer-Proschel M, Noble M. Estimation problems associated with stochastic modeling of proliferation and differentiation of O-2A progenitor cells in vitro. Mathematical Biosciences. 2000;67:109–121. doi: 10.1016/s0025-5564(00)00040-7. [DOI] [PubMed] [Google Scholar]
