Published in final edited form as: Adv Appl Probab. 2014 Sep;46(3):704–718. doi: 10.1239/aap/1409319556

ON CLASSES OF EQUIVALENCE AND IDENTIFIABILITY OF AGE-DEPENDENT BRANCHING PROCESSES

RUI CHEN, OLLIVIER HYRIEN

Abstract

Age-dependent branching processes are increasingly used in analyses of biological data. Despite being central to most statistical procedures, the identifiability of these models has not been studied. In this paper, we partition a family of age-dependent branching processes into equivalence classes over which the distribution of the population size remains identical. This result can be used to study identifiability of the offspring and lifespan distributions for parametric families of branching processes. For example, we identify classes of Markov processes that are not identifiable. We show that age-dependent processes with (non-exponential) gamma-distributed lifespans are identifiable, and that Smith-Martin processes are not always identifiable.

Keywords: Identifiability, Bellman-Harris Process, Sevastyanov Process, Smith-Martin process

1. Introduction

Let Z(t) denote the size of a population governed by an age-dependent branching process started at t = 0 with a single particle or cell of age 0. Upon completion of its lifespan, every cell produces a random number of offspring ξ ∈ 𝒥 := {0, 1, 2, …, J}, where J is a given positive integer. Let p := (p0, …, pJ), where pj := P(ξ = j), j ∈ 𝒥, denote the offspring distribution. Put h(u; p) := Σ_{j∈𝒥} pj u^j, u ∈ [−1, 1], and μ := E(ξ) = Σ_{j∈𝒥} j pj for its probability generating function (p.g.f.) and expectation. A cell that produces a single offspring (ξ = 1) is said to be quiescent. This feature is relevant when modeling tumor growth ([1]; see also [5]). Throughout, we shall implicitly assume that p1 ∈ [0, 1). Put 𝒥(p) := {j ∈ 𝒥 : pj > 0}. For every j ∈ 𝒥(p), let Gj(t) := P(τ ≤ t | ξ = j), t ≥ 0, denote the conditional cumulative distribution function (c.d.f.) of the lifespan τ, given ξ = j. Write 𝒟 for the class of all absolutely continuous (a.c.) c.d.f.s F that are proper and satisfy F(0+) = 0 (the assumption of a.c. is not needed but simplifies the presentation). Assume that Gj ∈ 𝒟, j ∈ 𝒥(p). As usual, every cell evolves independently of all other cells. Put G = {Gj, j ∈ 𝒥(p)}. We shall refer to C = (p, G) as the characteristics of the process. The process is of Bellman-Harris type if the c.d.f.s Gj are identical for all j ∈ 𝒥(p). Otherwise the lifespan and the offspring number may be dependent, and the process belongs to the class of Sevastyanov processes [7, 4, 3].
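As a concrete illustration of the model just described (this sketch is ours and not part of the original paper), the following Python function simulates Z(t) for given characteristics (p, G); the function and argument names are purely illustrative.

```python
import random

def simulate_Z(t, p, sample_lifespan, rng=random):
    """Simulate Z(t) for an age-dependent branching process started at time 0
    from a single cell of age 0.

    p               : offspring distribution, p[j] = P(xi = j)
    sample_lifespan : sample_lifespan(j, rng) draws a lifespan from G_j
    """
    pending = [0.0]  # birth times of cells whose fate has not been resolved yet
    alive_at_t = 0
    while pending:
        birth = pending.pop()
        j = rng.choices(range(len(p)), weights=p)[0]  # offspring number xi
        death = birth + sample_lifespan(j, rng)       # end of the lifespan
        if death > t:
            alive_at_t += 1                 # the cell is still alive at time t
        else:
            pending.extend([death] * j)     # j daughters are born at 'death'
    return alive_at_t

# Example: Bellman-Harris process with p = (0.2, 0.5, 0.3) and Exp(1) lifespans.
# z = simulate_Z(3.0, [0.2, 0.5, 0.3], lambda j, rng: rng.expovariate(1.0))
```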

In this paper, we study the following question: are there distinct characteristics (p, G) under which the distribution of the process Z(t) is identical? This question is relevant to the problem of model identifiability, which is a central prerequisite to most statistical procedures. Although age-dependent branching processes are widely used in biology, this question does not appear to have been studied for this class of models [2, 5, 9]. Answering this question will inform us about what can or cannot be estimated by only observing Z(t), a situation that arises frequently in cell biology.

Let 𝒞 denote the class of all processes that satisfy the above assumptions. It will be useful to define a subclass of 𝒞, say 𝒞₀, consisting of the processes with characteristics (p, G) satisfying p1 = 0. We shall say that two processes with characteristics (p, G) and (p̂, Ĝ) are equivalent if, for all t ≥ 0, the distribution of Z(t) is the same under either set of characteristics. Let 𝒞_{p,G} denote the collection of processes in 𝒞 that are equivalent to the process with characteristics (p, G). It forms an equivalence class, and our objective is to identify all the processes included in this class for any admissible set of characteristics (p, G). If the class includes processes other than the process with characteristics (p, G), then p and G cannot be unequivocally identified from the marginal distribution of Z(t), t ≥ 0.

We construct the class 𝒞_{p,G} in the next section, proceeding in three steps. First, we identify a collection of equivalent processes (Section 2.1). Next, by inverting the transformation that defines this collection about a properly chosen process, we find a larger collection of equivalent processes (Section 2.2). Finally, we prove, when J = 2 (which is typical of most biological applications) and J = 3, that this larger collection is identical to 𝒞_{p,G} (Section 2.3). Each equivalence class contains a single process such that p1 = 0. When J = 2, the equivalence classes are fully characterized by the expectation and the variance of Z(t) (Section 2.4). Our results can be used to study identifiability of parametric families of models. For example, we find that the Markov version of the process is not always identifiable (Section 3.1). The age-dependent process with (non-exponential) gamma-distributed lifespan is identifiable (Section 3.2). We also find that the Smith-Martin process is not always identifiable (Section 3.3).

2. Main results

2.1. A collection of equivalent processes

For every p1 ∈ [0, 1) and a ∈ [0, p1], define p^(a) = (p0^(a), …, pJ^(a)), where

$$p_j^{(a)} := \begin{cases} \dfrac{p_j}{1-a}, & j \in \mathcal{J}\setminus\{1\},\\[4pt] \dfrac{p_1-a}{1-a}, & j = 1. \end{cases} \qquad(1)$$

Notice that 𝒥(p)\{1} = 𝒥(p^(a))\{1}. By convention, when p1 = 0, G1 will denote any c.d.f. in 𝒟. For every t ≥ 0, j ∈ 𝒥(p), a ∈ [0, p1], put

$$G_j^{(a)}(t) := (1-a)\sum_{k=0}^{\infty} a^k\,\bigl(G_j * G_1^{*k}\bigr)(t), \qquad(2)$$

where (Gj ∗ G1)(t) := ∫_0^t Gj(t − x) dG1(x) denotes the convolution of Gj and G1, and where G1^{∗k}(t) := ∫_0^t G1^{∗(k−1)}(t − x) dG1(x) is the k-fold convolution of G1 with itself. For every p1 ∈ [0, 1) and a ∈ [0, p1], it can be verified that Gj^(a) is the c.d.f. of a proper distribution; it can be interpreted as the c.d.f. of a (non-Markov) phase-type distribution, and the Laplace transform of gj^(a)(t) := dGj^(a)(t)/dt is

$$\mathcal{L}g_j^{(a)}(s) = \frac{(1-a)\,\mathcal{L}g_j(s)}{1 - a\,\mathcal{L}g_1(s)}, \qquad(3)$$

where ℒgj(s) denotes the Laplace transform of gj(t) := dGj(t)/dt. Write G^(a) = {Gj^(a), j ∈ 𝒥(p^(a))} and C^(a) = (p^(a), G^(a)).
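The phase-type reading of eqn. (2) suggests a direct way to sample from the transformed characteristics: a draw from Gj^(a) is a Gj lifespan preceded by a Geometric(a) number of independent G1 phases. The sketch below (ours, with illustrative names, continuing the simulation example of Section 1) implements eqns. (1) and (2) under that interpretation.

```python
import random

def transformed_offspring(p, a):
    """Offspring distribution p^(a) of eqn. (1); requires 0 <= a <= p[1] < 1."""
    q = [pj / (1.0 - a) for pj in p]
    q[1] = (p[1] - a) / (1.0 - a)
    return q

def sample_transformed_lifespan(j, a, sample_lifespan, rng=random):
    """Draw from G_j^(a) of eqn. (2): K extra G_1 phases, with
    P(K = k) = (1 - a) a^k, followed by one G_j phase."""
    tau = sample_lifespan(j, rng)
    while rng.random() < a:              # one more G_1 phase with probability a
        tau += sample_lifespan(1, rng)
    return tau
```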

Let 𝒮_{p,G} denote the collection of processes with characteristics (p^(a), G^(a)), a ∈ [0, p1]. Since (p^(0), G^(0)) = (p, G), 𝒮_{p,G} includes the process with characteristics (p, G); thus it is never empty. Moreover, since p1^{(p1)} = 0, 𝒮_{p,G} always includes at least one process from 𝒞₀. This process will play a central role in constructing 𝒞_{p,G}.

Theorem 1

For all t ≥ 0, the distribution of the population size process Z(t) is identical under all processes included in 𝒮_{p,G}; that is, 𝒮_{p,G} ⊂ 𝒞_{p,G}.

Proof

Let Φ_C(u, t) := E{u^{Z(t)} | Z(0) = 1}, u ∈ [−1, 1] and t ≥ 0, denote the p.g.f. of Z(t) under the process with characteristics C. Conditioning on the lifespan of the cell initiating the population yields

$$\Phi_C(u,t) = u\Bigl\{1 - \sum_{j\in\mathcal{J}(p)} p_j\,G_j(t)\Bigr\} + \sum_{j\in\mathcal{J}(p)} p_j \int_0^t \Phi_C(u,t-x)^j\,dG_j(x). \qquad(4)$$

For every j ∈ 𝒥 and u ∈ [−1, 1], let ℒΦ_C^j(u, s) := ∫_0^∞ e^{−st} Φ_C(u, t)^j dt denote the Laplace transform of Φ_C(u, t)^j. Put ℒΦ_C(u, s) = ℒΦ_C^1(u, s). Since |Φ_C(u, t)| ≤ 1 for every u ∈ [−1, 1] and t ≥ 0, we have that ℒΦ_C^j(u, s) < ∞ for every s > 0. Also, it follows from eqn. (4) that ℒΦ_C(u, s), s > 0, satisfies

$$\mathcal{L}\Phi_C(u,s) = \frac{u}{s}\Bigl\{1 - \sum_{j\in\mathcal{J}(p)} p_j\,\mathcal{L}g_j(s)\Bigr\} + \sum_{j\in\mathcal{J}(p)} p_j\,\mathcal{L}\Phi_C^j(u,s)\,\mathcal{L}g_j(s). \qquad(5)$$

For every a ∈ [0, p1], eqn. (5) can be rearranged into

$$\{1 - a\,\mathcal{L}g_1(s)\}\,\mathcal{L}\Phi_C(u,s) = \frac{u}{s}\Bigl\{1 - \sum_{j\in\mathcal{J}(p)\setminus\{1\}} p_j\,\mathcal{L}g_j(s) - p_1\,\mathcal{L}g_1(s)\Bigr\} + \sum_{j\in\mathcal{J}(p)\setminus\{1\}} p_j\,\mathcal{L}\Phi_C^j(u,s)\,\mathcal{L}g_j(s) + (p_1 - a)\,\mathcal{L}\Phi_C(u,s)\,\mathcal{L}g_1(s).$$

Dividing both sides of the equation by 1 − a ℒg1(s) yields

$$\begin{aligned}
\mathcal{L}\Phi_C(u,s) &= \frac{u}{s}\Bigl\{\frac{1}{1-a\,\mathcal{L}g_1(s)} - \sum_{j\in\mathcal{J}(p^{(a)})\setminus\{1\}} \frac{p_j}{1-a}\,\frac{(1-a)\,\mathcal{L}g_j(s)}{1-a\,\mathcal{L}g_1(s)} - \frac{p_1-a}{1-a}\,\frac{(1-a)\,\mathcal{L}g_1(s)}{1-a\,\mathcal{L}g_1(s)} - \frac{a\,\mathcal{L}g_1(s)}{1-a\,\mathcal{L}g_1(s)}\Bigr\} \\
&\quad + \sum_{j\in\mathcal{J}(p^{(a)})\setminus\{1\}} \frac{p_j}{1-a}\,\frac{(1-a)\,\mathcal{L}g_j(s)}{1-a\,\mathcal{L}g_1(s)}\,\mathcal{L}\Phi_C^j(u,s) + \frac{p_1-a}{1-a}\,\frac{(1-a)\,\mathcal{L}g_1(s)}{1-a\,\mathcal{L}g_1(s)}\,\mathcal{L}\Phi_C(u,s) \\
&= \frac{u}{s}\Bigl\{1 - \sum_{j\in\mathcal{J}(p^{(a)})} p_j^{(a)}\,\mathcal{L}g_j^{(a)}(s)\Bigr\} + \sum_{j\in\mathcal{J}(p^{(a)})} p_j^{(a)}\,\mathcal{L}\Phi_C^j(u,s)\,\mathcal{L}g_j^{(a)}(s). \qquad(6)
\end{aligned}$$

By comparing eqns. (5) and (6), we deduce that ℒΦ_C(u, s) = ℒΦ_{C^(a)}(u, s); hence the processes with characteristics (p, G) and (p^(a), G^(a)), a ∈ [0, p1], are equivalent.
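Theorem 1 lends itself to a simple Monte Carlo check (our sketch, not part of the paper): simulate Z(t) under C and under C^(a) with a = p1 and compare the empirical distributions, which should agree up to sampling error. The code reuses the hypothetical helpers simulate_Z, transformed_offspring, and sample_transformed_lifespan sketched earlier.

```python
import random
from collections import Counter

def empirical_Z(n, t, p, sample_lifespan, rng=random):
    """Empirical distribution of Z(t) from n independent simulated populations."""
    counts = Counter(simulate_Z(t, p, sample_lifespan, rng) for _ in range(n))
    return {z: c / n for z, c in sorted(counts.items())}

p = [0.2, 0.5, 0.3]                       # original offspring distribution
G = lambda j, rng: rng.expovariate(1.0)   # exponential(1) lifespans

a = p[1]                                  # a = p1, so that p1^(a) = 0
p_a = transformed_offspring(p, a)
G_a = lambda j, rng: sample_transformed_lifespan(j, a, G, rng)

print(empirical_Z(20000, 2.0, p, G))      # the two printed distributions
print(empirical_Z(20000, 2.0, p_a, G_a))  # should agree up to Monte Carlo error
```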

2.2. A larger collection of equivalent processes

By inverting the transformation (p, G) → (p^(a), G^(a)), a ∈ [0, p1], about a properly chosen process in 𝒮_{p,G}, we construct a collection of equivalent processes that is larger than 𝒮_{p,G}. Setting a = p1 in eqns. (1) and (3) yields

$$p_j^{(p_1)} = \begin{cases} \dfrac{p_j}{1-p_1}, & j\in\mathcal{J}\setminus\{1\},\\[4pt] 0, & j = 1, \end{cases} \qquad \mathcal{L}g_j^{(p_1)}(s) = \frac{(1-p_1)\,\mathcal{L}g_j(s)}{1-p_1\,\mathcal{L}g_1(s)}, \quad j\in\mathcal{J}(p)\setminus\{1\},$$

which identifies a process in 𝒞₀. We remark that Φ_{C^(p1)}(u, t) does not depend on G1^{(p1)}. Also, any process with characteristics (p̂, Ĝ) that satisfies

$$\begin{cases} \hat p_j^{(\hat p_1)} = p_j^{(p_1)}, & j\in\mathcal{J}\setminus\{1\},\\ \mathcal{L}\hat g_j^{(\hat p_1)}(s) = \mathcal{L}g_j^{(p_1)}(s), & j\in\mathcal{J}(p)\setminus\{1\}, \end{cases} \qquad(7)$$

belongs to 𝒞_{p,G} because Φ_Ĉ(u, t) = Φ_{Ĉ^(p̂1)}(u, t) = Φ_{C^(p1)}(u, t) = Φ_C(u, t). By solving eqns. (7) we find that (p̂, Ĝ) satisfies

$$\begin{cases} \hat p_j = p_j^{(p_1)}\,(1-\hat p_1), & j\in\mathcal{J}\setminus\{1\},\\ \mathcal{L}\hat g_j(s) = \mathcal{L}g_j^{(p_1)}(s)\,\{1 - \hat p_1\,\mathcal{L}\hat g_1(s)\}/(1-\hat p_1), & j\in\mathcal{J}(p)\setminus\{1\}, \end{cases} \qquad(8)$$

where p̂1 ∈ [0, 1) and Ĝ1 ∈ 𝒟_{p,G}, and where 𝒟_{p,G} ⊂ 𝒟 is the set of distributions Ĝ1 such that the transforms ℒĝj(s), j ∈ 𝒥(p̂)\{1}, defined by eqns. (8) are the Laplace transforms of distributions in 𝒟. Write (p^{p̂1,Ĝ1}, G^{p̂1,Ĝ1}) for any characteristics that satisfy eqns. (8). Then, the collection of processes

$$\bar{\mathcal{S}}_{p,G} := \bigcup_{\hat p_1\in[0,1)}\ \bigcup_{\hat G_1\in\mathcal{D}_{p,G}} \bigl\{\text{process with characteristics } (p^{\hat p_1,\hat G_1},\, G^{\hat p_1,\hat G_1})\bigr\}$$

is included in 𝒞_{p,G}. It is also clear that 𝒮_{p,G} ⊂ 𝒮̄_{p,G}.

2.3. Exhaustivity of 𝒮̄_{p,G} when J = 2 and J = 3

Our final step toward identifying 𝒞_{p,G} is to prove that it coincides with 𝒮̄_{p,G}. Let Φ_C^{(k)}(u, t) := ∂^k Φ_C(u, t)/∂u^k denote the k-th order partial derivative of Φ_C(u, t), k = 1, 2, ···. Let mk(t) := E[∏_{l=0}^{k−1} {Z(t) − l} | Z(0) = 1], t ≥ 0, k = 1, 2, ···, denote the k-th order factorial moment of Z(t) under the process with characteristics C, and write m(t) = m1(t). We have that mk(t) = Φ_C^{(k)}(1, t). Differentiating both sides of eqn. (4) with respect to u at u = 1 yields the following integral equation for the expectation of the process:

$$m(t) = 1 - \sum_{j\in\mathcal{J}(p)} p_j\,G_j(t) + \sum_{j\in\mathcal{J}(p)} j\,p_j \int_0^t m(t-x)\,dG_j(x). \qquad(9)$$
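Equation (9) is a renewal-type integral equation and can be solved numerically by forward substitution on a time grid. The sketch below (ours; a crude one-step discretization of the convolution, with illustrative names) is one way to compute m(t).

```python
import math

def mean_Z(T, h, p, G_cdf):
    """Approximate the solution m of eqn. (9) on the grid 0, h, 2h, ..., T.

    p     : offspring distribution, p[j] = P(xi = j)
    G_cdf : G_cdf(j, t) returns G_j(t)
    """
    n = int(T / h) + 1
    J = [j for j, pj in enumerate(p) if pj > 0]
    # increment of G_j over each grid cell [kh, (k+1)h]
    dG = {j: [G_cdf(j, (k + 1) * h) - G_cdf(j, k * h) for k in range(n)] for j in J}
    m = [1.0] + [0.0] * (n - 1)          # m(0) = 1 since G_j(0) = 0
    for i in range(1, n):
        t = i * h
        conv = sum(j * p[j] * sum(m[i - 1 - k] * dG[j][k] for k in range(i))
                   for j in J)
        m[i] = 1.0 - sum(p[j] * G_cdf(j, t) for j in J) + conv
    return m

# Example (Markov case with unit rates):
# m = mean_Z(5.0, 0.01, [0.2, 0.5, 0.3], lambda j, t: 1 - math.exp(-t))
```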

The second and third order factorial moments satisfy

$$m_2(t) = \sum_{j\in\mathcal{J}(p)} j\,p_j \int_0^t m_2(t-x)\,dG_j(x) + \sum_{j\in\mathcal{J}(p)} j(j-1)\,p_j \int_0^t m(t-x)^2\,dG_j(x), \qquad(10)$$

and

$$m_3(t) = \sum_{j\in\mathcal{J}(p)} j\,p_j \int_0^t m_3(t-x)\,dG_j(x) + \sum_{j\in\mathcal{J}(p)} 3j(j-1)\,p_j \int_0^t m(t-x)\,m_2(t-x)\,dG_j(x) + \sum_{j\in\mathcal{J}(p)} j(j-1)(j-2)\,p_j \int_0^t m(t-x)^3\,dG_j(x). \qquad(11)$$

Let ℒmk(s) denote the Laplace transform of mk(t), k = 1, 2, 3. Taking the Laplace transform of both sides of eqns. (9)–(11) and rearranging the terms yields

$$\mathcal{L}m(s) = \frac{1 - \sum_{j\in\mathcal{J}(p)} p_j\,\mathcal{L}g_j(s)}{s\,\bigl\{1 - \sum_{j\in\mathcal{J}(p)} j\,p_j\,\mathcal{L}g_j(s)\bigr\}}, \qquad(12)$$

$$\mathcal{L}m_2(s) = \frac{\mathcal{L}m^2(s) \sum_{j\in\mathcal{J}(p)} j(j-1)\,p_j\,\mathcal{L}g_j(s)}{1 - \sum_{j\in\mathcal{J}(p)} j\,p_j\,\mathcal{L}g_j(s)}, \qquad(13)$$

and

$$\mathcal{L}m_3(s) = \frac{\mathcal{L}m^3(s) \sum_{j\in\mathcal{J}(p)} j(j-1)(j-2)\,p_j\,\mathcal{L}g_j(s)}{1 - \sum_{j\in\mathcal{J}(p)} j\,p_j\,\mathcal{L}g_j(s)} + \frac{3\,\mathcal{L}mm_2(s)\,\mathcal{L}m_2(s)}{\mathcal{L}m^2(s)}, \qquad(14)$$

where ℒm²(s), ℒm³(s), and ℒmm₂(s) denote the Laplace transforms of m(t)², m(t)³, and m(t)m₂(t), respectively.

Lemma 1

Suppose that J = 2 or J = 3. For every admissible (p, G), the equivalence class 𝒞_{p,G} includes a single process in 𝒞₀.

Proof

Assume first that J = 3. Consider two processes in 𝒞₀ with characteristics C = (p, G) and Ĉ = (p̂, Ĝ); thus p1 = 0 and p̂1 = 0. Suppose that these processes are equivalent; that is, they both belong to 𝒞_{p,G}. Then Φ_C(u, t) = Φ_Ĉ(u, t) and Φ_C^{(k)}(1, t) = Φ_Ĉ^{(k)}(1, t), u ∈ [−1, 1], t ≥ 0, and k = 1, 2, 3. Write m̂k(t) for the k-th order factorial moment of the process with characteristics Ĉ. Hence ℒmk(s) = ℒm̂k(s), which, using identities (12)–(14), yields

$$\begin{cases} \dfrac{1 - p_0\,\mathcal{L}g_0(s) - p_2\,\mathcal{L}g_2(s) - p_3\,\mathcal{L}g_3(s)}{d(s)} = \dfrac{1 - \hat p_0\,\mathcal{L}\hat g_0(s) - \hat p_2\,\mathcal{L}\hat g_2(s) - \hat p_3\,\mathcal{L}\hat g_3(s)}{\hat d(s)},\\[8pt] \dfrac{\mathcal{L}m^2(s)\,\{2p_2\,\mathcal{L}g_2(s) + 6p_3\,\mathcal{L}g_3(s)\}}{d(s)} = \dfrac{\mathcal{L}\hat m^2(s)\,\{2\hat p_2\,\mathcal{L}\hat g_2(s) + 6\hat p_3\,\mathcal{L}\hat g_3(s)\}}{\hat d(s)},\\[8pt] \dfrac{6\,\mathcal{L}m^3(s)\,p_3\,\mathcal{L}g_3(s)}{d(s)} + \dfrac{3\,\mathcal{L}mm_2(s)\,\mathcal{L}m_2(s)}{\mathcal{L}m^2(s)} = \dfrac{6\,\mathcal{L}\hat m^3(s)\,\hat p_3\,\mathcal{L}\hat g_3(s)}{\hat d(s)} + \dfrac{3\,\mathcal{L}\hat m\hat m_2(s)\,\mathcal{L}\hat m_2(s)}{\mathcal{L}\hat m^2(s)}, \end{cases}$$

where d(s) = 1 − 2p2ℒg2(s) − 3p3ℒg3(s) and d̂(s) = 1 − 2p̂2ℒĝ2(s) − 3p̂3ℒĝ3(s). Since m̂k(t) = mk(t), k = 1, 2, 3, and hence ℒm̂²(s) = ℒm²(s), ℒm̂³(s) = ℒm³(s), and ℒm̂m̂₂(s) = ℒmm₂(s), the above system reduces to

$$p_j\,\mathcal{L}g_j(s)/d(s) = \hat p_j\,\mathcal{L}\hat g_j(s)/\hat d(s), \qquad j = 0, 2, 3. \qquad(15)$$

The above equations obtained when j = 2, 3 yield

$$\begin{cases} p_2\,\mathcal{L}g_2(s) - 3\,p_2\,\mathcal{L}g_2(s)\,\hat p_3\,\mathcal{L}\hat g_3(s) = \hat p_2\,\mathcal{L}\hat g_2(s) - 3\,\hat p_2\,\mathcal{L}\hat g_2(s)\,p_3\,\mathcal{L}g_3(s),\\ p_3\,\mathcal{L}g_3(s) - 2\,p_3\,\mathcal{L}g_3(s)\,\hat p_2\,\mathcal{L}\hat g_2(s) = \hat p_3\,\mathcal{L}\hat g_3(s) - 2\,\hat p_3\,\mathcal{L}\hat g_3(s)\,p_2\,\mathcal{L}g_2(s), \end{cases}$$

which implies that 2p2ℒg2(s) + 3p3ℒg3(s) = 2p̂2ℒĝ2(s) + 3p̂3ℒĝ3(s), hence d(s) = d̂(s), and the system of equations (15) reduces to pjℒgj(s) = p̂jℒĝj(s), j = 0, 2, 3. Hence (p̂, Ĝ) = (p, G) since the distributions Gj and Ĝj, j ∈ 𝒥(p), are all proper. This completes the proof when J = 3. The case J = 2 is treated similarly, except that we only use the first and second equations of the system (15) and set p3 = p̂3 = 0.

Theorem 2

We have 𝒮̄_{p,G} = 𝒞_{p,G} for every admissible (p, G) when J = 2 and J = 3.

Proof

We already know that 𝒮̄_{p,G} ⊂ 𝒞_{p,G}. To prove that the converse holds, let (p̂, Ĝ) denote the characteristics of any process included in 𝒞_{p,G}. Then, by construction, the process with characteristics (p̂^(p̂1), Ĝ^(p̂1)) belongs to 𝒞₀. We also know from Lemma 1 that (p̂^(p̂1), Ĝ^(p̂1)) = (p^(p1), G^(p1)). Hence the process with characteristics (p̂, Ĝ) belongs to 𝒮̄_{p,G}, which implies that 𝒞_{p,G} ⊂ 𝒮̄_{p,G}. This completes the proof.

2.4. Characterization of 𝒞_{p,G} using moments when J = 2

In data analyses, model parameters are sometimes estimated using moments of the process rather than its distribution. A relevant question is then: which moments are sufficient to fully characterize the equivalence class 𝒞_{p,G}? We show below that, when J = 2, the expectation and the variance suffice. This property does not appear to generalize to J > 2, however.

Theorem 3

Assume that J = 2 and that the marginal distribution of {Z(t), t ≥ 0} is determined by its moments. Then, 𝒞_{p,G} = {processes with characteristics (p̂, Ĝ) : m̂(t) = m(t), m̂₂(t) = m₂(t), t ≥ 0}.

Proof

To simplify the presentation, we assume, when pj = 0, that Gj is an arbitrary c.d.f. in 𝒟. For k = 2, 3, ···, it can be shown by induction, using the identity mk(t) = Φ_C^{(k)}(1, t), that

$$m_k(t) = p_2 \int_0^t \sum_{r=1}^{\lfloor k/2\rfloor} c_{kr}\, m_r(t-x)\, m_{k-r}(t-x)\,dG_2(x) + \sum_{j=1}^{2} j\,p_j \int_0^t m_k(t-x)\,dG_j(x),$$

where ⌊k/2⌋ denotes the largest integer less than or equal to k/2, and ckr are some positive integers. Then,

$$\mathcal{L}m_k(s) = p_2\, l_k(s)\, \mathcal{L}g_2(s) + \mathcal{L}m_k(s) \sum_{j=1}^{2} j\,p_j\,\mathcal{L}g_j(s), \qquad(16)$$

where lk(s) is the Laplace transform of Σ_{r=1}^{⌊k/2⌋} c_{kr} m_r(t) m_{k−r}(t). Hence,

$$\mathcal{L}m_k(s) = \frac{l_k(s)\, p_2\,\mathcal{L}g_2(s)}{1 - \sum_{j=1}^{2} j\,p_j\,\mathcal{L}g_j(s)}.$$

Let Ĉ = (p̂, Ĝ) denote the characteristics of any process in 𝒞_{p,G}. Then Φ_Ĉ(u, t) = Φ_C(u, t), t ≥ 0, u ∈ [−1, 1]. By assumption, Φ_C(u, t) is determined by its moments. Hence, 𝒞_{p,G} = {processes with characteristics (p̂, Ĝ) : m̂k(t) = mk(t), t ≥ 0, k ∈ ℕ}, where ℕ = {1, 2, ···}. We notice that m̂k(t) = mk(t), k ∈ ℕ, implies that ℒm̂k(s) = ℒmk(s) and l̂k(s) = lk(s), k ∈ ℕ, from which we deduce, when k = 2, that

$$\frac{p_2\,\mathcal{L}g_2(s)}{1 - \sum_{j=1}^{2} j\,p_j\,\mathcal{L}g_j(s)} = \frac{\hat p_2\,\mathcal{L}\hat g_2(s)}{1 - \sum_{j=1}^{2} j\,\hat p_j\,\mathcal{L}\hat g_j(s)}, \qquad(17)$$

and, when k = 3, 4 ···, that

$$\frac{l_k(s)\, p_2\,\mathcal{L}g_2(s)}{1 - \sum_{j=1}^{2} j\,p_j\,\mathcal{L}g_j(s)} = \frac{l_k(s)\, \hat p_2\,\mathcal{L}\hat g_2(s)}{1 - \sum_{j=1}^{2} j\,\hat p_j\,\mathcal{L}\hat g_j(s)}. \qquad(18)$$

Eqns. (17) and (18) are clearly equivalent when lk(s) ≠ 0. When lk(s) = 0, we deduce from eqn. (16) that ℒmk(s) = 0 and mk(t) = 0, k = 3, 4, ···. Thus, in either case, we conclude that 𝒞_{p,G} = {processes with characteristics (p̂, Ĝ) : m̂k(t) = mk(t), t ≥ 0, k = 1, 2}, which completes the proof.

3. Application to model identifiability

Results obtained in Section 2 are applicable to study identifiability of branching processes when specific parametric assumptions are made about the lifespan distributions. To shorten the discussion, we only consider the case where J = 2.

3.1. Exponentially distributed lifespan

We assume here that τ is conditionally exponentially distributed, given {ξ = j}: Gj(t) = 1 − e^{−ψj t}, t ≥ 0, for some ψj ∈ ℝ₊, j ∈ 𝒥(p). The resulting class of processes is denoted by 𝒞^M. We remark that ℒgj(s) = ψj/(ψj + s), j ∈ 𝒥(p); it is defined for s ∈ (−ψj, ∞), and extendable to s ∈ (−∞, −ψj) ∪ (−ψj, ∞) by analytic continuation.

For every admissible (p, G), let 𝒞^M_{p,G} := 𝒞_{p,G} ∩ 𝒞^M denote the class of all processes included in 𝒞^M that are equivalent to the process with characteristics (p, G). We say that the characteristics (p, G) are identified by {Z(t), t ≥ 0} if and only if 𝒞^M_{p,G} includes only the process with characteristics (p, G). To establish identifiability of (p, G), or lack thereof, it suffices to construct the class 𝒞^M_{p,G}. Let (p̂, Ĝ) denote the characteristics of any process in 𝒞^M_{p,G}. Then Ĝj, j ∈ 𝒥(p̂), is exponential.

Assume first that p1 = 0. If p̂1 = 0, Lemma 1 implies that (p̂, Ĝ) = (p, G). If p̂1 ∈ (0, 1), we know from Theorem 2 and eqn. (8) that, for every j ∈ 𝒥(p)\{1},

$$(1-\hat p_1)\,\frac{\hat\psi_j}{\hat\psi_j+s} = \frac{\psi_j}{\psi_j+s}\Bigl(1 - \hat p_1\,\frac{\hat\psi_1}{\hat\psi_1+s}\Bigr).$$

Rearranging the terms in the above identity leads to the polynomial equation

$$(1-\hat p_1)\,\hat\psi_j\,(\psi_j+s)(\hat\psi_1+s) = \psi_j\,\{(1-\hat p_1)\hat\psi_1+s\}(\hat\psi_j+s). \qquad(19)$$

This identity holds if and only if ψ̂j = ψ̂1 = ψj/(1 − p̂1). Hence, for any j1, j2 ∈ 𝒥(p), ψ_{j1} = ψ_{j2}, and the process with characteristics (p, G) must be of Bellman-Harris type. Write ψ := ψj, j ∈ 𝒥(p); then ψ̂j = ψ/(1 − p̂1). Using the first equation in (8), we deduce that p̂j = pj(1 − p̂1), p̂1 ∈ (0, 1).

Assume next that p1 ∈ (0, 1). If p̂1 = 0, a similar line of argument shows that the process with characteristics (p̂, Ĝ) satisfying ψ̂j = ψ(1 − p1) and p̂j = pj/(1 − p1), j ∈ 𝒥(p)\{1}, belongs to 𝒞^M_{p,G} if ψj = ψ, j ∈ 𝒥(p).

Now assume that p1 ∈ (0, 1) and p̂1 ∈ (0, 1). Then, for every j ∈ 𝒥(p)\{1}, we have

$$(1-\hat p_1)\,\frac{\hat\psi_j}{\hat\psi_j+s} = (1-p_1)\,\frac{\psi_j}{\psi_j+s}\,\Bigl(1-\hat p_1\,\frac{\hat\psi_1}{\hat\psi_1+s}\Bigr)\Big/\Bigl(1-p_1\,\frac{\psi_1}{\psi_1+s}\Bigr).$$

Rearranging the terms in the above identity leads to the polynomial equation

$$(1-\hat p_1)\,\hat\psi_j\,\{(1-p_1)\psi_1+s\}(\psi_j+s)(\hat\psi_1+s) = (1-p_1)\,\psi_j\,\{(1-\hat p_1)\hat\psi_1+s\}(\hat\psi_j+s)(\psi_1+s). \qquad(20)$$

Solving this equation together with the first equation in (8) for each j ∈ 𝒥(p)\{1} separately leads to three admissible sets of equations, denoted by Aj, Bj, and Cj (indexed by j):

$$(A_j)\ \begin{cases} (1-\hat p_1)\hat\psi_j = (1-p_1)\psi_j\\ \hat\psi_1 = \psi_1\\ \hat\psi_j = \psi_j\\ (1-\hat p_1)\hat\psi_1 = (1-p_1)\psi_1\\ \hat p_j/(1-\hat p_1) = p_j/(1-p_1), \end{cases} \qquad (B_j)\ \begin{cases} (1-\hat p_1)\hat\psi_j = (1-p_1)\psi_j\\ \psi_1 = \psi_j\\ \hat\psi_1 = \hat\psi_j\\ (1-\hat p_1)\hat\psi_1 = (1-p_1)\psi_1\\ \hat p_j/(1-\hat p_1) = p_j/(1-p_1), \end{cases} \qquad(21)$$

and

$$(C_j)\ \begin{cases} (1-\hat p_1)\hat\psi_j = (1-p_1)\psi_j\\ \hat\psi_1 = \psi_1\\ \psi_j = (1-\hat p_1)\hat\psi_1\\ \hat\psi_j = (1-p_1)\psi_1\\ \hat p_j/(1-\hat p_1) = p_j/(1-p_1). \end{cases} \qquad(22)$$

Assume first that p0p2 > 0. Then the collection of Markov processes that are equivalent to the process with characteristics (p, G) is determined by simultaneously solving the equations X0 and Y2, where X and Y stand symbolically for either A, B, or C. There are 9 such combinations:

  • For equations A0 and A2, it is easy to show that the only solution is (p̂, Ĝ) = (p, G). Thus, in this case, the process with characteristics (p, G) is identified.

  • Equations B0 and B2 admit solutions if and only if ψj = ψ, j ∈ 𝒥(p); that is, the process with characteristics (p, G) must be of Bellman-Harris type. When this condition is met, the solutions to the two equations satisfy ψ̂j = ψ̂, j ∈ 𝒥(p̂), where ψ̂ = ψ(1 − p1)/(1 − p̂1), p̂j = pj(1 − p̂1)/(1 − p1), and p̂1 ∈ (0, 1). Thus, any Markov Bellman-Harris process admits infinitely many equivalent processes in 𝒞^M, which are also (Markov) Bellman-Harris processes.

  • Equations C0 and C2 admit solutions if and only if ψ0 = ψ2 and ψ0 ≠ ψ1. Under these constraints, the unique solution to the two sets of equations is: ψ̂1 = ψ1, ψ̂0 = ψ̂2 = (1 − p1)ψ1, p̂1 = 1 − ψ0/ψ1, and p̂j = pjψ0/{(1 − p1)ψ1}, j = 0, 2. This solution always differs from (p, G), except when p1 = 1 − ψ0/ψ1. Thus, processes that satisfy the conditions ψ0 = ψ2 and ψ0 ≠ ψ1 are identifiable only if p1 = 1 − ψ0/ψ1. Otherwise, there exists a unique (p̂, Ĝ) that differs from (p, G) under which the distribution of the process Z(t) does not change.

  • Any other pair of equations admits solutions only under specific restrictions on (p, G). For example, equations A0 and B2 have a solution provided that ψ1 = ψ2. When this condition is met, the only solution to the equations is (p̂, Ĝ) = (p, G), which therefore identifies the initial process.

When either p0 = 0 or p2 = 0, we obtain the same set of solutions as above (details are omitted). We summarize the above findings in the following Corollary:

Corollary 1

Suppose that J = 2 and, for every j ∈ 𝒥(p), that Gj(t) = 1 − e^{−ψj t}, t ≥ 0, for some ψj ∈ ℝ₊. Then, (p, G) is uniquely identified by the process {Z(t), t ≥ 0}, except in the following cases:

  • Case 1: If ψj = ψ, j ∈ 𝒥(p) (Bellman-Harris case), then 𝒞^M_{p,G} includes the Markov processes with characteristics (p̂, Ĝ) ∈ {p̂1 ∈ [0, 1), p̂j = pj(1 − p̂1)/(1 − p1), j ∈ 𝒥(p)\{1}, ψ̂j = ψ(1 − p1)/(1 − p̂1), j ∈ 𝒥(p̂)}.

  • Case 2: If ψj = ψ, j ∈ 𝒥(p)\{1}, p1 ∈ (0, 1), ψ < ψ1, and p1 ≠ 1 − ψ/ψ1 (“extended” Bellman-Harris case), then 𝒞^M_{p,G} consists of the processes in 𝒞^M with characteristics (p, G) and (p̂, Ĝ), where p̂1 = 1 − ψ/ψ1, p̂j = pjψ/{(1 − p1)ψ1}, j ∈ 𝒥(p)\{1}, ψ̂1 = ψ1, and ψ̂0 = ψ̂2 = (1 − p1)ψ1.

Remark 1

Corollary 1 identifies two classes of processes in 𝒞^M that are not identifiable. The characteristics of the equivalent processes differ widely over 𝒞^M_{p,G} when (p, G) identifies a Bellman-Harris process. As an illustration, consider the Markov process with offspring distribution p = (1/5, 1/2, 3/10) and exponentially distributed lifespan with parameters ψ0 = ψ1 = ψ2 = 1. This process is of Bellman-Harris type. The class of processes in 𝒞^M equivalent to this process is determined by Case 1 of Corollary 1, and it includes the processes with offspring distributions p̂ = (2(1 − p̂1)/5, p̂1, 3(1 − p̂1)/5) and exponentially distributed lifespan with parameters ψ̂0 = ψ̂1 = ψ̂2 = 1/{2(1 − p̂1)}, where p̂1 ∈ [0, 1). In particular, if p̂1 = 0, we obtain the process parameterized by p̂ = (2/5, 0, 3/5) and ψ̂0 = ψ̂2 = 0.5, which belongs to 𝒞₀. Figure 1.A displays examples of probability density functions ĝ2 for a sample of processes that belong to the equivalence class. Figures 1.B–C show the set of probability density functions ĝ2 when the Bellman-Harris structure of the process is relaxed. For example, in Figure 1.B, we set ψ1 = 1.5 (all other parameter values are identical to those used in Figure 1.A) and find, using Case 2 of Corollary 1, that the class of equivalent Markov processes includes a second process with offspring distribution p̂ = (4/15, 1/3, 2/5) and exponentially distributed lifespan with parameters ψ̂1 = 1.5 and ψ̂0 = ψ̂2 = 0.75. In Figure 1.C, we set ψ1 = 0.5, while in Figure 1.D, we set ψ1 = 1 and ψ0 = 2. In these two cases, the class of equivalent processes includes only the original process, which is therefore identifiable.
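The equivalence asserted in Remark 1 can be illustrated by simulation (our sketch, reusing the hypothetical simulate_Z helper from Section 1): the Bellman-Harris process with p = (1/5, 1/2, 3/10) and ψ = 1 and the process with p̂ = (2/5, 0, 3/5) and ψ̂ = 1/2 should produce matching distributions of Z(t).

```python
import random

pA, psiA = [0.2, 0.5, 0.3], 1.0   # original Bellman-Harris Markov process
pB, psiB = [0.4, 0.0, 0.6], 0.5   # equivalent process with p1-hat = 0

GA = lambda j, rng: rng.expovariate(psiA)
GB = lambda j, rng: rng.expovariate(psiB)

for t in (1.0, 2.0, 4.0):
    zA = [simulate_Z(t, pA, GA) for _ in range(10000)]
    zB = [simulate_Z(t, pB, GB) for _ in range(10000)]
    # the empirical means (and distributions) should agree up to Monte Carlo error
    print(t, sum(zA) / len(zA), sum(zB) / len(zB))
```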

Figure 1.


Representation of the set of probability density functions ĝ2 over the class of equivalent processes for four Markov processes: (A) ψ0 = ψ1 = ψ2 = 1 (Bellman-Harris process); (B) ψ0 = ψ2 = 1 and ψ1 = 1.5 (> ψ0 and > ψ2) (“extended” Bellman-Harris process); (C) ψ0 = ψ2 = 1 and ψ1 < 1 (“extended” Bellman-Harris process); (D) ψ0 ≠ ψ2, ψ1 > 0, ψ2 = 1. We set p1 = 1/2 in all cases. Each plot shows g2 and ĝ2 (whenever some ĝ2 ≠ g2 exists) for: the process with characteristics (p, G) (solid line); representative processes of the equivalence class (dashed grey lines); the equivalent Markov process in 𝒞₀ (dashed black lines). The model is non-identifiable in cases (A) and (B), and identifiable in cases (C) and (D).

Remark 2

From a statistical standpoint, when Z(t) is observed at discrete time points, the likelihood function can be expressed solely in terms of the marginal distribution of {Z(t), t ≥ 0}, and the model parameters are therefore not always identifiable. The maximum likelihood estimator is then not consistent, at least in the traditional sense [6]. If modeling quiescence is not of primary interest, this non-identifiability issue may be avoided by imposing p̂1 = p1 = 0. Under such a restriction, the interpretation of Gj, j ∈ 𝒥(p), may change because the time to producing j offspring could now include a resting phase latently embedded in the lifespan.

3.2. Gamma distributed lifespan

We now extend the Markov process by assuming that the lifespan is gamma distributed: Gj(t) := ∫_0^t {κj^{ωj}/Γ(ωj)} x^{ωj−1} e^{−κj x} dx for some ψj := (ωj, κj) ∈ ℝ₊ × ℝ₊, j ∈ 𝒥(p). We have ℒgj(s) = κj^{ωj}/(κj + s)^{ωj}, defined for s ∈ (−κj, ∞) but extendable to s ∈ (−∞, −κj) ∪ (−κj, ∞) by analytic continuation. The assumption of gamma distributed lifespan is frequently made in practice [3, 9]. We obtain the Markov process of the previous section if ωj = 1, j ∈ 𝒥(p). We show that, when ωj ≠ 1, the process is identifiable (Corollary 2 below).
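For simulation, a gamma-lifespan process can be plugged into the hypothetical simulator of Section 1 using Python's standard gamma sampler, whose arguments are the shape ωj and the scale 1/κj (sketch, ours, with illustrative parameter values).

```python
import random

omega = {0: 2.0, 1: 2.0, 2: 2.0}   # shape parameters omega_j (here != 1)
kappa = {0: 1.0, 1: 1.0, 2: 1.0}   # rate parameters kappa_j

# random.gammavariate(shape, scale), so the scale is 1 / kappa_j
gamma_lifespan = lambda j, rng=random: rng.gammavariate(omega[j], 1.0 / kappa[j])

# z = simulate_Z(3.0, [0.2, 0.5, 0.3], gamma_lifespan)
```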

Corollary 2

Suppose that J = 2 and, for every j ∈ 𝒥(p), that Gj is a gamma distribution with parameters κj > 0, ωj > 0, and ωj ≠ 1. Then, (p, G) is uniquely identified by the process {Z(t), t ≥ 0}.

Proof

Let (p̂, Ĝ) denote the characteristics of any gamma-lifespan process included in 𝒞_{p,G}. Assume first that p1 = 0. If p̂1 = 0, Lemma 1 implies that (p̂, Ĝ) = (p, G). If p̂1 ∈ (0, 1), eqn. (7) gives

$$(1-\hat p_1)\Bigl(\frac{\hat\kappa_j}{\hat\kappa_j+s}\Bigr)^{\hat\omega_j} = \Bigl(\frac{\kappa_j}{\kappa_j+s}\Bigr)^{\omega_j}\Bigl\{1 - \hat p_1\Bigl(\frac{\hat\kappa_1}{\hat\kappa_1+s}\Bigr)^{\hat\omega_1}\Bigr\}.$$

Rearranging the terms in the above identity leads to the equation

$$(1-\hat p_1)\,\hat\kappa_j^{\hat\omega_j}\,(\hat\kappa_1+s)^{\hat\omega_1}(\kappa_j+s)^{\omega_j} = \kappa_j^{\omega_j}\,(\hat\kappa_j+s)^{\hat\omega_j}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - \hat p_1\hat\kappa_1^{\hat\omega_1}\bigr\}. \qquad(23)$$

Dividing both sides of eqn. (23) by (κ̂1 + s)^{ω̂1}(κj + s)^{ωj} and letting s → ∞ yields

$$(1-\hat p_1)\,\hat\kappa_j^{\hat\omega_j} = \kappa_j^{\omega_j}\lim_{s\to\infty}\Bigl\{\frac{(\hat\kappa_j+s)^{\hat\omega_j}}{(\kappa_j+s)^{\omega_j}} - \hat p_1\hat\kappa_1^{\hat\omega_1}\,\frac{(\hat\kappa_j+s)^{\hat\omega_j}}{(\kappa_j+s)^{\omega_j}(\hat\kappa_1+s)^{\hat\omega_1}}\Bigr\}.$$

In order for the R.H.S. to converge to a constant, we must have ω̂j = ωj, which implies that (1 − p̂1)κ̂j^{ω̂j} = κj^{ωj}. Then, eqn. (23) reduces to (κ̂1 + s)^{ω̂1}(κj + s)^{ωj} = (κ̂j + s)^{ωj}{(κ̂1 + s)^{ω̂1} − p̂1κ̂1^{ω̂1}}. Setting s = −κ̂1 gives p̂1κ̂1^{ω̂1}(κ̂j − κ̂1)^{ωj} = 0, from which we deduce that κ̂j = κ̂1, and eqn. (23) reduces further to (κj + s)^{ωj} = (κ̂1 + s)^{ωj}{1 − p̂1κ̂1^{ω̂1}(κ̂1 + s)^{−ω̂1}}. Letting s → −κ̂1, the L.H.S. converges to (κj − κ̂1)^{ωj} whereas the R.H.S. diverges to −∞. Hence, eqn. (23) has no admissible solutions.

Assume next that p1 ∈ (0, 1). If p̂1 = 0, a similar line of argument shows that there are no admissible solutions. If p̂1 ∈ (0, 1), eqn. (7) gives

$$\frac{(1-\hat p_1)\bigl(\frac{\hat\kappa_j}{\hat\kappa_j+s}\bigr)^{\hat\omega_j}}{1-\hat p_1\bigl(\frac{\hat\kappa_1}{\hat\kappa_1+s}\bigr)^{\hat\omega_1}} = \frac{(1-p_1)\bigl(\frac{\kappa_j}{\kappa_j+s}\bigr)^{\omega_j}}{1-p_1\bigl(\frac{\kappa_1}{\kappa_1+s}\bigr)^{\omega_1}}.$$

Rearranging the terms in the above identity leads to the equation

$$(1-p_1)\,\kappa_j^{\omega_j}(\kappa_1+s)^{\omega_1}(\hat\kappa_j+s)^{\hat\omega_j}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - \hat p_1\hat\kappa_1^{\hat\omega_1}\bigr\} = (1-\hat p_1)\,\hat\kappa_j^{\hat\omega_j}(\hat\kappa_1+s)^{\hat\omega_1}(\kappa_j+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}\bigr\}. \qquad(24)$$

Divide both sides of eqn. (24) by (κ1 + s)^{ω1}(κ̂1 + s)^{ω̂1}(κ̂j + s)^{ω̂j} and let s → ∞. Then

$$(1-p_1)\,\kappa_j^{\omega_j} = (1-\hat p_1)\,\hat\kappa_j^{\hat\omega_j}\lim_{s\to\infty}\Bigl\{\frac{(\kappa_j+s)^{\omega_j}}{(\hat\kappa_j+s)^{\hat\omega_j}} - p_1\kappa_1^{\omega_1}\,\frac{(\kappa_j+s)^{\omega_j}}{(\hat\kappa_j+s)^{\hat\omega_j}(\kappa_1+s)^{\omega_1}}\Bigr\}. \qquad(25)$$

In order for the R.H.S. to converge to a constant, we must have ωj = ω̂j, which implies that (1 − p1)κj^{ωj} = (1 − p̂1)κ̂j^{ω̂j}. Then, eqn. (24) reduces to

$$(\kappa_1+s)^{\omega_1}(\hat\kappa_j+s)^{\omega_j}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - \hat p_1\hat\kappa_1^{\hat\omega_1}\bigr\} = (\hat\kappa_1+s)^{\hat\omega_1}(\kappa_j+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}\bigr\}. \qquad(26)$$

Setting s = −κ1 gives −p1κ1^{ω1}(κ̂1 − κ1)^{ω̂1}(κj − κ1)^{ωj} = 0, from which we deduce that either κ̂1 = κ1 or κj = κ1. We study these two cases separately.

  • Case 1: κ̂1 = κ1. Assume first that ω1 > ω̂1. Eqn. (26) becomes
    $$(\kappa_1+s)^{\omega_1-\hat\omega_1}(\hat\kappa_j+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\hat\omega_1} - \hat p_1\kappa_1^{\hat\omega_1}\bigr\} = (\kappa_j+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}\bigr\}. \qquad(27)$$
    Setting s = −κ1 gives p1κ1^{ω1}(κj − κ1)^{ωj} = 0. Hence, we must have κ1 = κj, and eqn. (27) becomes
    $$(\kappa_1+s)^{\omega_1-\hat\omega_1}(\hat\kappa_j+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\hat\omega_1} - \hat p_1\kappa_1^{\hat\omega_1}\bigr\} = (\kappa_1+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}\bigr\}. \qquad(28)$$

    We distinguish two sets of solutions:

    1. If κ̂j = κ1, eqn. (28) reduces to (κ1 + s)^{ω1−ω̂1}{(κ1 + s)^{ω̂1} − p̂1κ1^{ω̂1}} = (κ1 + s)^{ω1} − p1κ1^{ω1}. Setting s = −κ1 yields p1κ1^{ω1} = 0, which is not admissible here because p1κ1 > 0.

    2. If κ̂j ≠ κ1, dividing both sides of eqn. (28) by (κ1 + s)^{ωj} and letting s → −κ1 entails that ωj = ω1 − ω̂1 > 0, and eqn. (28) reduces to (κ̂j + s)^{ωj}{(κ1 + s)^{ω̂1} − p̂1κ1^{ω̂1}} = (κ1 + s)^{ω1} − p1κ1^{ω1}. Differentiating both sides of the equation with respect to s gives
      $$\omega_j(\hat\kappa_j+s)^{\omega_j-1}\bigl\{(\kappa_1+s)^{\hat\omega_1} - \hat p_1\kappa_1^{\hat\omega_1}\bigr\} + \hat\omega_1(\hat\kappa_j+s)^{\omega_j}(\kappa_1+s)^{\hat\omega_1-1} = \omega_1(\kappa_1+s)^{\omega_1-1}.$$

      Letting s → −κ̂j, the L.H.S. of the equation converges to 0 if ωj > 1 and diverges to −∞ if 0 < ωj < 1, whereas the R.H.S. converges to ω1(κ1 − κ̂j)^{ω1−1} ∈ (0, ∞). Hence, eqn. (26) has no admissible solutions in this case either. By a similar line of argument, one can show that eqn. (26) has no admissible solutions when ω1 < ω̂1.

      When ω̂1 = ω1, eqn. (26) reduces to
      $$(\hat\kappa_j+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\omega_1} - \hat p_1\kappa_1^{\omega_1}\bigr\} = (\kappa_j+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}\bigr\}. \qquad(29)$$
      If κ̂j ≠ κj, setting s = −κ̂j gives (κ1 − κ̂j)^{ω1} = p1κ1^{ω1}, and setting s = −κj gives (κ1 − κj)^{ω1} = p̂1κ1^{ω1}. This implies that κ1 ≠ κ̂j and κ1 ≠ κj. Taking the derivative with respect to s on both sides of eqn. (29) yields
      $$\omega_j(\hat\kappa_j+s)^{\omega_j-1}\bigl\{(\kappa_1+s)^{\omega_1} - (\kappa_1-\kappa_j)^{\omega_1}\bigr\} + \omega_1(\hat\kappa_j+s)^{\omega_j}(\kappa_1+s)^{\omega_1-1} = \omega_j(\kappa_j+s)^{\omega_j-1}\bigl\{(\kappa_1+s)^{\omega_1} - (\kappa_1-\hat\kappa_j)^{\omega_1}\bigr\} + \omega_1(\kappa_j+s)^{\omega_j}(\kappa_1+s)^{\omega_1-1}. \qquad(30)$$

      As s → −κ̂j, the L.H.S. of eqn. (30) converges to 0 if ωj > 1 and diverges to −∞ if 0 < ωj < 1, whereas the R.H.S. converges to ω1(κj − κ̂j)^{ωj}(κ1 − κ̂j)^{ω1−1} ∈ (0, ∞). Hence, eqn. (29) has no admissible solutions.

      If κ̂j = κj, then (κ̂i, ω̂i) = (κi, ωi), i = 1, j. We also deduce from eqn. (29) that p̂1 = p1. Hence p̂j = pj using the first equation in (8).

  • Case 2: κ̂1 ≠ κ1 and κj = κ1. Eqn. (26) reduces to
    $$(\kappa_1+s)^{\omega_1}(\hat\kappa_j+s)^{\omega_j}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - \hat p_1\hat\kappa_1^{\hat\omega_1}\bigr\} = (\hat\kappa_1+s)^{\hat\omega_1}(\kappa_1+s)^{\omega_j}\bigl\{(\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}\bigr\}. \qquad(31)$$
    Setting s = −κ̂1 gives p̂1κ̂1^{ω̂1}(κ1 − κ̂1)^{ω1}(κ̂j − κ̂1)^{ωj} = 0. Because κ̂1 ≠ κ1, we must have κ̂1 = κ̂j, and eqn. (31) reduces to
    $$(\kappa_1+s)^{\omega_1-\omega_j}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - \hat p_1\hat\kappa_1^{\hat\omega_1}\bigr\} = (\hat\kappa_1+s)^{\hat\omega_1-\omega_j}\bigl\{(\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}\bigr\}. \qquad(32)$$

    We consider the following cases separately:

    1. If ω1 > ωj, setting s = −κ1 yields p1κ1^{ω1}(κ̂1 − κ1)^{ω̂1−ωj} = 0, which has no admissible solutions.

    2. If ω̂1 > ωj, setting s = −κ̂1 yields p̂1κ̂1^{ω̂1}(κ1 − κ̂1)^{ω1−ωj} = 0, which has no admissible solutions.

    3. If ωj > ω1 and ωj > ω̂1, eqn. (32) can be rewritten as
      $$(\hat\kappa_1+s)^{\omega_j-\hat\omega_1}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - \hat p_1\hat\kappa_1^{\hat\omega_1}\bigr\} = (\kappa_1+s)^{\omega_j-\omega_1}\bigl\{(\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}\bigr\}. \qquad(33)$$
      Setting s = −κ̂1 yields (κ1 − κ̂1)^{ω1} = p1κ1^{ω1}, and setting s = −κ1 yields (κ̂1 − κ1)^{ω̂1} = p̂1κ̂1^{ω̂1}. Differentiating eqn. (33) with respect to s gives
      $$(\omega_j-\hat\omega_1)(\hat\kappa_1+s)^{\omega_j-\hat\omega_1-1}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - (\hat\kappa_1-\kappa_1)^{\hat\omega_1}\bigr\} + \hat\omega_1(\hat\kappa_1+s)^{\omega_j-1} = (\omega_j-\omega_1)(\kappa_1+s)^{\omega_j-\omega_1-1}\bigl\{(\kappa_1+s)^{\omega_1} - (\kappa_1-\hat\kappa_1)^{\omega_1}\bigr\} + \omega_1(\kappa_1+s)^{\omega_j-1}.$$

      Letting s → −κ̂1, the L.H.S. of the equation either converges to 0 (if ωj − ω̂1 > 1) or diverges to −∞ (if ωj − ω̂1 < 1), whereas the R.H.S. converges to ω1(κ1 − κ̂1)^{ωj−1} ∈ (0, ∞). Hence, eqn. (33) has no admissible solutions.

    4. If ωj = ω1 and ωj > ω̂1, eqn. (32) can be rewritten as
      $$(\hat\kappa_1+s)^{\omega_j-\hat\omega_1}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - \hat p_1\hat\kappa_1^{\hat\omega_1}\bigr\} = (\kappa_1+s)^{\omega_1} - p_1\kappa_1^{\omega_1}. \qquad(34)$$
      Taking the derivative with respect to s on both sides of eqn. (34) gives
      $$(\omega_j-\hat\omega_1)(\hat\kappa_1+s)^{\omega_j-\hat\omega_1-1}\bigl\{(\hat\kappa_1+s)^{\hat\omega_1} - \hat p_1\hat\kappa_1^{\hat\omega_1}\bigr\} + \hat\omega_1(\hat\kappa_1+s)^{\omega_j-1} = \omega_1(\kappa_1+s)^{\omega_1-1}.$$

      Letting s → −κ̂1, the L.H.S. of the equation either converges to 0 (if ωj − ω̂1 > 1) or diverges to −∞ (if ωj − ω̂1 < 1), whereas the R.H.S. converges to ω1(κ1 − κ̂1)^{ωj−1} ∈ (0, ∞). Hence, eqn. (34) has no admissible solutions. The case ωj > ω1 and ωj = ω̂1 is handled similarly, and has no solutions either.

    5. If ωj = ω1 and ωj = ω̂1, eqn. (32) reduces to (κ̂1 + s)^{ω1} − p̂1κ̂1^{ω1} = (κ1 + s)^{ω1} − p1κ1^{ω1}, which has no admissible solutions because κ̂1 ≠ κ1.

3.3. A Smith-Martin process

We consider a generalization of the Smith-Martin (S.M.) model originally proposed in [8]. The process assumes that, conditional on ξ = j, j ∈ 𝒥(p), the lifespan takes the form τ = τ_{Aj} + δj, where τ_{Aj} follows an exponential distribution with parameter ψj, and where δj is a non-negative constant. In the original formulation of the model, τ_{A2} represents essentially the duration spent by the cell in the G0/G1 phases, and δ2 is the time spent by the cell in the S, G2, M (and part of G1) phases. Here, ℒgj(s) = e^{−δj s} ψj/(ψj + s), which can be extended to s ∈ ℝ\{−ψj} by analytic continuation. This process is identical to the process of Section 3.1 if δj = 0, j ∈ 𝒥(p). Let 𝒞^{SM} denote the family of S.M. processes, and write 𝒞^{SM}_{p,G} := 𝒞_{p,G} ∩ 𝒞^{SM} for the class of S.M. processes equivalent to the process with characteristics (p, G). This process is not always identifiable, as Corollary 3 below shows.
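A lifespan from the S.M. model is just an exponential phase plus a fixed lag, so it is straightforward to sample (our sketch; the parameter names are illustrative, and the last line shows how it would plug into the hypothetical simulator of Section 1).

```python
import random

def sm_lifespan(j, psi, delta, rng=random):
    """Smith-Martin lifespan tau = tau_Aj + delta_j: an exponential phase with
    rate psi[j] followed by a deterministic lag delta[j] >= 0."""
    return rng.expovariate(psi[j]) + delta[j]

psi = {0: 1.0, 1: 1.0, 2: 1.0}
delta = {0: 0.5, 1: 0.0, 2: 0.5}

# z = simulate_Z(3.0, [0.2, 0.5, 0.3], lambda j, rng: sm_lifespan(j, psi, delta, rng))
```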

Corollary 3

Suppose that J = 2 and, for every j ∈ 𝒥(p), that Gj(t) = 1 − e^{−ψj(t−δj)} for t ≥ δj (and Gj(t) = 0 otherwise). Then, (p, G) is uniquely identified by {Z(t), t ≥ 0} except in the following cases:

  • Case 1: If ψj = ψ, j ∈ 𝒥(p), and δ1 = 0 when p1 ∈ (0, 1) (Bellman-Harris case), then 𝒞^{SM}_{p,G} includes the S.M. processes with characteristics (p̂, Ĝ) ∈ {p̂1 ∈ (0, 1), p̂j = pj(1 − p̂1)/(1 − p1), δ̂1 = 0, δ̂j = δj, ψ̂1 = ψ̂j = ψ(1 − p1)/(1 − p̂1), j ∈ 𝒥(p)\{1}} ∪ {p̂1 = 0, p̂j = pj/(1 − p1), δ̂j = δj, ψ̂j = ψ(1 − p1), j ∈ 𝒥(p)\{1}}.

  • Case 2: If p1 ∈ (0, 1), p1 ≠ 1 − ψ/ψ1, δ1 = 0, ψj = ψ, j ∈ 𝒥(p)\{1}, and ψ < ψ1 (“extended” Bellman-Harris case), then 𝒞^{SM}_{p,G} consists of the S.M. processes with characteristics (p, G) and (p̂, Ĝ), where p̂1 = 1 − ψ/ψ1, p̂j = pjψ/{(1 − p1)ψ1}, δ̂1 = 0, δ̂j = δj, ψ̂1 = ψ1, and ψ̂j = (1 − p1)ψ1, j ∈ {0, 2}.

Proof

Let (p̂, Ĝ) denote the characteristics of any process in 𝒞^{SM}_{p,G}. Assume first that p1 = 0. If p̂1 = 0, Lemma 1 yields (p̂, Ĝ) = (p, G). If p̂1 ∈ (0, 1), eqn. (8) gives

$$(1-\hat p_1)\,e^{-\hat\delta_j s}\,\frac{\hat\psi_j}{\hat\psi_j+s} = e^{-\delta_j s}\,\frac{\psi_j}{\psi_j+s}\Bigl\{1 - \hat p_1\,e^{-\hat\delta_1 s}\,\frac{\hat\psi_1}{\hat\psi_1+s}\Bigr\}. \qquad(35)$$

Taking the logarithm of both sides of the equation, we obtain

$$-\hat\delta_j s + \log\{\hat\psi_j(1-\hat p_1)\} - \log(\hat\psi_j+s) = -\delta_j s - \log(\psi_j+s) + \log\Bigl\{\psi_j\Bigl(1 - \hat p_1\,e^{-\hat\delta_1 s}\,\frac{\hat\psi_1}{\hat\psi_1+s}\Bigr)\Bigr\}.$$

Dividing both sides of the equation by s and letting s → ∞ entails that δ̂j = δj, and eqn. (35) reduces to ψj(ψ̂j + s)(ψ̂1 + s) − (1 − p̂1)ψ̂j(ψj + s)(ψ̂1 + s) = p̂1ψ̂1ψj(ψ̂j + s)e^{−δ̂1 s}. Taking again the logarithm of both sides of the equation, dividing by s, and letting s → ∞ yields δ̂1 = 0. Hence, eqn. (35) leads to eqn. (19), from which we deduce that ψ̂1 = ψ̂j = ψ/(1 − p̂1), where ψ := ψj, j ∈ 𝒥(p), and p̂j = pj(1 − p̂1), p̂1 ∈ (0, 1). This proves part of Case 1 of Corollary 3.

Assume next that p1 ∈ (0, 1). If p̂1 = 0, the same line of argument applies and, by symmetry, we find that the process with characteristics (p̂, Ĝ) satisfying δ̂j = δj, ψ̂j = ψ(1 − p1), and p̂j = pj/(1 − p1), j ∈ 𝒥(p)\{1}, belongs to 𝒞^{SM}_{p,G} if ψj = ψ, j ∈ 𝒥(p), and δ1 = 0. This proves another part of Case 1.

Assume now that p1 ∈ (0, 1) and p̂1 ∈ (0, 1). Then, for every j ∈ 𝒥(p)\{1}, eqn. (7) gives

$$(1-\hat p_1)\,e^{-\hat\delta_j s}\,\frac{\hat\psi_j}{\hat\psi_j+s}\Bigl\{1 - p_1 e^{-\delta_1 s}\frac{\psi_1}{\psi_1+s}\Bigr\} = (1-p_1)\,e^{-\delta_j s}\,\frac{\psi_j}{\psi_j+s}\Bigl\{1 - \hat p_1 e^{-\hat\delta_1 s}\frac{\hat\psi_1}{\hat\psi_1+s}\Bigr\}. \qquad(36)$$

Taking the logarithm, dividing both sides of eqn. (36) by s, and letting s → ∞ implies that δ̂j = δj, and eqn. (36) reduces to

$$(1-\hat p_1)\,\frac{\hat\psi_j}{\hat\psi_j+s}\Bigl\{1 - p_1 e^{-\delta_1 s}\frac{\psi_1}{\psi_1+s}\Bigr\} = (1-p_1)\,\frac{\psi_j}{\psi_j+s}\Bigl\{1 - \hat p_1 e^{-\hat\delta_1 s}\frac{\hat\psi_1}{\hat\psi_1+s}\Bigr\}.$$

Multiplying both sides by s and letting s → ∞, we obtain that (1 − p̂1)ψ̂j = (1 − p1)ψj. Then eqn. (36) becomes

$$(\psi_j-\hat\psi_j)(\psi_1+s)(\hat\psi_1+s) = p_1\psi_1 e^{-\delta_1 s}(\psi_j+s)(\hat\psi_1+s) - \hat p_1\hat\psi_1 e^{-\hat\delta_1 s}(\hat\psi_j+s)(\psi_1+s). \qquad(37)$$

We distinguish four sets of solutions:

  1. If δ1 > 0 and δ̂1 > 0, dividing both sides of eqn. (37) by s² and letting s → ∞ yields ψ̂j = ψj. Then, eqn. (37) reduces to p1ψ1e^{−δ1 s}(ψ̂1 + s) = p̂1ψ̂1e^{−δ̂1 s}(ψ1 + s), from which we deduce that δ̂1 = δ1, p̂1 = p1, and ψ̂1 = ψ1. Hence, (p̂, Ĝ) = (p, G).

  2. If δ1 > 0 and δ̂1 = 0, rearranging the terms of eqn. (37) leads to (ψ1 + s){(ψj − ψ̂j)(ψ̂1 + s) + p̂1ψ̂1(ψ̂j + s)} = p1ψ1e^{−δ1 s}(ψj + s)(ψ̂1 + s). Letting s → ∞, the L.H.S. of the equation diverges to infinity, whereas the R.H.S. converges to zero. Hence, eqn. (37) has no admissible solutions.

  3. If δ1 = 0 and δ̂1 > 0, a similar line of arguments shows that eqn. (37) has no admissible solutions.

  4. If δ1 = 0 and δ̂1 = 0, eqn. (37) is equivalent to eqn. (20). The values of p̂j and ψ̂j, j ∈ 𝒥(p̂), that solve the equation are given in Corollary 1, and lead to part of Case 1 and to Case 2.

Acknowledgments

This research was supported by NIH R01 grants NS039511, CA134839, and AI069351 to OH.

Contributor Information

RUI CHEN, University of Rochester.

OLLIVIER HYRIEN, University of Rochester.

References

  • 1. Gyllenberg M, Webb GF. A nonlinear structured population model of tumor growth with quiescence. Journal of Mathematical Biology. 1990;28:671–694. doi: 10.1007/BF00160231.
  • 2. Haccou P, Jagers P, Vatutin VA. Branching Processes: Variation, Growth, and Extinction of Populations. Cambridge University Press; 2005.
  • 3. Hyrien O, Mayer-Pröschel M, Noble M, Yakovlev A. A stochastic model to analyze clonal data on multi-type cell populations. Biometrics. 2005;61:199–207. doi: 10.1111/j.0006-341X.2005.031210.x.
  • 4. Jagers P. Branching Processes with Biological Applications. John Wiley and Sons; London: 1975.
  • 5. Kimmel M, Axelrod DE. Branching Processes in Biology. Springer; New York: 2002.
  • 6. Redner R. Note on the consistency of the maximum likelihood estimate for nonidentifiable distributions. Annals of Statistics. 1981;9:225–228.
  • 7. Sevastyanov BA. Branching Processes. Nauka; Moscow: 1971 (in Russian).
  • 8. Smith JA, Martin L. Do cells cycle? Proceedings of the National Academy of Sciences. 1973;70:1263–1267. doi: 10.1073/pnas.70.4.1263.
  • 9. Yakovlev AY, Yanev NM. Transient Processes in Cell Proliferation Kinetics. Springer-Verlag; Heidelberg: 1989.
