A test of homogeneity for age-dependent branching processes with immigration

Ollivier Hyrien; Nikolay M Yanev; Craig T Jordan

doi:10.1214/15-EJS1024

. Author manuscript; available in PMC: 2016 Apr 29.

Published in final edited form as: Electron J Stat. 2015 May 22;9(1):898–925. doi: 10.1214/15-EJS1024

A test of homogeneity for age-dependent branching processes with immigration

Ollivier Hyrien ^1,^✉, Nikolay M Yanev ², Craig T Jordan ³

PMCID: PMC4851446 NIHMSID: NIHMS777146 PMID: 27134694

Abstract

We propose a novel procedure to test whether the immigration process of a discretely observed age-dependent branching process with immigration is time-homogeneous. The construction of the test is motivated by the behavior of the coefficient of variation of the population size. When immigration is time-homogeneous, we find that this coefficient converges to a constant, whereas when immigration is time-inhomogeneous we find that it is time-dependent, at least transiently. Thus, we test the assumption that the immigration process is time-homogeneous by verifying that the sample coefficient of variation does not vary significantly over time. The test is simple to implement and does not require specification or fitting any branching process to the data. Simulations and an application to real data on the progression of leukemia are presented to illustrate the approach.

Keywords and phrases: Coefficient of Variation, Continuous-Time Branching Processes, Leukemia, Non-Homogeneous Poisson Process

1. Introduction

Age-dependent branching processes define a class of continuous-time stochastic processes that is increasingly popular in the biological sciences. They offer flexible, yet tractable, stochastic models that describe population dynamics at the individual level, and allow the lifespan to follow arbitrary distributions (e.g., [1, 2, 3, 4, 5, 6, 7]). Over the years, they have been extended in a number of useful ways. For example, they may include an immigration component to describe an influx of immigrants into the population of interest.

Sevastyanov (1957) was the first to study branching processes with immigration [8]. He investigated the Markov case, and extensions to age-dependent processes were subsequently considered by Jagers and other authors in the single-type [9, 10, 11, 12, 13, 14, 15, 16, 17, 18 ] and the multi-type case [19, 20 ]. Recently, this class of processes has been proposed to model the dynamics of cell populations in vivo under the assumption that immigration obeys a (possibly time-inhomogeneous) Poisson process [17, 18, 20, 21 ].

Statistical methods for the Bienaymé-Galton-Watson process with immigration are discussed in Guttorp’s monograph [22]. In contrast, statistical inference for age-dependent branching processes with immigration has received little, if any, attention to date. In particular, no methods have been developed to determine whether the rate of the immigration process should be taken as constant or allowed to vary with time. We see two reasons for developing a procedure that addresses this question: firstly, it is not always known in practice whether the influx of cells in the population of interest changes over time such that the procedure could lead to valuable biological insights; secondly, from a statistical standpoint, it would help to decide whether the data can support a model with time-inhomogeneous immigration, and thus could prevent over-parameterization issues. The question of how to specify the shape of the immigration rate is also important, but we do not address it here.

The goal of this paper is to propose a test to determine whether the immigration process of an age-dependent branching process with immigration is time-homogeneous based on observations of the population size at discrete time points. The proposed test applies when several independent populations are observed at discrete time points; such experimental designs are commonly used in biology. We first investigate the asymptotic behavior of the coefficient of variation of the population size under various immigration rates and when the branching process is sub-, super-, and critical. We find that the coefficient of variation converges over time to a strictly positive constant when the immigration process is time-homogeneous. In contrast, when the immigration process is time-inhomogeneous, we find that the coefficient of variation is either time-dependent, possibly after applying a suitable transformation, or transits to a different constant. Thus, we construct a test which verifies if the empirical co-efficient of variation changes significantly over time, which is accomplished by techniques of linear regression.

An attractive feature of the test is that it is simple to implement. In particular, it does not require any branching process to be fitted to the data, and it does not impose either that the distribution of the lifespan and the shape of the immigration rate, should it be time-dependent, be formulated. This simplicity is a consequence of the fact that the test is solely constructed from the asymptotic behavior of the process. Statistical methods for branching processes that rely on their asymptotic behavior have been successfully used in the past [22, 23, 24, 25, 26, 27], and the proposed test is built in the same vein. Asymptotic procedures for testing the homogeneity of coefficients of variation across samples have also been proposed [28, 29 ] (see also [30] for a derivation of the asymptotic distribution of the coefficient of variation). These tests do not apply in our setting because they make two assumptions that would not be valid: (1) observations are independent across samples; and (2) observations are normally distributed.

The class of branching processes that we consider is defined in Section 2. Although this work was motivated by a problem that arises from cell biology, we consider a process that is more broadly applicable because our procedure works identically under a more general set of assumptions about the offspring and lifespan distributions, and generalization comes at no cost. In Section 3, we study the asymptotic behavior of the coefficient of variation of the population size when the immigration rate is constant and when it is time-dependent. The Markov case is considered analytically and the non-Markov case is investigated numerically. We develop the test in Section 4. In Section 5, we present results from simulation studies in which we investigate the performance of the test. These studies show that the test possesses power under a variety of alternative hypotheses. An application to a real data set on the dynamics of leukemia is presented in Section 6. The analysis reveals that the immigration rate of normal (non-leukemic) cells into the blood changed over time. Concluding remarks are offered in Section 7. Technical details are provided in the Appendix.

2. Age-Dependent Branching Processes with Time-Inhomogeneous Immigration

2.1. The general process

Without loss of generality, the process begins at t = 0 with zero cells. Let ${T_{j}}_{j = 1}^{\infty}$ be a sequence of time points at which I_j new cells (thereafter referred to as immigrants) arrive in the population, where ${I_{j}}_{j = 1}^{\infty}$ is a collection of independent and identically distributed, integer-valued random variables (r.v.). Write γ = E(I_j), γ₂ = E{I_j(I_j − 1)}, and g(s) = E(s^I_j). The immigration process $P (t) = \sum_{j = 1}^{\infty} 1_{{T_{j} \leq t}}$ is assumed to be a non-homogeneous Poisson process with instantaneous and cumulative rates r(t) and $R (t) = \int_{0}^{t} r (u) d u$ . When r(t) = r (t ≥ 0), P(t) reduces to a time-homogeneous Poisson process.

Upon completion of its lifespan, every cell of the population, including immigrants, produces a random number ξ of offspring of age zero. Write p_k = pr(ξ = k), k = 0, 1 ⋯, for the distribution of ξ. Let q(s) = E(s^ξ), |s| ≤ 1, denote its probability generating function, and put m = E(ξ) and m₂ = E{ξ(ξ − 1)}. Applications to cell biology are primarily concerned with the special case q(s) = p₀+p₂s², in which cells may either die with probability p₀ or divide into two cells with probability p₂ = 1 − p₀. The duration of the lifespan is described by a non-negative r.v. τ with cumulative distribution function G(t) = pr(τ ≤ t), assumed non-lattice and satisfying G(0+) = 0. Write μ = E(τ), assumed finite. Every cell behaves independently of every other cell. Finally, assume that γ₂ < ∞, m₂ < ∞, and μ₂ < ∞. The distributions G(·) and q(·) define a Bellman-Harris process embedded in the branching process with immigration.

2.2. A special case applied to cancer stem cell biology

Recent studies have supported the hypothesis that stem cells play a central role not only in the generation and maintenance of multicellular systems, but also in the development of several cancers. For example, they have been identified in several types of leukemia [31]. As it appeared clear that stem cells should be targeted by cancer therapy, understanding their properties, including their dynamics, has become of considerable interest to cancer scientists.

Stem cells are characterized by the unique combination of three features: (1) they can self-renew by producing daughter cells that retain their properties; (2) they can generate multicellular lineages; and (3) they are able to maintain survival of these lineages. They also tend to be rare and are not always experimentally detectable, making the study of their behavior challenging.

The lack of direct observation on stem cells can be attenuated by modeling their contribution to disease progression via an immigration process that describes their influx into the pool of observable cells as they differentiate. The intensity of this influx of newly differentiated cells may vary over time, a feature that can be captured by allowing the immigration process to be time-inhomogeneous. Upon completing their lifespan, differentiated cells divide into two differentiated cells with probability p₂ or die with probability p₀ = 1 − p₂. The duration of the lifespan is often assumed to have a gamma distribution, but other choices are possible (e.g., a log-normal distribution).

3. The Coefficient of Variation and its Asymptotic Behavior

3.1. The general case

Let Z(t) denote the population size at time t. For every t ≥ 0 and |s| ≤ 1, put

Ψ (t; s) = E {s^{Z (t)} ∣ Z (0) = 0}

for the probability generating function of Z(t). It has the following expression:

Ψ (t; s) = exp {- \int_{0}^{t} r (t - u) [1 - g (F (u; s))] d u},

(3.1)

where Ψ(0, s) = 1, and where F(t; s), t ≥ 0, |s| ≤ 1, satisfies

F (t; s) = \int_{0}^{t} q (F (t - u; s)) d G (u) + s {1 - G (t)},

(3.2)

with the initial condition F(0; s) = s [35]. Define A(t) = ∂F(t; s)/∂s|_s₌₁ and B(t) = ∂²F(t; s)/∂s²|_s₌₁. These functions are the first and second order factorial moments of the embedded Bellman-Harris process started from a single cell at time t = 0. It follows from eqn. (3.2) that A(t) and B(t) are solutions to the renewal-type equations

A (t) = m \int_{0}^{t} A (t - u) d G (u) + 1 - G (t),

(3.3)

and

B (t) = m \int_{0}^{t} B (t - u) d G (u) + m_{2} \int_{0}^{t} A {(t - u)}^{2} d G (u),

(3.4)

with the initial conditions A(0) = 1 and B(0) = 0 [1].

Define the first and second order moments of Z(t): M(t) = E{Z(t) | Z(0) = 0}, M₂(t) = E{Z(t){Z(t) − 1} | Z(0) = 0}, V (t) = Var{Z(t) | Z(0) = 0} = M₂(t) + M (t){1 − M(t)}, and let C_v(t) = V (t)^1/2/M (t) denote the coefficient of variation of Z(t). We deduce from eqn. (3.1) that M(t) and M₂(t) take the expressions

M (t) = γ \int_{0}^{t} r (t - u) A (u) d u,

(3.5)

and

M_{2} (t) = γ \int_{0}^{t} r (t - u) B (u) d u + {γ \int_{0}^{t} r (t - u) A (u) d u}^{2} + γ_{2} \int_{0}^{t} r (t - u) A {(u)}^{2} d u,

(3.6)

with initial conditions M(0) = 0 and M₂(0) = 0. Eqns. (3.5) and (3.6) imply directly that:

V (t) = γ \int_{0}^{t} r (t - u) B (u) d u + γ_{2} \int_{0}^{t} r (t - u) A {(u)}^{2} d u + M (t) .

(3.7)

Let α denote the Malthusian parameter of the embedded Bellman-Harris process. Assuming it exists, α is the solution to the equation

m \int_{0}^{\infty} e^{- α u} d G (u) = 1.

The process is said to be sub-critical if α < 0, critical if α = 0, and super-critical if α > 0.

Define $\tilde{G} (t) = m \int_{0}^{t} e^{- α u} d G (u)$ and $\tilde{μ} = \int_{0}^{\infty} u d \tilde{G} (u)$ . Application of renewal theory to eqns. (3.3) and (3.4) gives A(t) = 1 if α = 0 and A(t) ~ K_Ae^αt if α ≠ 0, t ≥ 0, where K_A = (m − 1)/αmμ̃, and

B (t) ~ {\begin{cases} K_{B 1} e^{α t} & if α < 0 \\ K_{B 2} t & if α = 0 \\ K_{B 3} e^{2 α t} & if α > 0, \end{cases}

where $K_{B 1} = - m_{2} K_{A}^{2} / m α \tilde{μ}$ , K_B₂ = m₂/μ, and $K_{B 3} = m_{2} K_{A}^{2} \tilde{G} (2 α) / {1 - m \tilde{G} (2 α)}$ [1].

Additional constants that will appear in the limit of the coefficient of variation include $\bar{A} = \int_{0}^{\infty} A (x) d x$ and $\bar{B} = \int_{0}^{\infty} B (x) d x$ , which are both finite when α < 0. We also define ${\bar{E}}_{1} (α) = \int_{0}^{\infty} e^{- α x} {(1 + x)}^{- 1} d x$ , and ${\hat{A}}_{ρ} = \int_{0}^{\infty} e^{- ρ u} A (x) d x$ . Notice that Ē₁(α) < ∞ if α > 0, and Â_ρ < ∞ if ρ > α.

To simplify the presentation, we have discussed the asymptotic behavior of the expectation and variance of the process in Theorems 8.1, 8.2 and 8.3 which have been relegated in the Appendix. The asymptotic behavior of the coefficient of variation is easily deduced from these theorems. Beginning with the time-homogeneous case, we have:

Theorem 3.1 (time-homogeneous Poisson process)

Assume that r(·) ≡ r. Then $lim_{t \to \infty} C_{v} (t) = c_{0}$ , where

c_{0} = {\begin{cases} \sqrt{\frac{γ (\bar{A} + \bar{B}) + γ_{2} \bar{A^{2}}}{r γ^{2} {\bar{A}}^{2}}} & i f α < 0 \\ \sqrt{\frac{m_{2}}{2 r γ μ}} & i f α = 0 \\ \sqrt{\frac{α (γ K_{B 3} + γ_{2} K_{A}^{2})}{2 r γ^{2} K_{A}^{2}}} & i f α > 0. \end{cases}

When the immigration process is time-inhomogeneous, the behavior of C_v(t) differs from that exhibited in the time-homogeneous case. Using c as a generic notation to denote a positive constant that appears in the limit of C_v(t), and which differs in all cases, we have:

Theorem 3.2 (time-inhomogeneous Poisson process)

Case 1. If r(t) = r=(1 + t), then
$C_{v} (t) ~ {\begin{cases} c \sqrt{t} & i f α < 0 \\ c \sqrt{\frac{t}{log t}} & i f α = 0 \\ \sqrt{\frac{γ K_{B 3} + γ_{2} K_{A}^{2}}{r γ^{2} K_{A}^{2} {\bar{E}}_{1} (α)}} & i f α > 0. \end{cases}$
Case 2. If r(t) = rt^θ, θ > −1, then
$C_{v} (t) ~ {\begin{cases} {c t}^{- θ / 2} & i f α \leq 0 \\ \sqrt{\frac{α^{θ + 1} (γ K_{B 3} + γ_{2} K_{A}^{2})}{2 r γ^{2} K_{A}^{2} Γ (θ + 1)}} & i f α > 0. \end{cases}$
Case 3. If r(t) = re^ρt, ρ > 0, then
$C_{v} (t) ~ {\begin{cases} \sqrt{\frac{{(α - ρ)}^{2} (γ K_{B 3} + γ_{2} K_{A}^{2})}{r γ^{2} K_{A}^{2} (2 α - ρ)}} & i f ρ < α \\ {c t}^{- 1} & i f ρ = α \\ c e^{- (ρ - α) t} & i f α < ρ < 2 α \\ c \sqrt{t} e^{- α t} & i f ρ = 2 α \\ c e^{- ρ t / 2} & i f ρ > 2 α . \end{cases}$

Taken together, these results suggest the following conclusions. If the immigration process is time-homogeneous, we see from Theorem 3.1 that logC_v(t) converges to a constant log c₀ as t → ∞. If it is time-inhomogeneous, Theorem 3.2 indicates that log C_v(t) → ±∞ as t → ∞ in most of the considered cases, except (e.g.) for the exponential rate when 0 < ρ < α, where it converges to a constant that differs from log c₀. Moreover, when it diverges, log C_v(t) is, in most cases, asymptotically equivalent to an affine function of h(t), log C_v(t) ~ a₀ + a₁h(t), where h(t) is a function that depends solely on time (e.g., h(t) = t or h(t) = log t or h(t) = log log t) and no other parameters. We will use this property when constructing our test.

3.2. The Markov case

The moments of Z(t) are available in closed-form when the process is Markov and the immigration process is time-homogeneous; that is, when G(t) = 1 − e⁻^t/μ and r(t) = r, t ≥ 0. Specifically, the expectation and variance of the population size are

M (t) = {\begin{cases} \frac{γ r}{α} (e^{α t} - 1) & if α \neq 0 \\ γ r t & if α = 0, \end{cases}

and

V (t) = {\begin{cases} \frac{γ r m_{2} + γ_{2} r α μ}{2 α^{2} μ} (e^{2 α t} - 1) - \frac{γ r}{α} (\frac{m_{2}}{α μ} - 1) (e^{α t} - 1) & if α \neq 0 \\ \frac{γ r m_{2}}{2 μ} t^{2} + r (γ_{2} + γ) t & if α = 0. \end{cases}

We deduce immediately that

C_{v} (t) = {\begin{cases} \sqrt{\frac{γ m_{2} + γ_{2} α μ}{2 μ γ^{2} r} \frac{e^{α t} + 1}{e^{α t} - 1} + \frac{α}{γ r} \frac{1 - m_{2} / α μ}{e^{α t} - 1}} & if α \neq 0 \\ \sqrt{\frac{m_{2}}{2 μ γ r} + \frac{γ_{2} + γ}{r γ^{2} t}} & if α = 0. \end{cases}

The expectation and variance are therefore asymptotically equivalent to

M (t) ~ {\begin{cases} - \frac{r γ}{α} & if α < 0 \\ r γ t & if α = 0 \\ \frac{r γ}{α} e^{α t} & if α > 0, \end{cases}

and

V (t) ~ {\begin{cases} - \frac{r γ}{α} + \frac{r (γ m_{2} - γ_{2} α μ)}{2 μ α^{2}} & if α < 0 \\ \frac{r γ m_{2}}{2 μ} t^{2} & if α = 0 \\ \frac{r}{2 μ α^{2}} (γ m_{2} + γ_{2} α μ) e^{2 α t} & if α > 0. \end{cases}

Finally, we deduce that $lim_{t \to \infty} C_{v} (t) = c_{0}$ , where

c_{0} = {\begin{cases} \sqrt{- \frac{γ_{2} α μ + γ m_{2}}{2 r μ γ^{2}} + \frac{α}{γ r} (1 - \frac{m_{2}}{α μ})} & if α < 0 \\ \sqrt{\frac{m_{2}}{2 r μ γ}} & if α = 0 \\ \sqrt{\frac{γ m_{2} + γ_{2} α μ}{2 r μ γ^{2}}} & if α > 0, \end{cases}

which is consistent with Theorem 3.1. Moreover,

C_{v} (t) - c_{0} = {\begin{cases} O (e^{- ∣ α ∣ t}) & if α \neq 0 \\ O (t^{- 1}) & if α = 0, \end{cases}

(3.8)

and convergence to c₀ occurs quickly over time in all cases.

3.3. Numerical investigations in the non-Markov case

We present results from numerical simulations that further illustrate the behavior of the coefficient of variation of the population size with various immigration rates, including some that were considered in Theorem 3.2.

The population size process Z(t) (t ≥ 0) was simulated by assuming that: (i) the lifespan of every cell has a gamma (non-exponential) distribution with mean 24 and variance 48; and (ii) upon completion of its lifespan, every cell either divides with probability p₂ or dies with probability p₀ = 1 − p₂. We considered different values of p₀ to run simulations with sub- (p₀ > 0.5), super- (p₀ < 0.5), and critical (p₀ = 0.5) processes.

We also considered various rates for the immigration process, including the time-homogeneous rate (r(·) ≡ r) and the following time-inhomogeneous rates:

r(t) = (r + θt) ∧ 0: the immigration rate increases or decreases affinely over time and is constrained to remain positive when θ < 0.
r(t) = r/(1 + t): the immigration rate decreases gradually to 0 over time.
r(t) = r{1 + cos(θt)}: the immigration rate oscillates over time between 0 and 2r
$r (t) = {\begin{cases} r & if t \in [0, t_{0}] \\ {r + θ (t - t_{0})} \lor 0 & if t > t_{0} . \end{cases}$
Here, the immigration rate remains constant between times 0 and t₀, and increases or decreases affinely thereafter.
r(t) = re^θt: the immigration rate increases exponentially fast over time; this scenario could describe immigration of malignant cells from tissues (e.g., bone marrow) in which they multiply exponentially fast.

We simulated the time-inhomogeneous Poisson process using Lewis and Shedler (1979)’s acceptance-rejection method [36]. A single immigrant entered the population at every time T_j (that is, pr(I_j = 1) = 1, j = 1, 2 ⋯).

We simulated 1,000 runs of the branching process with immigration over the time interval [0, 250], and computed the empirical mean, variance and coefficient of variation of these simulations at multiple time points. The results are shown in Figure 1. The plots in the left column were obtained with sub-critical processes (p₀ > 0.5); those in the center were obtained with critical processes (p₀ = 0.5); and those on the right were obtained with super-critical processes (p₀ < 0.5). In the first two rows, we used a time-homogeneous immigration process. The results displayed in the next four rows were obtained with time-inhomogeneous immigration processes. The immigration rate was different in all cases, as indicated in the figure.

Fig 1 — Results from simulations. For each scenario we report: 1,000 simulated sample paths (gray and violet lines), their empirical mean (black line) and coefficient of variation (CV(t)), as well as the immigration rate (r(t)). The panels on the top were obtained with time-homogeneous immigration processes and those below were obtained with various time-dependent immigration processes. The immigration rate is displayed in the subplot embedded in the plot of the coefficient of variation.

When the immigration process was time-homogeneous, the coefficient of variation of Z(t) decreased over time to c₀. Convergence was quick and virtually occurred within the time interval [0, 40]. When the immigration process is time-inhomogeneous, the simulations indicate that the coefficient of variation was time-dependent. Thus, these simulations corroborate results stated in Theorem 3.2. The magnitude of the change in the coefficient of variation differed across scenarios.

4. An Asymptotic Test of Homogeneity of the Immigration Process

We now develop a test of the null hypothesis:

H_{0} : “ r (\cdot) \equiv r ”

that the rate of the immigration process is a constant function of time, and we will test H₀ against the general alternative hypothesis H₁ = H̄₀. The proposed procedure applies when several, independent realizations of the process are observed at discrete points in time. This type of experimental design is commonly used in biology.

Thus, let {Z_i(t)}_t_≥0, i = 1, ⋯, n, denote n independent and identically distributed (i.i.d.) copies of the process {Z(t)}_t_≥0. Assume that each of them is observed at m (m < ∞) discrete time points t₁, …, t_m with 0 < t₁ < ⋯ < t_m < ∞. Write t = (t₁, …, t_m). We note that t₁ → ∞ implies that t_j → ∞ (j = 1, …, m).

We could construct a test of H₀ against the alternative hypothesis H₁ by fitting a branching process with time-inhomogeneous immigration to the data and assessing the significance of the association of the immigration rate with time. This approach would first require specifying a family of distributions for the lifespan and a functional form for the rate of the immigration process. We would next fit the postulated model and construct a p-value to decide whether H₀ should be rejected. Estimation methods are not yet available for age-dependent branching processes with immigration, but could arguably be developed (e.g., by adapting an existing pseudo-likelihood approach proposed for age-dependent branching processes [6, 37]). Here, we consider a different approach that requires none of these steps and assumptions to be made. The developed test only requires existence of a model in the class of branching processes defined in Section 2.1 that can describe the data. It also requires fitting a linear regression model to the empirical coefficients of variation.

The construction of our test rests on Theorem 3.1, which ensures, if H₀ is true, that the coefficient of variation of Z(t) will remain virtually constant between all times of observation, provided that t₁ is sufficiently large. Thus, we propose to test H₀ by checking whether the empirical coefficients of variation computed using the samples ${Z_{i} (t_{j})}_{i = 1}^{n}$ , j = 1, …, m, differ significantly between time points. The comparison is accomplished via linear regression.

For every t ≥ 0, define the kth central moment M₍_k₎(t) = E[{Z(t) − M(t)}^k], assumed finite for k = 1, …, 4. Let

\begin{matrix} \hat{m} (t) = n^{- 1} \sum_{i = 1}^{n} Z_{i} (t), \\ \hat{v} (t) = {(n - 1)}^{- 1} \sum_{i = 1}^{n} {Z_{i} (t) - \hat{m} (t)}^{2} \end{matrix}

and

{\hat{c}}_{v} (t) = \sqrt{\hat{v} (t)} / \hat{m} (t)

(4.1)

denote the sample mean, variance, and coefficient of variation. Using a result from [39], we have that

n^{1 / 2} {{\hat{c}}_{v} (t) - C_{v} (t)} \overset{D}{\to} N {0, σ^{2} (t)},

where

σ^{2} (t) = C_{v} {(t)}^{4} - M_{(3)} (t) M {(t)}^{- 3} + M_{(4)} (t) {4 M {(t)}^{2} V (t)}^{- 1} - C_{v} {(t)}^{2} 4^{- 1} .

This convergence uses a Central Limit Theorem for i.i.d. random variables [38], which applies here because ${Z_{i} (t)}_{i = 1}^{n}$ are i.i.d. by assumption. It also requires that M₍₄₎(t) < ∞. Hence, for any real-valued function L(·) with derivative L′(x) ≠ 0, x > 0, we have that

L ({\hat{c}}_{v} (t)) = L (C_{v} (t)) + n^{- 1 / 2} ε (t) + o_{p} (n^{- 1 / 2}),

(4.2)

where

ε (t) ~ N {0, L^{'} {(C_{v} (t))}^{2} σ^{2} (t)} .

Here, the function L(·) refers to any transformation of C_v(t) that may make it a diverging function of time, should the null hypothesis be violated. Relevant examples include L(u) = log u or L(u) = u, as discussed at the end of Section 3.1. Write Σ_t = Var[(ε(t₁), …, ε(t_m))] and let $\sum_{\infty} = lim_{t_{1} \to \infty} \sum_{t}$ , assuming the limit exists under H₀.

When r(·) ≡ r, Theorem 3.1 and eqn. (4.2) entail that

L ({\hat{c}}_{v} (t)) = L (c_{0}) + n^{- 1 / 2} ε (t) + o (1) + o_{p} (n^{- 1 / 2}),

(4.3)

where o(1) → 0 as t → ∞. Thus, L(ĉ_v(t)) is virtually centered about L(c₀) under H₀ if t is sufficiently large, and we saw in Sections 3.2 and 3.3 that this occurs quickly. From Theorem 3.2, we expect that this is not the case when r(·) is time-dependent.

In order to construct our test statistic, we now formulate the linear model

L ({\hat{c}}_{v} (t_{j})) = x_{j}^{T} θ + η (t_{j}) (j = 1, \dots, m)

(4.4)

where

x_{j} = x_{j} (t_{j}) : = {1 h_{1} (t_{j}) \dots h_{p - 1} (t_{j})}^{T}

is a vector of covariates defined using a given set of functions h₁(·), …, h_p₋₁(·), for some p ≥ 2, where θ = (θ₁, …, θ_p)^T is a p-dimensional parameter vector, and where the r.v. η(t_j) describes the error term of the model. The error terms η(t_j), j = 1, …, m, are not formally centered about zero; however, when H₀ is true, it follows from Theorem 3.1 that their expectation converges to zero as t_j → ∞.

When H₀ is true and when t₁ is sufficiently large, we expect from eqn. (4.3) that Model (4.4) captures the behavior of L(ĉ_v(t)) with θ ≃ θ^*, where

θ^{*} = {L (c_{0}), 0, \dots, 0}^{T},

because all coefficients associated with time should be zero. The functions h₁(·), …, h_p₋₁(·) are chosen so Model (4.4) can detect a change in L(ĉ_v(t)) over time when H₁ is true. With reasonably chosen h_k(·), we expect the relationship (4.4) to hold with θ ≠ θ^* under H₁ because L(ĉ_v(t_i)) should now be time-dependent. Thus, the gist of the proposed test is to assess H₀ by verifying whether θ = θ*.

Define

\begin{array}{l} {\hat{L}}_{c v} (t) = {L ({\hat{c}}_{v} (t_{1})), \dots, L ({\hat{c}}_{v} (t_{n}))}^{T}, \\ L_{c v} (t) = {L (C_{v} (t_{1})), \dots, L (C_{v} (t_{n}))}^{T}, \end{array}

and X = (x₁, ⋯, x_m)^T. Assume that

Assumption 1

rank X = p.

Assumption 1 implies that the matrix X^TX is invertible. Let

\hat{θ} = {(X^{T} X)}^{- 1} X^{T} {\hat{L}}_{c v} (t)

denote the least squares estimator of θ for Model (4.4), and define

θ_{t}^{*} = {(X^{T} X)}^{- 1} X^{T} L_{c v} (t) .

(4.5)

As n → ∞, we have that ${\hat{L}}_{c v} (t) \overset{P}{\to} L_{c v} (t), \hat{θ} \overset{P}{\to} θ_{t}^{*}$ , and

\begin{array}{l} \sqrt{n} (\hat{θ} - θ_{t}^{*}) = {(X^{T} X)}^{- 1} X^{T} {{\hat{L}}_{c v} (t) - L_{c v} (t)} \\ \overset{D}{\to} N (0, V_{t}), \end{array}

(4.6)

where

V_{t} = {(X^{T} X)}^{- 1} X^{T} \sum_{t} X {(X^{T} X)}^{- 1} .

Under H₀, when both n and t₁ increase at appropriate rates, we have that

\sqrt{n} (\hat{θ} - θ^{*}) \overset{D}{\to} N (0, V_{\infty}),

where

V_{\infty} = lim_{t_{1} \to \infty} V_{t} = {(X^{T} X)}^{- 1} X^{T} \sum_{\infty} X {(X^{T} X)}^{- 1} .

Let ρ(t) denote the rate at which C_v(t) converges to c₀ as t → ∞ when H₀ is true; that is, ρ(t) is such that

C_{v} (t) - c_{0} = O (ρ (t)) .

For example, in the Markov case, we showed that

ρ (t) = {\begin{cases} e^{- ∣ α ∣ t} & if α \neq 0 \\ t^{- 1} & if α = 0 \end{cases}

(4.7)

(see eqn. (3.8)). Since θ^* can be expressed as

θ^{*} = {(X^{T} X)}^{- 1} X^{T} L (c_{0}) 1_{p},

where 1_p is a p × 1 vector with all entries equal to 1, it follows from eqn. (4.5) and a Taylor series expansion that

\begin{array}{l} θ_{t}^{*} - θ^{*} = {(X^{T} X)}^{- 1} X^{T} [L_{c v} (t) - L (c_{0}) 1_{p}] \\ = {(X^{T} X)}^{- 1} X^{T} [L^{'} (c_{0}) {C_{v} (t) - c_{0} 1_{p}} + o (‖ C_{v} (t) - c_{0} 1_{p} ‖)] . \end{array}

Thus, when H₀ holds, we deduce that

‖ θ_{t}^{*} - θ^{*} ‖ = O [max_{j = 1, \dots, m} ρ (t_{j})] .

(4.8)

Although converging to 0 under H₀ as t₁ → 0, the residual difference between $θ_{t}^{*}$ and θ^* could be detected if n is large enough, even if H₀ is true. Thus, to impose conditions controlling inflation of the type-1 error rate of our test, we further assume that the study is designed such that:

Assumption 2

If H₀ holds true, then $n = o [{max_{j = 1, \dots, m} ρ (t_{j})}^{- 2}]$ .

This second assumption prevents the sample size n from being too large relative to the time points t_j, j = 1, …, m. It is needed to eliminate the asymptotic bias that may persist in θ̂ under H₀ as n → ∞ as a result of the fact that C_v(t) is not equal to c₀ when t is finite. It is verified if the process has been running for a sufficiently long time before the first observations are made. For example, in the application presented in Section 6, Assumption 2 is expected to be satisfied by the population of normal cells since hemopoiesis, the process of blood cell production, had reached steady state before leukemia cells were inoculated. We note that the assumption pre-assumes that the unit of time remains fixed as n increases.

To further understand the implications of Assumption 2, we can consider the Markov case for which ρ(t) is given in eqn. (4.7). Hence, we have that $max_{j = 1, \dots, m} ρ (t_{j}) = ρ (t_{1})$ , and Assumption 2 is satisfied in this case if n increases with t₁ at the following rate:

n = {\begin{cases} o (e^{2 ∣ α ∣ t_{1}}) & if α \neq 0 \\ o (t_{1}^{2}) & if α = 0. \end{cases}

Therefore, in the Markov case, the impact of increasing the sample size on type-1 errors may be overcome by a moderate increase in t₁. We note that sample sizes are rarely large in biological experiments, such that the validity of Assumption 2 may not be of concern as long as the process is not observed too shortly after the population started to grow.

When H₀ is true, Assumption 2 together with eqns. (4.3), (4.6) and (4.8) imply that

\begin{array}{l} \sqrt{n} (\hat{θ} - θ^{*}) = \sqrt{n} (\hat{θ} - θ_{t}^{*}) + \sqrt{n} (θ_{t}^{*} - θ^{*}) \\ \overset{D}{\to} N (0, V_{\infty}), \end{array}

as n → ∞. For every k = 1, …, q, let θ̂_k denote the kth entry of θ̂. Write θ̂_2:_p = (θ̂₂, …, θ̂_p)^T, and let $V_{t}^{(2 : p)}$ denote the sub-matrix of V_t that corresponds to the asymptotic variance-covariance matrix of θ̂_2:_p. Define the Wald statistic

W = n {\hat{θ}}_{2 : p}^{T} {V_{t}^{(2 : p)}}^{- 1} {\hat{θ}}_{2 : p} .

The above derivations yield immediately the following result:

Theorem 4.1

Assume that Assumptions 1–2 hold. Then $W \overset{D_{H_{0}}}{\to} χ_{p - 1}^{2}$ as n → ∞ where $χ_{p - 1}^{2}$ is a chi-squared distributed random variable with p−1 degrees of freedom.

To implement the test, we must estimate the variance-covariance matrix Σ_t in order to compute the matrix $V_{t}^{(2 : p)}$ used in the expression for W. In simulation studies and in analyses of our experimental data on the progression of leukemia, we have used a bootstrap estimator [32, 33]. We generate B independent samples ${Z_{i}^{*, b} (t), t \geq 0}_{i = 1}^{n}$ by sampling with replacement from ${Z_{i} (t), t \geq 0}_{i = 1}^{n}$ , and calculate ${\hat{c}}_{v}^{*, b} (t_{j})$ , j = 1, …, m, according to eqn. (4.1) based on the bootstrap sample ${Z_{i}^{*, b} (t_{j})}_{i = 1}^{n}$ , b = 1, …, B. We set B = 5, 000 in our simulations and application. Writing

L^{*, b} (t) = {(L^{*, b} ({\hat{c}}_{v}^{*, b} (t_{1})), \dots, L^{*, b} ({\hat{c}}_{v}^{*, b} (t_{m})))}^{T},

we finally approximate Σ_t by

{\sum^{^}}_{t} = \frac{n}{B - 1} \sum_{b = 1}^{B} {L^{*, b} (t) - \frac{1}{B} \sum_{b = 1}^{B} L^{*, b} (t)} {L^{*, b} (t) - \frac{1}{B} \sum_{b = 1}^{B} L^{*, b} (t)}^{T} .

Shao [34] proposed a simple modification of this bootstrap variance estimator based on the idea of truncation. This estimator achieves consistency under broader assumptions that hold in the present setting if M₍₄₎(t) < ∞.

Once an estimator for Σ_t has been chosen, the test is implemented by rejecting H₀ at the significance level δ if $W \geq χ_{p - 1}^{2} (1 - δ)$ , where $χ_{p - 1}^{2} (1 - δ)$ is the 100(1 − δ)th percentile of a chi-squared distribution with p − 1 degrees of freedom.

In practice, the processes {Z_i(t)}_t_≥0, i = 1, …, n, need not all be observed under identical experimental conditions. For example, in our leukemia experiment, we have two groups of observations, each corresponding to a particular sampling scheme. Specificities of the experimental design may be accommodated in the test by including appropriate covariates in the vector of predictors x_j. The test should then reject H₀ if the resulting test statistic W is greater than or equal to the percentile of a chi-squared distribution with p₀ degrees of freedom where p₀ denotes the number of predictors included in x_j that are associated with time, t_j. See Section 6 for specific examples.

To obtain the power function of the test, we expand W as

W = n {({\hat{θ}}_{2 : p} - θ_{t, 2 : p}^{*})}^{T} {V_{t}^{(2 : p)}}^{- 1} ({\hat{θ}}_{2 : p} - θ_{t, 2 : p}^{*}) + n {({\hat{θ}}_{2 : p} - θ_{t, 2 : p}^{*})}^{T} {V_{t}^{(2 : p)}}^{- 1} θ_{t, 2 : p}^{*} + n θ_{t, 2 : p}^{* T} {V_{t}^{(2 : p)}}^{- 1} θ_{t, 2 : p}^{*} .

(4.9)

The first term in the right-hand side of eq. (4.9) converges in distribution under Assumption 1 to a chi-squared distributed random variable with p−1 degrees of freedom; the second term is asymptotically equivalent to $\sqrt{n} U^{T} {V_{t}^{(2 : p)}}^{- T / 2} θ_{t, 2 : p}^{*}$ , where U is a standard normal (p − 1) × 1 random vector. An asymptotic approximation to the power function of the test in large samples is

Q (δ) ≃ pr (U^{T} U - 2 \sqrt{n} U^{T} Δ > χ_{p - 1}^{2} (1 - δ) - n Δ^{T} Δ),

where $Δ = {V_{t}^{(2 : p)}}^{- T / 2} θ_{t, 2 : p}^{*}$ represents a standardized effect size defined as the departure from the null hypothesis expressed through the coefficients of the regression model (4.4) that are associated with time, which we have normalized by the variability matrix $V_{t}^{(2 : p)}$ . The approximation of the power by Q(δ) as a function of the standardized effect size is illustrated in Figure 2.

Fig 2 — Asymptotic power function (Q) plotted against Δ for various sample sizes (n).

The power of the test may be affected by the choice of the regression function of Model (4.4), and one must carefully evaluate the benefits of increasing the number of functions h_k(·) included in the design matrix X in order to capture (potential) complex temporal patterns in the coefficient of variation. In our data analysis, the coefficient of variation appeared to increase linearly over time (for time points ≥ 6 days), such that building W with a regression Model (4.4) in which the regression function depends linearly on time should be an optimal choice. By increasing the flexibility of the regression function (e.g., allowing this function to depend quadratically on time), we increase the degrees of freedom of the asymptotic distribution of W, which could result in loss of power.

5. Simulation Studies

We performed simulations to evaluate the level and power of the proposed test under a variety of immigration rates. The branching process used to generate the data assumed that G(t) was a gamma distribution with mean 20 and variance 40. The offspring generating function was q(s) = p₀ +(1−p₀)s², and we considered three values for p₀ to evaluate the test when the process is subcritical (p₀ = 0.75), critical (p₀ = 0.5) and supercritical (p₀ = 0.35 or 0.45). These parameter values are biologically plausible.

We also considered immigration rates with various shapes to study the level of the test when immigration is time-homogeneous and its power when it depends on time. The functions that define the immigration rates are included in Tables 1 and 2 which show the percentages of times the test rejected the null hypothesis in each scenario. We note that the layout of these tables mirrors that of Figure 1 (for example, in both the figure and tables, the top rows display results obtained with a time-homogeneous immigration process), and thus the reader can examine results from these simulation studies in light of the behavior of the process Z(t) and its coefficient of variation.

Table 1.

Percentage of simulated data sets for which the test rejected H₀ at the 5% nominal level as a function of the sample size and for different types of immigration rates. For each tested scenario, the table reports results based on test statistics constructed using x_j = (1, t_j)′ (linear) and $x_{j} = {(1, t_{j}, t_{j}^{2})}^{'}$ (quadratic), and with log-transformed coefficients of variation (L(u) = log u).

	time-homogeneous immigration process
	subcritical		critical		supercritical
	p₀ = 0.75		p₀= 0.5		p₀ = 0.35
	r(t) = 2		r(t) = 2		r(t) = 2
sample size	linear	quadratic	linear	quadratic	linear	quadratic
25	6.9%	9.4%	6.7%	9.4%	7.6%	8.8%
50	7.1%	8.7%	6.6%	7.2%	7.9%	6.4%
100	5.6%	6.3%	5.8%	6.9%	9.4%	9.7%
200	5.4%	6.1%	6.8%	6.8%	9.8%	9.0%

time-inhomogeneous immigration process

subcritical

critical

supercritical

p₀ = 0.75

p₀= 0.5

p₀ = 0.35

r(t) = (2 − 0.01t) ∨ 0

r(t) = 0.05t

r (t) = \frac{3}{2 (1 + t)}

sample size

linear

quadratic

linear

quadratic

linear

quadratic

100.0%

24.8%

23.9%

11.9%

11.3%

100.0%

40.1%

38.9%

17.4%

16.7%

100

100.0%

67.6%

59.1%

25.5%

23.4%

200

100.0%

92.9%

89.2%

43.7%

37.5%

subcritical

critical

supercritical

p₀ = 0.75

p₀ = 0.5

p₀ = 0.45

r (t) = \frac{1}{2} {1 + cos (\frac{t}{12})}

r (t) = \frac{2}{1 + (t - 80) \lor 0}

r (t) = \frac{1}{2} e^{0.015 t}

sample size

linear

quadratic

linear

quadratic

linear

quadratic

7.2%

82.2%

40.0%

35.3%

80.0%

76.6%

8.9%

99.2%

63.6%

58.4%

99.1%

96.0%

100

9.5%

100.0%

91.7%

83.4%

100.0%

200

13.2%

100.0%

98.8%

100.0%

Open in a new tab

Table 2.

Percentage of simulated data sets for which the test rejected H₀ at the 5% nominal level as a function of the sample size and for different types of immigration rates. For each tested scenario, the table reports results based on test statistics constructed using x_j = (1, t_j)′ (linear) and $x_{j} = {(1, t_{j}, t_{j}^{2})}^{'}$ (quadratic). Here, we did not transformed the coefficients of variation (L(u) = u).

	time-homogeneous immigration process
	subcritical		critical		supercritical
	p₀ = 0.75		p₀= 0.5		p₀ = 0.35
	r(t) = 2		r(t) = 2		r(t) = 2
sample size	linear	quadratic	linear	quadratic	linear	quadratic
25	7.4%	9.9%	8.9%	12.3%	9.1%	11.6%
50	7.3%	6.8%	8.9%	8.8%	8.5%	9.1%
100	6.7%	6.6%	6.9%	7.0%	7.7%	6.9%
200	5.7%	6.4%	6.6%	7.6%	9.6%	10.2%

time-inhomogeneous immigration process

subcritical

critical

supercritical

p₀ = 0.75

p₀ = 0.5

p₀ = 0.35

r(t) = (2 − 0.01t) ∨ 0

r(t) = 0.05t

r (t) = \frac{3}{2 (1 + t)}

sample size

linear

quadratic

linear

quadratic

linear

quadratic

62.6%

82.6%

18.2%

27.1%

9.0%

11.2%

88.9%

100.0%

23.9%

35.7%

11.5%

13.2%

100

100.0%

42.0%

57.0%

16.9%

18.1%

200

100.0%

92.7%

87.6%

27.3%

24.0%

subcritical

critical

supercritical

p₀= 0.75

p₀= 0.5

p₀ = 0.45

r (t) = \frac{1}{2} {1 + cos (\frac{t}{12})}

r (t) = \frac{2}{1 + (t - 80) \lor 0}

r (t) = \frac{1}{2} e^{0.015 t}

sample size

linear

quadratic

linear

quadratic

linear

quadratic

8.8%

78.6%

41.2%

37.1%

84.0%

78.5%

8.5%

99.7%

64.3%

57.8%

98.6%

96.9%

100

11.2%

100.0%

90.3%

84.9%

100.0%

200

18.1%

100.0%

99.8%

99.0%

100.0%

Open in a new tab

For each scenario that was tested, we simulated 1,000 data sets of a given sample size (n). We considered different values of n : 25, 50, 100 and 200. The population size was observed every 12 hours from day 7 to day 10 (thus, m = 7), similar to the study design of our leukemia experiment. In Table 1, the test statistic was constructed with log-transformed coefficients of variation (that is, L(u) = log u). In Table 2, we used untransformed coefficients of variation (that is, L(u) = u). In either case, we set x_j = (1, t_j)^T and $x_{j} = {(1, t_{j}, t_{j}^{2})}^{T}$ , j = 1, …, 6, to study the influence of the choice of x_j on performances.

When the immigration rate was time-homogeneous (see top rows of Tables 1 and 2), the level of the test approached the nominal level of 5% as n increased, except when the process was supercritical where the level increased slightly when increasing the sample size from n = 100 to n = 200. The increase was possibly due to n being too large compared to t₁, violating Assumption 2. The level of the test was closer to the nominal ones when log-transforming the coefficients of variation (that is, when using L(u) = log u), and this was true whether the process was sub-, super- or critical. The test also achieved better levels at small sample sizes when time (t_j) was included only linearly in the vector of predictors (x_j) compared to when it was included both as a linear and a quadratic term ( $t_{j}^{2}$ ). A possible strategy to reduce the rate of type-1 errors is to use nominal significance levels smaller than the traditional 5% level.

The power of the test increased with n. It was generally highest when the process was sub-critical or critical (p₀ ≤ 0.5). With supercritical processes (p₀ > 0.5), larger sample sizes were required to achieve similar power because the contribution of the immigration to the dynamics of the population is eventually overwhelmed by that of the embedded Bellman-Harris process; that is, the cells that are present in the population eventually contribute substantially more to its dynamics than the immigration process, unless the immigration rate increases exponentially fast and has an exponent that is greater than the Malthusian parameter.

The choice of x_j (linear vs. quadratic) affected slightly the power of the test, except when the immigration rate oscillated over time (bottom of first column, $r (t) = \frac{1}{2} {1 + cos (\frac{t}{12})}$ ), where the test achieved limited power when constructed using x_j = (1, t_j)^T. With samples of size 100, and when log-transforming the coefficients of variation (Table 1), it rejected H₀ in only 9.5% of the data sets. By comparison, with $x_{j} = {(1, t_{j}, t_{j}^{2})}^{T}$ , the test regained power because W was built using a more flexible regression function, and rejected H₀ for 100% of the data sets.

We also observed that the test was slightly but consistently more powerful when constructed with log-transformed coefficients of variation (L(u) = log u) than with untransformed coefficients of variation (L(u) = u), and this despite exhibiting lower type-1 error rates.

6. An Application to the Progression of Leukemia

We conducted an experiment to study tumor growth in mice inoculated with leukemia stem cells. Eighteen mice were randomized into 2 groups of 9 mice each. In the first group, blood samples were collected every 24 hours from day 5 to day 10 post-inoculation; in the second group, samples were collected every 24 hours from day 5.5 to day 9.5. We quantified the number of leukemia blast cells and of normal (non-leukemic) cells using flow cytometry. Figure 3 shows the number of leukemic cells (panels A) and the number of normal cells (panels B) on a log-scale. The sample coefficients of variation are plotted against time in panels C and D.

Fig 3 — Cell counts (log-scale; top panels) and sample coefficients of variation (bottom panels) for leukemia blast cells (left panels) and for normal cells (right panels) in two groups of mice (circles: group 1; squares: group 2) plotted against time.

As tumor burden increases, the number of normal cells in the blood changes over time primarily through immigration of cells from the bone marrow and through cell death. Leukemia cells change in number via immigration of cells from other tissues of the body (e.g., the bone marrow) or from differentiation of stem/precursor cells that are already in the blood. These cells also undergo self-renewing division. The impact of apoptosis (cell death) on cancer cells is likely limited, but these cells may exit the blood stream by migrating into other tissues and organs of the body. Thus, the pool of normal cells could be described by a subcritical age-dependent branching process with immigration, whereas a supercritical process with immigration would be more appropriate to model the kinetics of leukemia cells.

The pool of normal cells in the blood expanded over time, and the increase in cell count accelerated around days 8–9. The number of leukemia cells increased exponentially, and there is no clear evidence that their immigration rate might have changed over time. The coefficients of variation for the number of leukemia cells were much higher on day 5 and on day 5.5 than in subsequent days, a possible consequence of the fact that some cell counts were too small to be reliably quantified at these time points. These values were excluded from the analyses. For both cell types, the coefficients of variation increased over time starting from day 6. The trend was similar for the two groups of mice, and suggested that the immigration rate was time-dependent. Based on Theorem 3.2, the shape of the coefficient of variation as observed for normal cells could be consistent with an immigration rate that decreases with time; for example, the immigration rate could have the form r(t) = rt^θ, for some exponent θ ∈ (−1, 0). The methodology presented in this paper does not permit confirming this observation, however.

To assess the assumption that the immigration rate remained constant over time, we constructed our test using the coefficients of variation (L(u) = u) and their log-transformed values (L(u) = log u), and considered two sets of linear predictors:

x_{j} = {[1, g_{j} - 1, t_{j}, (g_{j} - 1) t_{j}]}^{T}

and

x_{j} = {[1, g_{j} - 1, t_{j}, (g_{j} - 1) t_{j}, t_{j}^{2}]}^{T},

where g_j = 1 or 2 in accordance with group membership (i.e., observed from day 5 or from day 5.5). In the first set of predictors (referred to as linear), t_j is entered linearly in the vector x_j, and we compared the value of our test statistic to a chi-squared distribution with two degrees of freedom: one coming from t_j and one from the interaction between t_j and g_j. In the second set of predictors (referred to as quadratic), t_j is entered in x_j both as a linear and as a quadratic term, and we compared the value of our test statistic to a chi-squared distribution with three degrees of freedom because three of the predictors were associated with t_j. We note that group membership (g_j) is included in x_j to allow for potential differences between the coefficients of variation computed in the first and in the second group of mice.

The p-values for normal cells and for leukemia cells obtained with these tests are presented in Table 3. They were all highly significant for normal cells, suggesting that their immigration into the blood was likely time-dependent. The p-values for the coefficient of variation of leukemia cells were either non-significant but trending, or barely significant. Thus, immigration of leukemia cells was also possibly time-dependent. Immigration of leukemia cells could slow down as a result of cell exiting the mitotic cycle to undergo resting by entering the G0 phase. We performed additional experiments (see Figure 4) in which we measured the percentage of cells in the G0–G1 phase and in the S phase. We found that the former decreased while the latter increased over time. These results seemed to confirm our previous conclusions, which may have clinical implications. For example, it has been suggested that patients survival may be prolonged by employing therapeutic strategies designed to maintain the population of tumor cells at acceptable levels rather than attempting to eradicate all cancer cells, which could cause extensive damage to normal tissues. Such an approach would be viable if cancer cells were adapting their dynamics to their micro-environment, which is what our data suggests.

Table 3.

P-values resulting from the analysis of the leukemia data.

	L(u) = u		L(u) = log u
	linear	quadratic	linear	quadratic
normal cells	5.9 × 10⁻⁷	7.8 × 10⁻⁶	1.6 × 10⁻⁹	1.7 × 10⁻¹⁴
leukemia cells	0.051	0.054	0.13	0.027

Open in a new tab

Fig 4 — Percent normal cells (+) and leukemia cells (○) in the G0–G1 phase (left panel) and in the S phase (right panel) plotted against time.

7. Conclusion

We have proposed a test to assess the assumption that the rate of immigration of an age-dependent branching process with immigration is time-dependent. These models find many applications in biology.

Although the construction of our test hinges on the theory of branching processes, its implementation does not require any such process to be fitted. Furthermore, the distribution function of the lifespan and the shape of the immigration rate (if time-dependent) need not be specified. Since the asymptotic behavior of the coefficient of variation holds under fairly mild regularity conditions, this feature endows the test with robustness properties. Practically speaking, only required is fitting a linear model and estimating a variance-covariance matrix using (e.g.) a bootstrap procedure. It requires that multiple independent runs of the process be observed so coefficients of variation can be estimated, which is the case in many biological experiments.

The test exhibited good performances in simulation studies. As previously discussed, its power generally decreases when the embedded Bellman-Harris process is super-critical. We argued that this arises from the fact that immigration has less impact on the dynamics of the process than in the sub- and critical cases. This limitation is inherent to the property of super-critical processes. Thus, we do not expect it to be unique to the proposed test and it is likely that any test would suffer from this limitation. Nonetheless, in our simulation studies, the test exhibited reasonable power in situations that may be encountered in practice. Another limitation of the proposed test is that it is constructed using the asymptotic behavior of the coefficient of variation, and t₁ has to be large enough so C_v(t_j), j = 1, …, m, are close to c₀, should H₀ be true. Exact derivation in the Markov case and our simulations (non-Markov case) suggested that convergence occurs quickly.

This test not only offers insights into cell kinetics, but provides also an important tool in model construction. For example, the analysis presented in Section 6 indicated that the influx of normal (and may be leukemic) cells into the blood stream should be modeled using a time-dependent immigration rate. Estimators for this class of processes are yet to be constructed before we can validate our conclusions. This will be done in future work.

Acknowledgments

This work was funded by NIH R01 Grants NS039511, CA134839, and AI069351.

8. Appendices

8.1. Appendix A: Asymptotic behavior of the mean and variance

We recall the following result (e.g, see [18]):

Theorem 8.1 (Moments when immigration is time-homogeneous)

Assume that r(·) ≡ r.

If α < 0, then M(t) → rγĀ, and $V (t) \to r [γ (\bar{B} + \bar{A}) + γ_{2} \bar{A^{2}}]$ .
If α = 0, then M(t) = rγt, and V (t) ~ γrK_B₂t²/2.
If α > 0, then M(t) ~ rγK_Ae^αt/α, and $V (t) ~ r (γ K_{B 3} + γ_{2} K_{A}^{2}) e^{2 α t} / 2 α$ .

When the immigration rate changes over time, it follows from eqn. (3.5) that M(t) = γR(t) when α = 0. When α ≠ 0, the asymptotic behavior of M(t) is more complex. It is given in Theorem 3.2 for three important classes of immigration rates. Define the quantities: ${\bar{E}}_{1} (α) = \int_{0}^{\infty} e^{- α x} {(1 + x)}^{- 1} d x$ , and ${\hat{A}}_{ρ} = \int_{0}^{\infty} e^{- ρ u} A (x) d x$ . Notice that Ē₁(α) < ∞ if α > 0, and Â_ρ < ∞ if ρ > α.

Theorem 8.2 (Expectation with time-inhomogeneous immigration)

Case 1. If r(t) = r/(1 + t), then M(t) ~ γĀr/(1 + t) if α < 0, and M(t) ~ γrĒ₁(α)A(t) if α > 0.
Case 2. If r(t) = rt^θ, θ > −1, then M(t) ~ γĀr(t) if α < 0, and M(t) ~ γrΓ(θ + 1)A(t)/α^θ⁺¹ if α > 0.
Case 3. If r(t) = re^ρt, ρ > 0, then M(t) ~ γÂ_ρr(t) if ρ > α, M(t) ~ γrA(t)/(α −ρ) if ρ < α, and M(t) ~ γrtA(t) if ρ = α.

Theorem 8.3. (Variance when immigration is time-nonhomogeneous)

Case 1. If r(t) = r/(1 + t), then $V (t) ~ [γ (\bar{B} + \bar{A}) + γ_{2} \bar{A^{2}}] r (t)$ if α < 0, where $\bar{A^{2}} = \int_{0}^{\infty} A^{2} (x) d x < \infty$ ; V(t) ~ γrK_B₂t log(1 + t) if α = 0; $V (t) ~ r {\bar{E}}_{1} (α) (γ K_{B 3} + γ_{2} K_{A}^{2}) e^{2 α t}$ if α > 0.
Case 2. If r(t) = rt^θ, θ > −1, then $V (t) ~ [γ (\bar{B} + \bar{A}) + γ_{2} \bar{A^{2}}] r (t)$ if α < 0; V(t) ~ γrK_B₂t^θ⁺²/(θ + 1)(θ + 2) if α = 0; $V (t) ~ r Γ (θ + 1) (γ K_{B 3} + γ_{2} K_{A}^{2}) e^{2 α t} / {(2 α)}^{θ + 1}$ if α > 0.
Case 3. If r(t) = re^ρt, ρ > 0, then $V (t) ~ [γ ({\hat{A}}_{ρ} + {\hat{B}}_{ρ}) + γ_{2} \hat{A_{ρ}^{2}}] r (t)$ if ρ > 2α where ${\hat{A}}_{ρ} = \int_{0}^{\infty} e^{- ρ u} A (x) d x < \infty, {\hat{B}}_{ρ} = \int_{0}^{\infty} e^{- ρ u} B (x) d x < \infty, \hat{A_{ρ}^{2}} = \int_{0}^{\infty} e^{- ρ u} A^{2} (x) d x < \infty; V (t) ~ r (γ K_{B 3} + γ_{2} K_{A}^{2}) t e^{2 α t}$ if ρ = 2α; $V (t) ~ r (γ K_{B 3} + γ_{2} K_{A}^{2}) e^{2 α t} / (2 α - ρ)$ if ρ < 2α.

8.2. Appendix B: Proof of Theorem 8.2

It follows from eqn. (3.5) that M(t) can be decomposed as M(t) = γ{J₁(t) + J₂(t)}, where $J_{1} (t) = \int_{0}^{ε t} A (t - u) r (u) d u$ and $J_{2} (t) = \int_{ε t}^{t} A (t - u) r (u) d u$ , for some constant ε ∈ (0, 1).

Case 1

If α < 0 then

r (t) \int_{0}^{t (1 - ε)} A (u) d u \leq J_{2} (t) \leq r (t ε) \int_{0}^{t (1 - ε)} A (u) d u,

and we deduce that J₂(t) ~ Ār(t). Next, for large enough t, there exists Δ > 0 such that

J_{1} (t) \leq (K_{A} + Δ) A (t (1 - ε)) log (1 + ε t) = o (r (t)),

and we deduce that M(t) ~ γĀr(t).

Assume now that α > 0. Since Ã(t) = e⁻^αtA(t)/K_A → 1, we deduce that

M (t) = γ K_{A} e^{α t} \int_{0}^{t} \tilde{A} (t - u) e^{- α u} r (u) d u = γ K_{A} e^{α t} {I_{1} (t) + I_{2} (t)},

where $I_{1} (t) = \int_{0}^{ε t} \tilde{A} (t - u) e^{- α u} r (u) d u$ and $I_{2} (t) = \int_{ε t}^{t} \tilde{A} (t - u) e^{- α u} r (u) d u$ for any ε ∈ (0, 1). For large enough t, there exists δ ∈ (0, 1) such that

(1 - δ) \int_{0}^{ε t} e^{- α u} r (u) d u \leq I_{1} (t) \leq (1 + δ) \int_{0}^{ε t} e^{- α u} r (u) d u,

from which we deduce that I₁(t) → rĒ₁(α). There exists also a constant K > 0 such that

I_{2} (t) \leq K \int_{ε t}^{t} e^{- α u} (1 + u) d u \to 0.

Hence M(t) ~ γrĒ₁(α)A(t).

Case 2

Assume that α > 0. Then for large enough t, there exists δ ∈ (0, 1), such that

(1 - δ) r \int_{0}^{ε t} e^{- α u} u^{θ} d u \leq I_{1} (t) \leq (1 + δ) r \int_{0}^{ε t} e^{- α u} u^{θ} d u,

which implies that I₁(t) → rΓ(θ + 1)/α^θ⁺¹. Similarly to the previous case, we have that

I_{2} (t) \leq K r \int_{ε t}^{t} e^{- α u} u^{θ} d u \to 0,

which completes the proof in this case.

Assume next that α < 0. Then

J_{1} (t) \leq r ε^{θ} t^{θ} \int_{t (1 - ε)}^{t} A (x) d x = o (r (t)) .

Similarly to Case 1, we have, for θ ≥ 0, that

r (ε t) \int_{0}^{t (1 - ε)} A (u) d u \leq J_{2} (t) \leq r (t) \int_{0}^{t (1 - ε)} A (u) d u,

and, for θ ∈ (−1, 0),

r (t) \int_{0}^{t (1 - ε)} A (u) d u \leq J_{2} (t) \leq r (ε t) \int_{0}^{t (1 - ε)} A (u) d u,

from which we deduce, in both subcases, that J₂(t) ~ Ār(t). Therefore M(t) ~ γĀr(t).

Case 3

Notice that

M (t) = γ r e^{ρ t} \int_{0}^{t} e^{- ρ u} A (u) d u .

The proof follows directly from this expression when ρ > α. When 0 < ρ ≤ α, the proof results from the fact that, for any T ∈ (0, t),

M (t) = γ r e^{ρ t} {\int_{0}^{T} e^{- ρ u} A (u) d u + \int_{T}^{t} e^{- ρ u} A (u) d u} = γ r e^{ρ t} {L_{1} (T) + L_{2} (t)},

and, for large (but fixed) T, L₁(T) is finite whereas, as t → ∞, L₂(t) ~ K_Ae⁽^α⁻^ρ⁾^t/(α − ρ) when ρ < α and L₂(t) ~ K_At when ρ = α.

8.3. Appendix C: Proof of Theorem 8.3

When α ≠ 0, the proofs are similar to those of Theorem 8.2 and they are omitted. They are only provided for Case 1 when α = 0 (the proofs are similar in Cases 2 and 3).

Case 1

If α = 0, then B(t) = K_B₂t+o(t), and algebraic calculations show that

\begin{array}{l} \int_{0}^{t} r (t - u) udu = r [t log (1 + t) - t + log t] \\ ~ rt log (1 + t) . \end{array}

Moreover $\int_{0}^{t} r (t - u) o (u) d u = I_{1} (t) + I_{2} (t)$ , where, for any ε ∈ (0, 1), $I_{1} (t) = r \int_{0}^{ε t} o (u) {(1 + t - u)}^{- 1} d u$ and $I_{2} (t) = r \int_{ε t}^{t} o (u) {(1 + t - u)}^{- 1} d u$ . It can be shown that

I_{1} (t) \leq r {[1 + t (1 - ε)]}^{- 1} \int_{0}^{ε t} o (u) d u = r {[1 + t (1 - ε)]}^{- 1} o (t^{2}) = o (t) .

For every δ > 0, we can choose εt large enough such that o(x) ≤ δx for x ≥ εt. Therefore

\begin{array}{l} I_{2} (t) = r \int_{ε t}^{t} o (u) {(1 + t - u)}^{- 1} d u \\ \leq r δ \int_{0}^{ε t} u {(1 + t - u)}^{- 1} d u \\ = r δ {t log [1 + t (1 - ε)] - t (1 - ε) + log [1 + t (1 - ε)]}, \end{array}

from which we deduce that I₂(t) = o(t log[1 + t]). Hence $\int_{0}^{t} r (t - u) B (u) d u ~ r K_{B 2} t log (1 + t)$ . Since M(t) = γlog(1 + t), we finally deduce that V(t) ~ γrK_B₂t log(1 + t).

Contributor Information

Ollivier Hyrien, Email: Ollivier_Hyrien@urmc.rochester.edu, University of Rochester, Rochester, New York, USA.

Nikolay M. Yanev, Email: yanev@math.bas.bg, Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, Bulgaria

Craig T. Jordan, Email: craig.jordan@ucdenver.edu, University of Colorado, Denver, USA

References

1.Athreya KB, Ney PE. Branching Processes. Springer-Verlag; Berlin: 1972. [Google Scholar]
2.Jagers P. Branching Processes with Biological Applications. John Wiley and Sons; London: 1975. [Google Scholar]
3.Yakovlev AY, Yanev NM. Transient Processes in Cell Proliferation Kinetics. Heidelberg: Springer-Verlag; 1989. [Google Scholar]
4.Kimmel M, Axelrod DE. Branching Processes in Biology. New York: Springer-Verlag; 2002. [Google Scholar]
5.Haccou P, Jagers P, Vatutin VA. Branching Processes: Variation, Growth and Extinction of Populations. Cambridge: Cambridge University Press; 2005. [Google Scholar]
6.Hyrien O, Mayer-Pröschel M, Noble M, Yakovlev A. A stochastic model to analyze clonal data on multi-type cell populations. Biometrics. 2005;61:199–207. doi: 10.1111/j.0006-341X.2005.031210.x. [DOI] [PubMed] [Google Scholar]
7.Hyrien O, Chen R, Mayer-Pröschel M, Noble M. Saddlepoint approximations to the moments of multi-type age-dependent branching processes, with applications. Biometrics. 2010;66:567–577. doi: 10.1111/j.1541-0420.2009.01281.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Sevastyanov BA. Limit theorems for special types of branching processes. Theory of Probability and its Applications. 1957;2:339–348. [Google Scholar]
9.Jagers P. Age-dependent branching processes allowing immigration. Theory of Probability and its Applications. 1968;13:225–236. [Google Scholar]
10.Yanev NM. Branching stochastic processes with immigration. Bulletin de l’Institut Mathematique, Académie Bulgare des Sciences. 1972;15:71–88. [Google Scholar]
11.Yanev NM. On a class of decomposable age-dependent branching processes. Mathematica Balkanica. 1972;2:58–75. [Google Scholar]
12.Pakes AG. Limit theorems for an age-dependent branching process with immigration. Mathematical Biosciences. 1972;14:221–234. [Google Scholar]
13.Radcliffe J. The convergence of a super-critical age-dependent branching processes allowing immigration at the epochs of a renewal process. Mathematical Biosciences. 1972;14:37–44. [Google Scholar]
14.Kaplan N, Pakes AG. Supercritical age-dependent branching processes with immigration. Stochastic Processes and its Applications. 1974;2:371–389. [Google Scholar]
15.Pakes AG, Kaplan N. On the subcritical Bellman-Harris process with immigration. Journal of Applied Probability. 1974;11:652–668. [Google Scholar]
16.Olofsson P. General branching processes with immigration. Journal of Applied Probability. 1996;33:940–948. [Google Scholar]
17.Yakovlev A, Yanev NM. Branching stochastic processes with immigration in analysis of renewing cell populations. Mathematical Biosciences. 2006;203:37–63. doi: 10.1016/j.mbs.2006.06.001. [DOI] [PubMed] [Google Scholar]
18.Hyrien O, Yanev NM. Age-dependent branching processes with non-homogeneous Poisson immigration as models of cell kinetics. In: Oakes D, Hall WJ, Almudevar A, editors. Modeling and Inference in Biomedical Sciences: in Memory of Andrei Yakovlev. Beachwood, Ohio: IMS Collection Series, Institute of Mathematical Statistics; 2015. in press. [Google Scholar]
19.Vatutin VA. A critical Bellman-Harris branching process with immigration and several types of particles. Theory of Probability and its Applications. 1977;21:435–442. [Google Scholar]
20.Hyrien O, Yanev NM. Asymptotic behavior of cell populations described by two-type reducible age-dependent branching processes with non-homogeneous immigration. Mathematical Population Studies. 2012;19:164–176. doi: 10.1080/08898480.2012.718934. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Hyrien O, Peslak S, Yanev NM, Palis J. Stochastic modeling of stress erythropoiesis using a two-type age-dependent branching process with immigration. Journal of Mathematical Biology. 2015 doi: 10.1007/s00285-014-0803-x. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Guttorp P. Statistical Inference for Branching Processes. Wiley; New York: 1991. [Google Scholar]
23.Yanev NM. Statistical inference for branching processes. In: Ahsanullah M, Yanev GP, editors. Records and Branching Processes. Ch 7. Nova Science Publishers Inc; New York: 2008. pp. 123–142. [Google Scholar]
24.Yakovlev A, Stoimenova VK, Yanev NM. Branching processes as models of progenitor cell populations and estimation of the offspring distributions. Journal of the American Statistical Association. 2008;103:1357–1366. [Google Scholar]
25.Yakovlev A, Yanev NM. Relative frequencies in multitype branching processes. Annals of Applied Probability. 2009;19:1–14. [Google Scholar]
26.Yakovlev A, Yanev NM. Limiting distributions in multitype branching processes. Stochastic Analysis and Applications. 2010;28:1040–1060. doi: 10.1080/07362994.2010.515486. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Hanlon B, Vidyashankar AN. Inference for quantitation parameters in polymerase chain reactions via branching processes with random effects. Journal of the American Statistical Association. 2011;106:525–533. [Google Scholar]
28.Miller GE. Asymptotic test statistics for coefficients of variation. Communications in Statistics - Theory and Methods. 1991;20:3351–3363. [Google Scholar]
29.Feltz CJ, Miller GE. An asymptotic test for the equality of coefficients of variation from k populations. Statistics in Medicine. 1996;20:3351–3363. doi: 10.1002/(sici)1097-0258(19960330)15:6<647::aid-sim184>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]
30.McKay AT. Distribution of the coefficient of variation and the extended ‘t’ distribution. Journal of the Royal Statistical Society. 1932;95:695–698. [Google Scholar]
31.Jordan CT, Guzman ML, Noble M. Cancer stem cells. The New England Journal of Medicine. 2006;355:1253–61. doi: 10.1056/NEJMra061808. [DOI] [PubMed] [Google Scholar]
32.Efron B. Bootstrap methods: another look at the jackknife. Annals of Statistics. 1979;7:1–26. [Google Scholar]
33.Efron B, Tibshirani RJ. An Introduction to the Bootstrap. Chapman & Hall/CRC; New York: 1992. [Google Scholar]
34.Shao J. Bootstrap variance estimators with truncation. Statistics and Probability Letters. 1992;15:95–101. [Google Scholar]
35.Yakovlev A, Yanev NM. Age and residual lifetime distributions for branching processes. Statistics and Probability Letters. 2007;77:503–513. [Google Scholar]
36.Lewis PA, Shedler GS. Simulation of nonhomogeneous Poisson processes by thinning. Naval Research Logistics Quarterly. 1979;26:403–413. [Google Scholar]
37.Hyrien O. Pseudo-likelihood estimation for discretely observed multitype Bellman-Harris branching processes. Journal of Statistical Planning and Inference. 2007;137:1375–1388. doi: 10.1016/j.jspi.2011.01.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Durrett R. Probability Theory. London: Cambridge University Press; 2010. [Google Scholar]
39.Sering RJ. Approximation Theorems of Mathematical Statistics. New York: Wiley; 1980. [Google Scholar]

[R1] 1.Athreya KB, Ney PE. Branching Processes. Springer-Verlag; Berlin: 1972. [Google Scholar]

[R2] 2.Jagers P. Branching Processes with Biological Applications. John Wiley and Sons; London: 1975. [Google Scholar]

[R3] 3.Yakovlev AY, Yanev NM. Transient Processes in Cell Proliferation Kinetics. Heidelberg: Springer-Verlag; 1989. [Google Scholar]

[R4] 4.Kimmel M, Axelrod DE. Branching Processes in Biology. New York: Springer-Verlag; 2002. [Google Scholar]

[R5] 5.Haccou P, Jagers P, Vatutin VA. Branching Processes: Variation, Growth and Extinction of Populations. Cambridge: Cambridge University Press; 2005. [Google Scholar]

[R6] 6.Hyrien O, Mayer-Pröschel M, Noble M, Yakovlev A. A stochastic model to analyze clonal data on multi-type cell populations. Biometrics. 2005;61:199–207. doi: 10.1111/j.0006-341X.2005.031210.x. [DOI] [PubMed] [Google Scholar]

[R7] 7.Hyrien O, Chen R, Mayer-Pröschel M, Noble M. Saddlepoint approximations to the moments of multi-type age-dependent branching processes, with applications. Biometrics. 2010;66:567–577. doi: 10.1111/j.1541-0420.2009.01281.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Sevastyanov BA. Limit theorems for special types of branching processes. Theory of Probability and its Applications. 1957;2:339–348. [Google Scholar]

[R9] 9.Jagers P. Age-dependent branching processes allowing immigration. Theory of Probability and its Applications. 1968;13:225–236. [Google Scholar]

[R10] 10.Yanev NM. Branching stochastic processes with immigration. Bulletin de l’Institut Mathematique, Académie Bulgare des Sciences. 1972;15:71–88. [Google Scholar]

[R11] 11.Yanev NM. On a class of decomposable age-dependent branching processes. Mathematica Balkanica. 1972;2:58–75. [Google Scholar]

[R12] 12.Pakes AG. Limit theorems for an age-dependent branching process with immigration. Mathematical Biosciences. 1972;14:221–234. [Google Scholar]

[R13] 13.Radcliffe J. The convergence of a super-critical age-dependent branching processes allowing immigration at the epochs of a renewal process. Mathematical Biosciences. 1972;14:37–44. [Google Scholar]

[R14] 14.Kaplan N, Pakes AG. Supercritical age-dependent branching processes with immigration. Stochastic Processes and its Applications. 1974;2:371–389. [Google Scholar]

[R15] 15.Pakes AG, Kaplan N. On the subcritical Bellman-Harris process with immigration. Journal of Applied Probability. 1974;11:652–668. [Google Scholar]

[R16] 16.Olofsson P. General branching processes with immigration. Journal of Applied Probability. 1996;33:940–948. [Google Scholar]

[R17] 17.Yakovlev A, Yanev NM. Branching stochastic processes with immigration in analysis of renewing cell populations. Mathematical Biosciences. 2006;203:37–63. doi: 10.1016/j.mbs.2006.06.001. [DOI] [PubMed] [Google Scholar]

[R18] 18.Hyrien O, Yanev NM. Age-dependent branching processes with non-homogeneous Poisson immigration as models of cell kinetics. In: Oakes D, Hall WJ, Almudevar A, editors. Modeling and Inference in Biomedical Sciences: in Memory of Andrei Yakovlev. Beachwood, Ohio: IMS Collection Series, Institute of Mathematical Statistics; 2015. in press. [Google Scholar]

[R19] 19.Vatutin VA. A critical Bellman-Harris branching process with immigration and several types of particles. Theory of Probability and its Applications. 1977;21:435–442. [Google Scholar]

[R20] 20.Hyrien O, Yanev NM. Asymptotic behavior of cell populations described by two-type reducible age-dependent branching processes with non-homogeneous immigration. Mathematical Population Studies. 2012;19:164–176. doi: 10.1080/08898480.2012.718934. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Hyrien O, Peslak S, Yanev NM, Palis J. Stochastic modeling of stress erythropoiesis using a two-type age-dependent branching process with immigration. Journal of Mathematical Biology. 2015 doi: 10.1007/s00285-014-0803-x. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Guttorp P. Statistical Inference for Branching Processes. Wiley; New York: 1991. [Google Scholar]

[R23] 23.Yanev NM. Statistical inference for branching processes. In: Ahsanullah M, Yanev GP, editors. Records and Branching Processes. Ch 7. Nova Science Publishers Inc; New York: 2008. pp. 123–142. [Google Scholar]

[R24] 24.Yakovlev A, Stoimenova VK, Yanev NM. Branching processes as models of progenitor cell populations and estimation of the offspring distributions. Journal of the American Statistical Association. 2008;103:1357–1366. [Google Scholar]

[R25] 25.Yakovlev A, Yanev NM. Relative frequencies in multitype branching processes. Annals of Applied Probability. 2009;19:1–14. [Google Scholar]

[R26] 26.Yakovlev A, Yanev NM. Limiting distributions in multitype branching processes. Stochastic Analysis and Applications. 2010;28:1040–1060. doi: 10.1080/07362994.2010.515486. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] 27.Hanlon B, Vidyashankar AN. Inference for quantitation parameters in polymerase chain reactions via branching processes with random effects. Journal of the American Statistical Association. 2011;106:525–533. [Google Scholar]

[R28] 28.Miller GE. Asymptotic test statistics for coefficients of variation. Communications in Statistics - Theory and Methods. 1991;20:3351–3363. [Google Scholar]

[R29] 29.Feltz CJ, Miller GE. An asymptotic test for the equality of coefficients of variation from k populations. Statistics in Medicine. 1996;20:3351–3363. doi: 10.1002/(sici)1097-0258(19960330)15:6<647::aid-sim184>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]

[R30] 30.McKay AT. Distribution of the coefficient of variation and the extended ‘t’ distribution. Journal of the Royal Statistical Society. 1932;95:695–698. [Google Scholar]

[R31] 31.Jordan CT, Guzman ML, Noble M. Cancer stem cells. The New England Journal of Medicine. 2006;355:1253–61. doi: 10.1056/NEJMra061808. [DOI] [PubMed] [Google Scholar]

[R32] 32.Efron B. Bootstrap methods: another look at the jackknife. Annals of Statistics. 1979;7:1–26. [Google Scholar]

[R33] 33.Efron B, Tibshirani RJ. An Introduction to the Bootstrap. Chapman & Hall/CRC; New York: 1992. [Google Scholar]

[R34] 34.Shao J. Bootstrap variance estimators with truncation. Statistics and Probability Letters. 1992;15:95–101. [Google Scholar]

[R35] 35.Yakovlev A, Yanev NM. Age and residual lifetime distributions for branching processes. Statistics and Probability Letters. 2007;77:503–513. [Google Scholar]

[R36] 36.Lewis PA, Shedler GS. Simulation of nonhomogeneous Poisson processes by thinning. Naval Research Logistics Quarterly. 1979;26:403–413. [Google Scholar]

[R37] 37.Hyrien O. Pseudo-likelihood estimation for discretely observed multitype Bellman-Harris branching processes. Journal of Statistical Planning and Inference. 2007;137:1375–1388. doi: 10.1016/j.jspi.2011.01.016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] 38.Durrett R. Probability Theory. London: Cambridge University Press; 2010. [Google Scholar]

[R39] 39.Sering RJ. Approximation Theorems of Mathematical Statistics. New York: Wiley; 1980. [Google Scholar]

PERMALINK

A test of homogeneity for age-dependent branching processes with immigration

Ollivier Hyrien

Nikolay M Yanev

Craig T Jordan

Abstract

1. Introduction

2. Age-Dependent Branching Processes with Time-Inhomogeneous Immigration

2.1. The general process

2.2. A special case applied to cancer stem cell biology

3. The Coefficient of Variation and its Asymptotic Behavior

3.1. The general case

Theorem 3.1 (time-homogeneous Poisson process)

Theorem 3.2 (time-inhomogeneous Poisson process)

3.2. The Markov case

3.3. Numerical investigations in the non-Markov case

Fig 1.

4. An Asymptotic Test of Homogeneity of the Immigration Process

Assumption 1

Assumption 2

Theorem 4.1

Fig 2.

5. Simulation Studies

Table 1.

Table 2.

6. An Application to the Progression of Leukemia

Fig 3.

Table 3.

Fig 4.

7. Conclusion

Acknowledgments

8. Appendices

8.1. Appendix A: Asymptotic behavior of the mean and variance

Theorem 8.1 (Moments when immigration is time-homogeneous)

Theorem 8.2 (Expectation with time-inhomogeneous immigration)

Theorem 8.3. (Variance when immigration is time-nonhomogeneous)

8.2. Appendix B: Proof of Theorem 8.2

Case 1

Case 2

Case 3

8.3. Appendix C: Proof of Theorem 8.3

Case 1

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases