Abstract
We propose a novel procedure to test whether the immigration process of a discretely observed age-dependent branching process with immigration is time-homogeneous. The construction of the test is motivated by the behavior of the coefficient of variation of the population size. When immigration is time-homogeneous, we find that this coefficient converges to a constant, whereas when immigration is time-inhomogeneous we find that it is time-dependent, at least transiently. Thus, we test the assumption that the immigration process is time-homogeneous by verifying that the sample coefficient of variation does not vary significantly over time. The test is simple to implement and does not require specification or fitting any branching process to the data. Simulations and an application to real data on the progression of leukemia are presented to illustrate the approach.
Keywords and phrases: Coefficient of Variation, Continuous-Time Branching Processes, Leukemia, Non-Homogeneous Poisson Process
1. Introduction
Age-dependent branching processes define a class of continuous-time stochastic processes that is increasingly popular in the biological sciences. They offer flexible, yet tractable, stochastic models that describe population dynamics at the individual level, and allow the lifespan to follow arbitrary distributions (e.g., [1, 2, 3, 4, 5, 6, 7]). Over the years, they have been extended in a number of useful ways. For example, they may include an immigration component to describe an influx of immigrants into the population of interest.
Sevastyanov (1957) was the first to study branching processes with immigration [8]. He investigated the Markov case, and extensions to age-dependent processes were subsequently considered by Jagers and other authors in the single-type [9, 10, 11, 12, 13, 14, 15, 16, 17, 18 ] and the multi-type case [19, 20 ]. Recently, this class of processes has been proposed to model the dynamics of cell populations in vivo under the assumption that immigration obeys a (possibly time-inhomogeneous) Poisson process [17, 18, 20, 21 ].
Statistical methods for the Bienaymé-Galton-Watson process with immigration are discussed in Guttorp’s monograph [22]. In contrast, statistical inference for age-dependent branching processes with immigration has received little, if any, attention to date. In particular, no methods have been developed to determine whether the rate of the immigration process should be taken as constant or allowed to vary with time. We see two reasons for developing a procedure that addresses this question: firstly, it is not always known in practice whether the influx of cells in the population of interest changes over time such that the procedure could lead to valuable biological insights; secondly, from a statistical standpoint, it would help to decide whether the data can support a model with time-inhomogeneous immigration, and thus could prevent over-parameterization issues. The question of how to specify the shape of the immigration rate is also important, but we do not address it here.
The goal of this paper is to propose a test to determine whether the immigration process of an age-dependent branching process with immigration is time-homogeneous based on observations of the population size at discrete time points. The proposed test applies when several independent populations are observed at discrete time points; such experimental designs are commonly used in biology. We first investigate the asymptotic behavior of the coefficient of variation of the population size under various immigration rates and when the branching process is sub-, super-, and critical. We find that the coefficient of variation converges over time to a strictly positive constant when the immigration process is time-homogeneous. In contrast, when the immigration process is time-inhomogeneous, we find that the coefficient of variation is either time-dependent, possibly after applying a suitable transformation, or transits to a different constant. Thus, we construct a test which verifies if the empirical co-efficient of variation changes significantly over time, which is accomplished by techniques of linear regression.
An attractive feature of the test is that it is simple to implement. In particular, it does not require any branching process to be fitted to the data, and it does not impose either that the distribution of the lifespan and the shape of the immigration rate, should it be time-dependent, be formulated. This simplicity is a consequence of the fact that the test is solely constructed from the asymptotic behavior of the process. Statistical methods for branching processes that rely on their asymptotic behavior have been successfully used in the past [22, 23, 24, 25, 26, 27], and the proposed test is built in the same vein. Asymptotic procedures for testing the homogeneity of coefficients of variation across samples have also been proposed [28, 29 ] (see also [30] for a derivation of the asymptotic distribution of the coefficient of variation). These tests do not apply in our setting because they make two assumptions that would not be valid: (1) observations are independent across samples; and (2) observations are normally distributed.
The class of branching processes that we consider is defined in Section 2. Although this work was motivated by a problem that arises from cell biology, we consider a process that is more broadly applicable because our procedure works identically under a more general set of assumptions about the offspring and lifespan distributions, and generalization comes at no cost. In Section 3, we study the asymptotic behavior of the coefficient of variation of the population size when the immigration rate is constant and when it is time-dependent. The Markov case is considered analytically and the non-Markov case is investigated numerically. We develop the test in Section 4. In Section 5, we present results from simulation studies in which we investigate the performance of the test. These studies show that the test possesses power under a variety of alternative hypotheses. An application to a real data set on the dynamics of leukemia is presented in Section 6. The analysis reveals that the immigration rate of normal (non-leukemic) cells into the blood changed over time. Concluding remarks are offered in Section 7. Technical details are provided in the Appendix.
2. Age-Dependent Branching Processes with Time-Inhomogeneous Immigration
2.1. The general process
Without loss of generality, the process begins at t = 0 with zero cells. Let be a sequence of time points at which Ij new cells (thereafter referred to as immigrants) arrive in the population, where is a collection of independent and identically distributed, integer-valued random variables (r.v.). Write γ = E(Ij), γ2 = E{Ij(Ij − 1)}, and g(s) = E(sIj). The immigration process is assumed to be a non-homogeneous Poisson process with instantaneous and cumulative rates r(t) and . When r(t) = r (t ≥ 0), P(t) reduces to a time-homogeneous Poisson process.
Upon completion of its lifespan, every cell of the population, including immigrants, produces a random number ξ of offspring of age zero. Write pk = pr(ξ = k), k = 0, 1 ⋯, for the distribution of ξ. Let q(s) = E(sξ), |s| ≤ 1, denote its probability generating function, and put m = E(ξ) and m2 = E{ξ(ξ − 1)}. Applications to cell biology are primarily concerned with the special case q(s) = p0+p2s2, in which cells may either die with probability p0 or divide into two cells with probability p2 = 1 − p0. The duration of the lifespan is described by a non-negative r.v. τ with cumulative distribution function G(t) = pr(τ ≤ t), assumed non-lattice and satisfying G(0+) = 0. Write μ = E(τ), assumed finite. Every cell behaves independently of every other cell. Finally, assume that γ2 < ∞, m2 < ∞, and μ2 < ∞. The distributions G(·) and q(·) define a Bellman-Harris process embedded in the branching process with immigration.
2.2. A special case applied to cancer stem cell biology
Recent studies have supported the hypothesis that stem cells play a central role not only in the generation and maintenance of multicellular systems, but also in the development of several cancers. For example, they have been identified in several types of leukemia [31]. As it appeared clear that stem cells should be targeted by cancer therapy, understanding their properties, including their dynamics, has become of considerable interest to cancer scientists.
Stem cells are characterized by the unique combination of three features: (1) they can self-renew by producing daughter cells that retain their properties; (2) they can generate multicellular lineages; and (3) they are able to maintain survival of these lineages. They also tend to be rare and are not always experimentally detectable, making the study of their behavior challenging.
The lack of direct observation on stem cells can be attenuated by modeling their contribution to disease progression via an immigration process that describes their influx into the pool of observable cells as they differentiate. The intensity of this influx of newly differentiated cells may vary over time, a feature that can be captured by allowing the immigration process to be time-inhomogeneous. Upon completing their lifespan, differentiated cells divide into two differentiated cells with probability p2 or die with probability p0 = 1 − p2. The duration of the lifespan is often assumed to have a gamma distribution, but other choices are possible (e.g., a log-normal distribution).
3. The Coefficient of Variation and its Asymptotic Behavior
3.1. The general case
Let Z(t) denote the population size at time t. For every t ≥ 0 and |s| ≤ 1, put
for the probability generating function of Z(t). It has the following expression:
(3.1) |
where Ψ(0, s) = 1, and where F(t; s), t ≥ 0, |s| ≤ 1, satisfies
(3.2) |
with the initial condition F(0; s) = s [35]. Define A(t) = ∂F(t; s)/∂s|s=1 and B(t) = ∂2F(t; s)/∂s2|s=1. These functions are the first and second order factorial moments of the embedded Bellman-Harris process started from a single cell at time t = 0. It follows from eqn. (3.2) that A(t) and B(t) are solutions to the renewal-type equations
(3.3) |
and
(3.4) |
with the initial conditions A(0) = 1 and B(0) = 0 [1].
Define the first and second order moments of Z(t): M(t) = E{Z(t) | Z(0) = 0}, M2(t) = E{Z(t){Z(t) − 1} | Z(0) = 0}, V (t) = Var{Z(t) | Z(0) = 0} = M2(t) + M (t){1 − M(t)}, and let Cv(t) = V (t)1/2/M (t) denote the coefficient of variation of Z(t). We deduce from eqn. (3.1) that M(t) and M2(t) take the expressions
(3.5) |
and
(3.6) |
with initial conditions M(0) = 0 and M2(0) = 0. Eqns. (3.5) and (3.6) imply directly that:
(3.7) |
Let α denote the Malthusian parameter of the embedded Bellman-Harris process. Assuming it exists, α is the solution to the equation
The process is said to be sub-critical if α < 0, critical if α = 0, and super-critical if α > 0.
Define and . Application of renewal theory to eqns. (3.3) and (3.4) gives A(t) = 1 if α = 0 and A(t) ~ KAeαt if α ≠ 0, t ≥ 0, where KA = (m − 1)/αmμ̃, and
where , KB2 = m2/μ, and [1].
Additional constants that will appear in the limit of the coefficient of variation include and , which are both finite when α < 0. We also define , and . Notice that Ē1(α) < ∞ if α > 0, and Âρ < ∞ if ρ > α.
To simplify the presentation, we have discussed the asymptotic behavior of the expectation and variance of the process in Theorems 8.1, 8.2 and 8.3 which have been relegated in the Appendix. The asymptotic behavior of the coefficient of variation is easily deduced from these theorems. Beginning with the time-homogeneous case, we have:
Theorem 3.1 (time-homogeneous Poisson process)
Assume that r(·) ≡ r. Then , where
When the immigration process is time-inhomogeneous, the behavior of Cv(t) differs from that exhibited in the time-homogeneous case. Using c as a generic notation to denote a positive constant that appears in the limit of Cv(t), and which differs in all cases, we have:
Theorem 3.2 (time-inhomogeneous Poisson process)
- Case 1. If r(t) = r=(1 + t), then
- Case 2. If r(t) = rtθ, θ > −1, then
- Case 3. If r(t) = reρt, ρ > 0, then
Taken together, these results suggest the following conclusions. If the immigration process is time-homogeneous, we see from Theorem 3.1 that logCv(t) converges to a constant log c0 as t → ∞. If it is time-inhomogeneous, Theorem 3.2 indicates that log Cv(t) → ±∞ as t → ∞ in most of the considered cases, except (e.g.) for the exponential rate when 0 < ρ < α, where it converges to a constant that differs from log c0. Moreover, when it diverges, log Cv(t) is, in most cases, asymptotically equivalent to an affine function of h(t), log Cv(t) ~ a0 + a1h(t), where h(t) is a function that depends solely on time (e.g., h(t) = t or h(t) = log t or h(t) = log log t) and no other parameters. We will use this property when constructing our test.
3.2. The Markov case
The moments of Z(t) are available in closed-form when the process is Markov and the immigration process is time-homogeneous; that is, when G(t) = 1 − e−t/μ and r(t) = r, t ≥ 0. Specifically, the expectation and variance of the population size are
and
We deduce immediately that
The expectation and variance are therefore asymptotically equivalent to
and
Finally, we deduce that , where
which is consistent with Theorem 3.1. Moreover,
(3.8) |
and convergence to c0 occurs quickly over time in all cases.
3.3. Numerical investigations in the non-Markov case
We present results from numerical simulations that further illustrate the behavior of the coefficient of variation of the population size with various immigration rates, including some that were considered in Theorem 3.2.
The population size process Z(t) (t ≥ 0) was simulated by assuming that: (i) the lifespan of every cell has a gamma (non-exponential) distribution with mean 24 and variance 48; and (ii) upon completion of its lifespan, every cell either divides with probability p2 or dies with probability p0 = 1 − p2. We considered different values of p0 to run simulations with sub- (p0 > 0.5), super- (p0 < 0.5), and critical (p0 = 0.5) processes.
We also considered various rates for the immigration process, including the time-homogeneous rate (r(·) ≡ r) and the following time-inhomogeneous rates:
r(t) = (r + θt) ∧ 0: the immigration rate increases or decreases affinely over time and is constrained to remain positive when θ < 0.
r(t) = r/(1 + t): the immigration rate decreases gradually to 0 over time.
r(t) = r{1 + cos(θt)}: the immigration rate oscillates over time between 0 and 2r
Here, the immigration rate remains constant between times 0 and t0, and increases or decreases affinely thereafter.
r(t) = reθt: the immigration rate increases exponentially fast over time; this scenario could describe immigration of malignant cells from tissues (e.g., bone marrow) in which they multiply exponentially fast.
We simulated the time-inhomogeneous Poisson process using Lewis and Shedler (1979)’s acceptance-rejection method [36]. A single immigrant entered the population at every time Tj (that is, pr(Ij = 1) = 1, j = 1, 2 ⋯).
We simulated 1,000 runs of the branching process with immigration over the time interval [0, 250], and computed the empirical mean, variance and coefficient of variation of these simulations at multiple time points. The results are shown in Figure 1. The plots in the left column were obtained with sub-critical processes (p0 > 0.5); those in the center were obtained with critical processes (p0 = 0.5); and those on the right were obtained with super-critical processes (p0 < 0.5). In the first two rows, we used a time-homogeneous immigration process. The results displayed in the next four rows were obtained with time-inhomogeneous immigration processes. The immigration rate was different in all cases, as indicated in the figure.
Fig 1.
Results from simulations. For each scenario we report: 1,000 simulated sample paths (gray and violet lines), their empirical mean (black line) and coefficient of variation (CV(t)), as well as the immigration rate (r(t)). The panels on the top were obtained with time-homogeneous immigration processes and those below were obtained with various time-dependent immigration processes. The immigration rate is displayed in the subplot embedded in the plot of the coefficient of variation.
When the immigration process was time-homogeneous, the coefficient of variation of Z(t) decreased over time to c0. Convergence was quick and virtually occurred within the time interval [0, 40]. When the immigration process is time-inhomogeneous, the simulations indicate that the coefficient of variation was time-dependent. Thus, these simulations corroborate results stated in Theorem 3.2. The magnitude of the change in the coefficient of variation differed across scenarios.
4. An Asymptotic Test of Homogeneity of the Immigration Process
We now develop a test of the null hypothesis:
that the rate of the immigration process is a constant function of time, and we will test H0 against the general alternative hypothesis H1 = H̄0. The proposed procedure applies when several, independent realizations of the process are observed at discrete points in time. This type of experimental design is commonly used in biology.
Thus, let {Zi(t)}t≥0, i = 1, ⋯, n, denote n independent and identically distributed (i.i.d.) copies of the process {Z(t)}t≥0. Assume that each of them is observed at m (m < ∞) discrete time points t1, …, tm with 0 < t1 < ⋯ < tm < ∞. Write t = (t1, …, tm). We note that t1 → ∞ implies that tj → ∞ (j = 1, …, m).
We could construct a test of H0 against the alternative hypothesis H1 by fitting a branching process with time-inhomogeneous immigration to the data and assessing the significance of the association of the immigration rate with time. This approach would first require specifying a family of distributions for the lifespan and a functional form for the rate of the immigration process. We would next fit the postulated model and construct a p-value to decide whether H0 should be rejected. Estimation methods are not yet available for age-dependent branching processes with immigration, but could arguably be developed (e.g., by adapting an existing pseudo-likelihood approach proposed for age-dependent branching processes [6, 37]). Here, we consider a different approach that requires none of these steps and assumptions to be made. The developed test only requires existence of a model in the class of branching processes defined in Section 2.1 that can describe the data. It also requires fitting a linear regression model to the empirical coefficients of variation.
The construction of our test rests on Theorem 3.1, which ensures, if H0 is true, that the coefficient of variation of Z(t) will remain virtually constant between all times of observation, provided that t1 is sufficiently large. Thus, we propose to test H0 by checking whether the empirical coefficients of variation computed using the samples , j = 1, …, m, differ significantly between time points. The comparison is accomplished via linear regression.
For every t ≥ 0, define the kth central moment M(k)(t) = E[{Z(t) − M(t)}k], assumed finite for k = 1, …, 4. Let
and
(4.1) |
denote the sample mean, variance, and coefficient of variation. Using a result from [39], we have that
where
This convergence uses a Central Limit Theorem for i.i.d. random variables [38], which applies here because are i.i.d. by assumption. It also requires that M(4)(t) < ∞. Hence, for any real-valued function L(·) with derivative L′(x) ≠ 0, x > 0, we have that
(4.2) |
where
Here, the function L(·) refers to any transformation of Cv(t) that may make it a diverging function of time, should the null hypothesis be violated. Relevant examples include L(u) = log u or L(u) = u, as discussed at the end of Section 3.1. Write Σt = Var[(ε(t1), …, ε(tm))] and let , assuming the limit exists under H0.
When r(·) ≡ r, Theorem 3.1 and eqn. (4.2) entail that
(4.3) |
where o(1) → 0 as t → ∞. Thus, L(ĉv(t)) is virtually centered about L(c0) under H0 if t is sufficiently large, and we saw in Sections 3.2 and 3.3 that this occurs quickly. From Theorem 3.2, we expect that this is not the case when r(·) is time-dependent.
In order to construct our test statistic, we now formulate the linear model
(4.4) |
where
is a vector of covariates defined using a given set of functions h1(·), …, hp−1(·), for some p ≥ 2, where θ = (θ1, …, θp)T is a p-dimensional parameter vector, and where the r.v. η(tj) describes the error term of the model. The error terms η(tj), j = 1, …, m, are not formally centered about zero; however, when H0 is true, it follows from Theorem 3.1 that their expectation converges to zero as tj → ∞.
When H0 is true and when t1 is sufficiently large, we expect from eqn. (4.3) that Model (4.4) captures the behavior of L(ĉv(t)) with θ ≃ θ*, where
because all coefficients associated with time should be zero. The functions h1(·), …, hp−1(·) are chosen so Model (4.4) can detect a change in L(ĉv(t)) over time when H1 is true. With reasonably chosen hk(·), we expect the relationship (4.4) to hold with θ ≠ θ* under H1 because L(ĉv(ti)) should now be time-dependent. Thus, the gist of the proposed test is to assess H0 by verifying whether θ = θ*.
Define
and X = (x1, ⋯, xm)T. Assume that
Assumption 1
rank X = p.
Assumption 1 implies that the matrix XTX is invertible. Let
denote the least squares estimator of θ for Model (4.4), and define
(4.5) |
As n → ∞, we have that , and
(4.6) |
where
Under H0, when both n and t1 increase at appropriate rates, we have that
where
Let ρ(t) denote the rate at which Cv(t) converges to c0 as t → ∞ when H0 is true; that is, ρ(t) is such that
For example, in the Markov case, we showed that
(4.7) |
(see eqn. (3.8)). Since θ* can be expressed as
where 1p is a p × 1 vector with all entries equal to 1, it follows from eqn. (4.5) and a Taylor series expansion that
Thus, when H0 holds, we deduce that
(4.8) |
Although converging to 0 under H0 as t1 → 0, the residual difference between and θ* could be detected if n is large enough, even if H0 is true. Thus, to impose conditions controlling inflation of the type-1 error rate of our test, we further assume that the study is designed such that:
Assumption 2
If H0 holds true, then .
This second assumption prevents the sample size n from being too large relative to the time points tj, j = 1, …, m. It is needed to eliminate the asymptotic bias that may persist in θ̂ under H0 as n → ∞ as a result of the fact that Cv(t) is not equal to c0 when t is finite. It is verified if the process has been running for a sufficiently long time before the first observations are made. For example, in the application presented in Section 6, Assumption 2 is expected to be satisfied by the population of normal cells since hemopoiesis, the process of blood cell production, had reached steady state before leukemia cells were inoculated. We note that the assumption pre-assumes that the unit of time remains fixed as n increases.
To further understand the implications of Assumption 2, we can consider the Markov case for which ρ(t) is given in eqn. (4.7). Hence, we have that , and Assumption 2 is satisfied in this case if n increases with t1 at the following rate:
Therefore, in the Markov case, the impact of increasing the sample size on type-1 errors may be overcome by a moderate increase in t1. We note that sample sizes are rarely large in biological experiments, such that the validity of Assumption 2 may not be of concern as long as the process is not observed too shortly after the population started to grow.
When H0 is true, Assumption 2 together with eqns. (4.3), (4.6) and (4.8) imply that
as n → ∞. For every k = 1, …, q, let θ̂k denote the kth entry of θ̂. Write θ̂2:p = (θ̂2, …, θ̂p)T, and let denote the sub-matrix of Vt that corresponds to the asymptotic variance-covariance matrix of θ̂2:p. Define the Wald statistic
The above derivations yield immediately the following result:
Theorem 4.1
Assume that Assumptions 1–2 hold. Then as n → ∞ where is a chi-squared distributed random variable with p−1 degrees of freedom.
To implement the test, we must estimate the variance-covariance matrix Σt in order to compute the matrix used in the expression for W. In simulation studies and in analyses of our experimental data on the progression of leukemia, we have used a bootstrap estimator [32, 33]. We generate B independent samples by sampling with replacement from , and calculate , j = 1, …, m, according to eqn. (4.1) based on the bootstrap sample , b = 1, …, B. We set B = 5, 000 in our simulations and application. Writing
we finally approximate Σt by
Shao [34] proposed a simple modification of this bootstrap variance estimator based on the idea of truncation. This estimator achieves consistency under broader assumptions that hold in the present setting if M(4)(t) < ∞.
Once an estimator for Σt has been chosen, the test is implemented by rejecting H0 at the significance level δ if , where is the 100(1 − δ)th percentile of a chi-squared distribution with p − 1 degrees of freedom.
In practice, the processes {Zi(t)}t≥0, i = 1, …, n, need not all be observed under identical experimental conditions. For example, in our leukemia experiment, we have two groups of observations, each corresponding to a particular sampling scheme. Specificities of the experimental design may be accommodated in the test by including appropriate covariates in the vector of predictors xj. The test should then reject H0 if the resulting test statistic W is greater than or equal to the percentile of a chi-squared distribution with p0 degrees of freedom where p0 denotes the number of predictors included in xj that are associated with time, tj. See Section 6 for specific examples.
To obtain the power function of the test, we expand W as
(4.9) |
The first term in the right-hand side of eq. (4.9) converges in distribution under Assumption 1 to a chi-squared distributed random variable with p−1 degrees of freedom; the second term is asymptotically equivalent to , where U is a standard normal (p − 1) × 1 random vector. An asymptotic approximation to the power function of the test in large samples is
where represents a standardized effect size defined as the departure from the null hypothesis expressed through the coefficients of the regression model (4.4) that are associated with time, which we have normalized by the variability matrix . The approximation of the power by Q(δ) as a function of the standardized effect size is illustrated in Figure 2.
Fig 2.
Asymptotic power function (Q) plotted against Δ for various sample sizes (n).
The power of the test may be affected by the choice of the regression function of Model (4.4), and one must carefully evaluate the benefits of increasing the number of functions hk(·) included in the design matrix X in order to capture (potential) complex temporal patterns in the coefficient of variation. In our data analysis, the coefficient of variation appeared to increase linearly over time (for time points ≥ 6 days), such that building W with a regression Model (4.4) in which the regression function depends linearly on time should be an optimal choice. By increasing the flexibility of the regression function (e.g., allowing this function to depend quadratically on time), we increase the degrees of freedom of the asymptotic distribution of W, which could result in loss of power.
5. Simulation Studies
We performed simulations to evaluate the level and power of the proposed test under a variety of immigration rates. The branching process used to generate the data assumed that G(t) was a gamma distribution with mean 20 and variance 40. The offspring generating function was q(s) = p0 +(1−p0)s2, and we considered three values for p0 to evaluate the test when the process is subcritical (p0 = 0.75), critical (p0 = 0.5) and supercritical (p0 = 0.35 or 0.45). These parameter values are biologically plausible.
We also considered immigration rates with various shapes to study the level of the test when immigration is time-homogeneous and its power when it depends on time. The functions that define the immigration rates are included in Tables 1 and 2 which show the percentages of times the test rejected the null hypothesis in each scenario. We note that the layout of these tables mirrors that of Figure 1 (for example, in both the figure and tables, the top rows display results obtained with a time-homogeneous immigration process), and thus the reader can examine results from these simulation studies in light of the behavior of the process Z(t) and its coefficient of variation.
Table 1.
Percentage of simulated data sets for which the test rejected H0 at the 5% nominal level as a function of the sample size and for different types of immigration rates. For each tested scenario, the table reports results based on test statistics constructed using xj = (1, tj)′ (linear) and (quadratic), and with log-transformed coefficients of variation (L(u) = log u).
time-homogeneous immigration process | ||||||
---|---|---|---|---|---|---|
subcritical | critical | supercritical | ||||
p0 = 0.75 | p0= 0.5 | p0 = 0.35 | ||||
r(t) = 2
|
r(t) = 2
|
r(t) = 2
|
||||
sample size | linear | quadratic | linear | quadratic | linear | quadratic |
25 | 6.9% | 9.4% | 6.7% | 9.4% | 7.6% | 8.8% |
50 | 7.1% | 8.7% | 6.6% | 7.2% | 7.9% | 6.4% |
100 | 5.6% | 6.3% | 5.8% | 6.9% | 9.4% | 9.7% |
200 | 5.4% | 6.1% | 6.8% | 6.8% | 9.8% | 9.0% |
time-inhomogeneous immigration process | |||||||||
---|---|---|---|---|---|---|---|---|---|
subcritical | critical | supercritical | |||||||
p0 = 0.75 | p0= 0.5 | p0 = 0.35 | |||||||
r(t) = (2 − 0.01t) ∨ 0
|
r(t) = 0.05t
|
|
|||||||
sample size | linear | quadratic | linear | quadratic | linear | quadratic | |||
25 | 100.0% | 100.0% | 24.8% | 23.9% | 11.9% | 11.3% | |||
50 | 100.0% | 100.0% | 40.1% | 38.9% | 17.4% | 16.7% | |||
100 | 100.0% | 100.0% | 67.6% | 59.1% | 25.5% | 23.4% | |||
200 | 100.0% | 100.0% | 92.9% | 89.2% | 43.7% | 37.5% | |||
subcritical | critical | supercritical | |||||||
p0 = 0.75 | p0 = 0.5 | p0 = 0.45 | |||||||
|
|
|
|||||||
sample size | linear | quadratic | linear | quadratic | linear | quadratic | |||
25 | 7.2% | 82.2% | 40.0% | 35.3% | 80.0% | 76.6% | |||
50 | 8.9% | 99.2% | 63.6% | 58.4% | 99.1% | 96.0% | |||
100 | 9.5% | 100.0% | 91.7% | 83.4% | 100.0% | 100.0% | |||
200 | 13.2% | 100.0% | 100.0% | 98.8% | 100.0% | 100.0% |
Table 2.
Percentage of simulated data sets for which the test rejected H0 at the 5% nominal level as a function of the sample size and for different types of immigration rates. For each tested scenario, the table reports results based on test statistics constructed using xj = (1, tj)′ (linear) and (quadratic). Here, we did not transformed the coefficients of variation (L(u) = u).
time-homogeneous immigration process | ||||||
---|---|---|---|---|---|---|
subcritical | critical | supercritical | ||||
p0 = 0.75 | p0= 0.5 | p0 = 0.35 | ||||
r(t) = 2
|
r(t) = 2
|
r(t) = 2
|
||||
sample size | linear | quadratic | linear | quadratic | linear | quadratic |
25 | 7.4% | 9.9% | 8.9% | 12.3% | 9.1% | 11.6% |
50 | 7.3% | 6.8% | 8.9% | 8.8% | 8.5% | 9.1% |
100 | 6.7% | 6.6% | 6.9% | 7.0% | 7.7% | 6.9% |
200 | 5.7% | 6.4% | 6.6% | 7.6% | 9.6% | 10.2% |
time-inhomogeneous immigration process | |||||||||
---|---|---|---|---|---|---|---|---|---|
subcritical | critical | supercritical | |||||||
p0 = 0.75 | p0 = 0.5 | p0 = 0.35 | |||||||
r(t) = (2 − 0.01t) ∨ 0
|
r(t) = 0.05t
|
|
|||||||
sample size | linear | quadratic | linear | quadratic | linear | quadratic | |||
25 | 62.6% | 82.6% | 18.2% | 27.1% | 9.0% | 11.2% | |||
50 | 88.9% | 100.0% | 23.9% | 35.7% | 11.5% | 13.2% | |||
100 | 100.0% | 100.0% | 42.0% | 57.0% | 16.9% | 18.1% | |||
200 | 100.0% | 100.0% | 92.7% | 87.6% | 27.3% | 24.0% | |||
subcritical | critical | supercritical | |||||||
p0= 0.75 | p0= 0.5 | p0 = 0.45 | |||||||
|
|
|
|||||||
sample size | linear | quadratic | linear | quadratic | linear | quadratic | |||
25 | 8.8% | 78.6% | 41.2% | 37.1% | 84.0% | 78.5% | |||
50 | 8.5% | 99.7% | 64.3% | 57.8% | 98.6% | 96.9% | |||
100 | 11.2% | 100.0% | 90.3% | 84.9% | 100.0% | 100.0% | |||
200 | 18.1% | 100.0% | 99.8% | 99.0% | 100.0% | 100.0% |
For each scenario that was tested, we simulated 1,000 data sets of a given sample size (n). We considered different values of n : 25, 50, 100 and 200. The population size was observed every 12 hours from day 7 to day 10 (thus, m = 7), similar to the study design of our leukemia experiment. In Table 1, the test statistic was constructed with log-transformed coefficients of variation (that is, L(u) = log u). In Table 2, we used untransformed coefficients of variation (that is, L(u) = u). In either case, we set xj = (1, tj)T and , j = 1, …, 6, to study the influence of the choice of xj on performances.
When the immigration rate was time-homogeneous (see top rows of Tables 1 and 2), the level of the test approached the nominal level of 5% as n increased, except when the process was supercritical where the level increased slightly when increasing the sample size from n = 100 to n = 200. The increase was possibly due to n being too large compared to t1, violating Assumption 2. The level of the test was closer to the nominal ones when log-transforming the coefficients of variation (that is, when using L(u) = log u), and this was true whether the process was sub-, super- or critical. The test also achieved better levels at small sample sizes when time (tj) was included only linearly in the vector of predictors (xj) compared to when it was included both as a linear and a quadratic term ( ). A possible strategy to reduce the rate of type-1 errors is to use nominal significance levels smaller than the traditional 5% level.
The power of the test increased with n. It was generally highest when the process was sub-critical or critical (p0 ≤ 0.5). With supercritical processes (p0 > 0.5), larger sample sizes were required to achieve similar power because the contribution of the immigration to the dynamics of the population is eventually overwhelmed by that of the embedded Bellman-Harris process; that is, the cells that are present in the population eventually contribute substantially more to its dynamics than the immigration process, unless the immigration rate increases exponentially fast and has an exponent that is greater than the Malthusian parameter.
The choice of xj (linear vs. quadratic) affected slightly the power of the test, except when the immigration rate oscillated over time (bottom of first column, ), where the test achieved limited power when constructed using xj = (1, tj)T. With samples of size 100, and when log-transforming the coefficients of variation (Table 1), it rejected H0 in only 9.5% of the data sets. By comparison, with , the test regained power because W was built using a more flexible regression function, and rejected H0 for 100% of the data sets.
We also observed that the test was slightly but consistently more powerful when constructed with log-transformed coefficients of variation (L(u) = log u) than with untransformed coefficients of variation (L(u) = u), and this despite exhibiting lower type-1 error rates.
6. An Application to the Progression of Leukemia
We conducted an experiment to study tumor growth in mice inoculated with leukemia stem cells. Eighteen mice were randomized into 2 groups of 9 mice each. In the first group, blood samples were collected every 24 hours from day 5 to day 10 post-inoculation; in the second group, samples were collected every 24 hours from day 5.5 to day 9.5. We quantified the number of leukemia blast cells and of normal (non-leukemic) cells using flow cytometry. Figure 3 shows the number of leukemic cells (panels A) and the number of normal cells (panels B) on a log-scale. The sample coefficients of variation are plotted against time in panels C and D.
Fig 3.
Cell counts (log-scale; top panels) and sample coefficients of variation (bottom panels) for leukemia blast cells (left panels) and for normal cells (right panels) in two groups of mice (circles: group 1; squares: group 2) plotted against time.
As tumor burden increases, the number of normal cells in the blood changes over time primarily through immigration of cells from the bone marrow and through cell death. Leukemia cells change in number via immigration of cells from other tissues of the body (e.g., the bone marrow) or from differentiation of stem/precursor cells that are already in the blood. These cells also undergo self-renewing division. The impact of apoptosis (cell death) on cancer cells is likely limited, but these cells may exit the blood stream by migrating into other tissues and organs of the body. Thus, the pool of normal cells could be described by a subcritical age-dependent branching process with immigration, whereas a supercritical process with immigration would be more appropriate to model the kinetics of leukemia cells.
The pool of normal cells in the blood expanded over time, and the increase in cell count accelerated around days 8–9. The number of leukemia cells increased exponentially, and there is no clear evidence that their immigration rate might have changed over time. The coefficients of variation for the number of leukemia cells were much higher on day 5 and on day 5.5 than in subsequent days, a possible consequence of the fact that some cell counts were too small to be reliably quantified at these time points. These values were excluded from the analyses. For both cell types, the coefficients of variation increased over time starting from day 6. The trend was similar for the two groups of mice, and suggested that the immigration rate was time-dependent. Based on Theorem 3.2, the shape of the coefficient of variation as observed for normal cells could be consistent with an immigration rate that decreases with time; for example, the immigration rate could have the form r(t) = rtθ, for some exponent θ ∈ (−1, 0). The methodology presented in this paper does not permit confirming this observation, however.
To assess the assumption that the immigration rate remained constant over time, we constructed our test using the coefficients of variation (L(u) = u) and their log-transformed values (L(u) = log u), and considered two sets of linear predictors:
and
where gj = 1 or 2 in accordance with group membership (i.e., observed from day 5 or from day 5.5). In the first set of predictors (referred to as linear), tj is entered linearly in the vector xj, and we compared the value of our test statistic to a chi-squared distribution with two degrees of freedom: one coming from tj and one from the interaction between tj and gj. In the second set of predictors (referred to as quadratic), tj is entered in xj both as a linear and as a quadratic term, and we compared the value of our test statistic to a chi-squared distribution with three degrees of freedom because three of the predictors were associated with tj. We note that group membership (gj) is included in xj to allow for potential differences between the coefficients of variation computed in the first and in the second group of mice.
The p-values for normal cells and for leukemia cells obtained with these tests are presented in Table 3. They were all highly significant for normal cells, suggesting that their immigration into the blood was likely time-dependent. The p-values for the coefficient of variation of leukemia cells were either non-significant but trending, or barely significant. Thus, immigration of leukemia cells was also possibly time-dependent. Immigration of leukemia cells could slow down as a result of cell exiting the mitotic cycle to undergo resting by entering the G0 phase. We performed additional experiments (see Figure 4) in which we measured the percentage of cells in the G0–G1 phase and in the S phase. We found that the former decreased while the latter increased over time. These results seemed to confirm our previous conclusions, which may have clinical implications. For example, it has been suggested that patients survival may be prolonged by employing therapeutic strategies designed to maintain the population of tumor cells at acceptable levels rather than attempting to eradicate all cancer cells, which could cause extensive damage to normal tissues. Such an approach would be viable if cancer cells were adapting their dynamics to their micro-environment, which is what our data suggests.
Table 3.
P-values resulting from the analysis of the leukemia data.
L(u) = u
|
L(u) = log u
|
|||
---|---|---|---|---|
linear | quadratic | linear | quadratic | |
normal cells | 5.9 × 10−7 | 7.8 × 10−6 | 1.6 × 10−9 | 1.7 × 10−14 |
leukemia cells | 0.051 | 0.054 | 0.13 | 0.027 |
Fig 4.
Percent normal cells (+) and leukemia cells (○) in the G0–G1 phase (left panel) and in the S phase (right panel) plotted against time.
7. Conclusion
We have proposed a test to assess the assumption that the rate of immigration of an age-dependent branching process with immigration is time-dependent. These models find many applications in biology.
Although the construction of our test hinges on the theory of branching processes, its implementation does not require any such process to be fitted. Furthermore, the distribution function of the lifespan and the shape of the immigration rate (if time-dependent) need not be specified. Since the asymptotic behavior of the coefficient of variation holds under fairly mild regularity conditions, this feature endows the test with robustness properties. Practically speaking, only required is fitting a linear model and estimating a variance-covariance matrix using (e.g.) a bootstrap procedure. It requires that multiple independent runs of the process be observed so coefficients of variation can be estimated, which is the case in many biological experiments.
The test exhibited good performances in simulation studies. As previously discussed, its power generally decreases when the embedded Bellman-Harris process is super-critical. We argued that this arises from the fact that immigration has less impact on the dynamics of the process than in the sub- and critical cases. This limitation is inherent to the property of super-critical processes. Thus, we do not expect it to be unique to the proposed test and it is likely that any test would suffer from this limitation. Nonetheless, in our simulation studies, the test exhibited reasonable power in situations that may be encountered in practice. Another limitation of the proposed test is that it is constructed using the asymptotic behavior of the coefficient of variation, and t1 has to be large enough so Cv(tj), j = 1, …, m, are close to c0, should H0 be true. Exact derivation in the Markov case and our simulations (non-Markov case) suggested that convergence occurs quickly.
This test not only offers insights into cell kinetics, but provides also an important tool in model construction. For example, the analysis presented in Section 6 indicated that the influx of normal (and may be leukemic) cells into the blood stream should be modeled using a time-dependent immigration rate. Estimators for this class of processes are yet to be constructed before we can validate our conclusions. This will be done in future work.
Acknowledgments
This work was funded by NIH R01 Grants NS039511, CA134839, and AI069351.
8. Appendices
8.1. Appendix A: Asymptotic behavior of the mean and variance
We recall the following result (e.g, see [18]):
Theorem 8.1 (Moments when immigration is time-homogeneous)
Assume that r(·) ≡ r.
If α < 0, then M(t) → rγĀ, and .
If α = 0, then M(t) = rγt, and V (t) ~ γrKB2t2/2.
If α > 0, then M(t) ~ rγKAeαt/α, and .
When the immigration rate changes over time, it follows from eqn. (3.5) that M(t) = γR(t) when α = 0. When α ≠ 0, the asymptotic behavior of M(t) is more complex. It is given in Theorem 3.2 for three important classes of immigration rates. Define the quantities: , and . Notice that Ē1(α) < ∞ if α > 0, and Âρ < ∞ if ρ > α.
Theorem 8.2 (Expectation with time-inhomogeneous immigration)
Case 1. If r(t) = r/(1 + t), then M(t) ~ γĀr/(1 + t) if α < 0, and M(t) ~ γrĒ1(α)A(t) if α > 0.
Case 2. If r(t) = rtθ, θ > −1, then M(t) ~ γĀr(t) if α < 0, and M(t) ~ γrΓ(θ + 1)A(t)/αθ+1 if α > 0.
Case 3. If r(t) = reρt, ρ > 0, then M(t) ~ γÂρr(t) if ρ > α, M(t) ~ γrA(t)/(α −ρ) if ρ < α, and M(t) ~ γrtA(t) if ρ = α.
Theorem 8.3. (Variance when immigration is time-nonhomogeneous)
Case 1. If r(t) = r/(1 + t), then if α < 0, where ; V(t) ~ γrKB2t log(1 + t) if α = 0; if α > 0.
Case 2. If r(t) = rtθ, θ > −1, then if α < 0; V(t) ~ γrKB2tθ+2/(θ + 1)(θ + 2) if α = 0; if α > 0.
Case 3. If r(t) = reρt, ρ > 0, then if ρ > 2α where if ρ = 2α; if ρ < 2α.
8.2. Appendix B: Proof of Theorem 8.2
It follows from eqn. (3.5) that M(t) can be decomposed as M(t) = γ{J1(t) + J2(t)}, where and , for some constant ε ∈ (0, 1).
Case 1
If α < 0 then
and we deduce that J2(t) ~ Ār(t). Next, for large enough t, there exists Δ > 0 such that
and we deduce that M(t) ~ γĀr(t).
Assume now that α > 0. Since Ã(t) = e−αtA(t)/KA → 1, we deduce that
where and for any ε ∈ (0, 1). For large enough t, there exists δ ∈ (0, 1) such that
from which we deduce that I1(t) → rĒ1(α). There exists also a constant K > 0 such that
Hence M(t) ~ γrĒ1(α)A(t).
Case 2
Assume that α > 0. Then for large enough t, there exists δ ∈ (0, 1), such that
which implies that I1(t) → rΓ(θ + 1)/αθ+1. Similarly to the previous case, we have that
which completes the proof in this case.
Assume next that α < 0. Then
Similarly to Case 1, we have, for θ ≥ 0, that
and, for θ ∈ (−1, 0),
from which we deduce, in both subcases, that J2(t) ~ Ār(t). Therefore M(t) ~ γĀr(t).
Case 3
Notice that
The proof follows directly from this expression when ρ > α. When 0 < ρ ≤ α, the proof results from the fact that, for any T ∈ (0, t),
and, for large (but fixed) T, L1(T) is finite whereas, as t → ∞, L2(t) ~ KAe(α−ρ)t/(α − ρ) when ρ < α and L2(t) ~ KAt when ρ = α.
8.3. Appendix C: Proof of Theorem 8.3
When α ≠ 0, the proofs are similar to those of Theorem 8.2 and they are omitted. They are only provided for Case 1 when α = 0 (the proofs are similar in Cases 2 and 3).
Case 1
If α = 0, then B(t) = KB2t+o(t), and algebraic calculations show that
Moreover , where, for any ε ∈ (0, 1), and . It can be shown that
For every δ > 0, we can choose εt large enough such that o(x) ≤ δx for x ≥ εt. Therefore
from which we deduce that I2(t) = o(t log[1 + t]). Hence . Since M(t) = γlog(1 + t), we finally deduce that V(t) ~ γrKB2t log(1 + t).
Contributor Information
Ollivier Hyrien, Email: Ollivier_Hyrien@urmc.rochester.edu, University of Rochester, Rochester, New York, USA.
Nikolay M. Yanev, Email: yanev@math.bas.bg, Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, Bulgaria
Craig T. Jordan, Email: craig.jordan@ucdenver.edu, University of Colorado, Denver, USA
References
- 1.Athreya KB, Ney PE. Branching Processes. Springer-Verlag; Berlin: 1972. [Google Scholar]
- 2.Jagers P. Branching Processes with Biological Applications. John Wiley and Sons; London: 1975. [Google Scholar]
- 3.Yakovlev AY, Yanev NM. Transient Processes in Cell Proliferation Kinetics. Heidelberg: Springer-Verlag; 1989. [Google Scholar]
- 4.Kimmel M, Axelrod DE. Branching Processes in Biology. New York: Springer-Verlag; 2002. [Google Scholar]
- 5.Haccou P, Jagers P, Vatutin VA. Branching Processes: Variation, Growth and Extinction of Populations. Cambridge: Cambridge University Press; 2005. [Google Scholar]
- 6.Hyrien O, Mayer-Pröschel M, Noble M, Yakovlev A. A stochastic model to analyze clonal data on multi-type cell populations. Biometrics. 2005;61:199–207. doi: 10.1111/j.0006-341X.2005.031210.x. [DOI] [PubMed] [Google Scholar]
- 7.Hyrien O, Chen R, Mayer-Pröschel M, Noble M. Saddlepoint approximations to the moments of multi-type age-dependent branching processes, with applications. Biometrics. 2010;66:567–577. doi: 10.1111/j.1541-0420.2009.01281.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Sevastyanov BA. Limit theorems for special types of branching processes. Theory of Probability and its Applications. 1957;2:339–348. [Google Scholar]
- 9.Jagers P. Age-dependent branching processes allowing immigration. Theory of Probability and its Applications. 1968;13:225–236. [Google Scholar]
- 10.Yanev NM. Branching stochastic processes with immigration. Bulletin de l’Institut Mathematique, Académie Bulgare des Sciences. 1972;15:71–88. [Google Scholar]
- 11.Yanev NM. On a class of decomposable age-dependent branching processes. Mathematica Balkanica. 1972;2:58–75. [Google Scholar]
- 12.Pakes AG. Limit theorems for an age-dependent branching process with immigration. Mathematical Biosciences. 1972;14:221–234. [Google Scholar]
- 13.Radcliffe J. The convergence of a super-critical age-dependent branching processes allowing immigration at the epochs of a renewal process. Mathematical Biosciences. 1972;14:37–44. [Google Scholar]
- 14.Kaplan N, Pakes AG. Supercritical age-dependent branching processes with immigration. Stochastic Processes and its Applications. 1974;2:371–389. [Google Scholar]
- 15.Pakes AG, Kaplan N. On the subcritical Bellman-Harris process with immigration. Journal of Applied Probability. 1974;11:652–668. [Google Scholar]
- 16.Olofsson P. General branching processes with immigration. Journal of Applied Probability. 1996;33:940–948. [Google Scholar]
- 17.Yakovlev A, Yanev NM. Branching stochastic processes with immigration in analysis of renewing cell populations. Mathematical Biosciences. 2006;203:37–63. doi: 10.1016/j.mbs.2006.06.001. [DOI] [PubMed] [Google Scholar]
- 18.Hyrien O, Yanev NM. Age-dependent branching processes with non-homogeneous Poisson immigration as models of cell kinetics. In: Oakes D, Hall WJ, Almudevar A, editors. Modeling and Inference in Biomedical Sciences: in Memory of Andrei Yakovlev. Beachwood, Ohio: IMS Collection Series, Institute of Mathematical Statistics; 2015. in press. [Google Scholar]
- 19.Vatutin VA. A critical Bellman-Harris branching process with immigration and several types of particles. Theory of Probability and its Applications. 1977;21:435–442. [Google Scholar]
- 20.Hyrien O, Yanev NM. Asymptotic behavior of cell populations described by two-type reducible age-dependent branching processes with non-homogeneous immigration. Mathematical Population Studies. 2012;19:164–176. doi: 10.1080/08898480.2012.718934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Hyrien O, Peslak S, Yanev NM, Palis J. Stochastic modeling of stress erythropoiesis using a two-type age-dependent branching process with immigration. Journal of Mathematical Biology. 2015 doi: 10.1007/s00285-014-0803-x. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Guttorp P. Statistical Inference for Branching Processes. Wiley; New York: 1991. [Google Scholar]
- 23.Yanev NM. Statistical inference for branching processes. In: Ahsanullah M, Yanev GP, editors. Records and Branching Processes. Ch 7. Nova Science Publishers Inc; New York: 2008. pp. 123–142. [Google Scholar]
- 24.Yakovlev A, Stoimenova VK, Yanev NM. Branching processes as models of progenitor cell populations and estimation of the offspring distributions. Journal of the American Statistical Association. 2008;103:1357–1366. [Google Scholar]
- 25.Yakovlev A, Yanev NM. Relative frequencies in multitype branching processes. Annals of Applied Probability. 2009;19:1–14. [Google Scholar]
- 26.Yakovlev A, Yanev NM. Limiting distributions in multitype branching processes. Stochastic Analysis and Applications. 2010;28:1040–1060. doi: 10.1080/07362994.2010.515486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hanlon B, Vidyashankar AN. Inference for quantitation parameters in polymerase chain reactions via branching processes with random effects. Journal of the American Statistical Association. 2011;106:525–533. [Google Scholar]
- 28.Miller GE. Asymptotic test statistics for coefficients of variation. Communications in Statistics - Theory and Methods. 1991;20:3351–3363. [Google Scholar]
- 29.Feltz CJ, Miller GE. An asymptotic test for the equality of coefficients of variation from k populations. Statistics in Medicine. 1996;20:3351–3363. doi: 10.1002/(sici)1097-0258(19960330)15:6<647::aid-sim184>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]
- 30.McKay AT. Distribution of the coefficient of variation and the extended ‘t’ distribution. Journal of the Royal Statistical Society. 1932;95:695–698. [Google Scholar]
- 31.Jordan CT, Guzman ML, Noble M. Cancer stem cells. The New England Journal of Medicine. 2006;355:1253–61. doi: 10.1056/NEJMra061808. [DOI] [PubMed] [Google Scholar]
- 32.Efron B. Bootstrap methods: another look at the jackknife. Annals of Statistics. 1979;7:1–26. [Google Scholar]
- 33.Efron B, Tibshirani RJ. An Introduction to the Bootstrap. Chapman & Hall/CRC; New York: 1992. [Google Scholar]
- 34.Shao J. Bootstrap variance estimators with truncation. Statistics and Probability Letters. 1992;15:95–101. [Google Scholar]
- 35.Yakovlev A, Yanev NM. Age and residual lifetime distributions for branching processes. Statistics and Probability Letters. 2007;77:503–513. [Google Scholar]
- 36.Lewis PA, Shedler GS. Simulation of nonhomogeneous Poisson processes by thinning. Naval Research Logistics Quarterly. 1979;26:403–413. [Google Scholar]
- 37.Hyrien O. Pseudo-likelihood estimation for discretely observed multitype Bellman-Harris branching processes. Journal of Statistical Planning and Inference. 2007;137:1375–1388. doi: 10.1016/j.jspi.2011.01.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Durrett R. Probability Theory. London: Cambridge University Press; 2010. [Google Scholar]
- 39.Sering RJ. Approximation Theorems of Mathematical Statistics. New York: Wiley; 1980. [Google Scholar]