Birth/birth-death processes and their computable transition probabilities with biological applications

Lam Si Tung Ho; Jason Xu; Forrest W Crawford; Vladimir N Minin; Marc A Suchard

doi:10.1007/s00285-017-1160-3

. Author manuscript; available in PMC: 2019 Mar 1.

Published in final edited form as: J Math Biol. 2017 Jul 24;76(4):911–944. doi: 10.1007/s00285-017-1160-3

Birth/birth-death processes and their computable transition probabilities with biological applications

Lam Si Tung Ho ¹, Jason Xu ², Forrest W Crawford ³, Vladimir N Minin ⁴, Marc A Suchard ⁵

PMCID: PMC5783825 NIHMSID: NIHMS895156 PMID: 28741177

Abstract

Birth-death processes track the size of a univariate population, but many biological systems involve interaction between populations, necessitating models for two or more populations simultaneously. A lack of efficient methods for evaluating finite-time transition probabilities of bivariate processes, however, has restricted statistical inference in these models. Researchers rely on computationally expensive methods such as matrix exponentiation or Monte Carlo approximation, restricting likelihood-based inference to small systems, or indirect methods such as approximate Bayesian computation. In this paper, we introduce the birth/birth-death process, a tractable bivariate extension of the birth-death process, where rates are allowed to be nonlinear. We develop an efficient algorithm to calculate its transition probabilities using a continued fraction representation of their Laplace transforms. Next, we identify several exemplary models arising in molecular epidemiology, macro-parasite evolution, and infectious disease modeling that fall within this class, and demonstrate advantages of our proposed method over existing approaches to inference in these models. Notably, the ubiquitous stochastic susceptible-infectious-removed (SIR) model falls within this class, and we emphasize that computable transition probabilities newly enable direct inference of parameters in the SIR model. We also propose a very fast method for approximating the transition probabilities under the SIR model via a novel branching process simplification, and compare it to the continued fraction representation method with application to the 17th century plague in Eyam. Although the two methods produce similar maximum a posteriori estimates, the branching process approximation fails to capture the correlation structure in the joint posterior distribution.

Keywords: stochastic models, birth-death process, infectious disease, SIR model, transition probabilities

1 Introduction

Birth-death processes have been used extensively in many applications including evolutionary biology, ecology, population genetics, epidemiology, and queuing theory (see e.g. Novozhilov et al, 2006; Crawford and Suchard, 2012; Doss et al, 2013; Rabier et al, 2014; Crawford et al, 2015). However, establishing analytic and computationally practical formulae for their transition probabilities is usually difficult (Novozhilov et al, 2006). The state-of-the-art method for computing the transition probabilities of birth-death processes proposed in Crawford and Suchard (2012) enables statistical estimation for general birth-death processes using likelihood-based inference (Crawford et al, 2014). Unfortunately, birth-death processes inherently only track one population, and extending this technique beyond the univariate case is nontrivial. Many applied models require the consideration of two or more interacting populations simultaneously to model behavior such as competition, predation, or infection. Examples of such bivariate models include epidemic models (McKendrick, 1926; Kermack and McKendrick, 1927; Griffiths, 1972), predator-prey models (Hitchcock, 1986; Owen et al, 2015), genetic models (Rosenberg et al, 2003; Xu et al, 2015), and within-host macro-parasite models (Drovandi and Pettitt, 2011).

The most general extensions of birth-death processes to bivariate processes are competition processes (Reuter, 1961). These processes allow not only “birth” and “death” events in each population, but also “transition” events where an individual moves from one population to the other. Unlike birth-death processes, few attempts have been made to compute the transition probabilities of competition processes or their special cases. Hence, researchers usually rely on classical continuous-time Markov chain methods such as matrix exponentiation and diffusion approximation. Unfortunately, these methods fail to leverage the specific structure of competition processes, and have several intrinsic limitations. Matrix exponentiation methods compute the transition probability matrix P(t) by solving the matrix form of Kolmogorov’s forward equation P′(t) = P(t)Q with initial condition P(0) = I, where Q is the instantaneous rate matrix of the process. While this equation admits a unique solution P(t) = exp(Qt) (Ephraim and Mark, 2012), numerical evaluation of the matrix exponential is often troublesome (Moler and Loan, 2003). Its computational cost via eigenvalue decomposition, for instance, is cubic in the size of the state-space and thus becomes computationally prohibitive even with moderately sized state-spaces (Drovandi and Pettitt, 2011; Crawford and Suchard, 2012). For example, Keeling and Ross (2008) demonstrate that computing transition probabilities via matrix exponentiation for the simplest epidemic models is practical only when modeling spread of an infectious disease through a very small population (e.g., 100 people). Moreover, matrix exponentiation can introduce serious rounding errors for certain rate matrices even for biologically reasonable values (Schranz et al, 2008; Crawford and Suchard, 2012; Crawford et al, 2014). Diffusion approximations, on the other hand, require the state-space to be large in order to justify approximating a discrete process by a continuous-valued diffusion process (Karev et al, 2005; Golightly and Wilkinson, 2005), and can often remain inaccurate for simulation even in settings with large state-spaces (Golightly and Wilkinson, 2005). Branching processes form another closely related class of processes, and have been used in a likelihood-based framework to study bivariate populations (Xu et al, 2015). Branching processes are at once more general than competition processes, permitting events that increment populations by more than one, and also more restrictive in that linearity is implied by an assumption that particles act independently. The latter assumption is limiting in epidemiological applications, for instance, which commonly feature non-linear interactions between populations.

The lack of a reliable method for computing transition probabilities in bivariate processes forces researchers to apply alternative likelihood-free approaches such as approximate Bayesian computation (ABC) (Blum and Tran, 2010; Drovandi and Pettitt, 2011; Owen et al, 2015). The ABC approach uses simulated and observed summary statistics to bypass likelihood evaluation. Nonetheless, this is not a panacea approach that can completely replace traditional likelihood-based methods. The ABC method itself has several sources for loss of information such as non-zero tolerance, and non-sufficient summary statistics (Sunnåker et al, 2013). The tolerance is an ad hoc threshold to decide whether ABC accepts a new proposal. If the tolerance is zero and the summary statistics are sufficient, ABC is guaranteed to return the correct posterior distribution. In practice, however, tolerance is always positive which often leads to bias. In the context of counting processes, sufficient summary statistics usually do not exist because the data are observed partially. Thus, credible interval estimates under ABC are potentially inflated due to the loss of information (Csilléry et al, 2010). Also, when sufficient summary statistics are not available, the ABC method can not be trusted in selecting between models (Robert et al, 2011). Because of all these limitations, direct likelihood-based methods are often more favorable.

In this paper, we develop an efficient method to compute the transition probabilities of a subclass of competition processes with two interacting populations of particles, enabling likelihood-based inference. We call this subclass birth(death)/birth-death processes, whose first population is increasing (decreasing). It is worth mentioning that we do not impose linearity condition for the rates of these processes. A rigorous characterization of this class of processes and derivation of recursive formulae to compute their transition probabilities are provided in Section 2. Our main tools are the Laplace transform and continued fractions that have been successfully applied for univariate birth-death processes in Crawford and Suchard (2012). These formulae enable accurate and computationally efficient numerical computation of transition probabilities. We implement this method in the new R package MultiBD https://github.com/msuchard/MultiBD. In Section 3, we discuss multiple scientifically relevant applications of birth(death)/birth-death processes including stochastic susceptible-infectious-removed (SIR) models in epidemiology (McKendrick, 1926; Kermack and McKendrick, 1927; Raggett, 1982), monomolecular reaction systems (Jahnke and Huisinga, 2007), a birth-death-shift model for transposable elements (Rosenberg et al, 2003; Xu et al, 2015), and a within-host macro-parasite model (Riley et al, 2003; Drovandi and Pettitt, 2011). We examine the accuracy of our method in simulation studies, including comparisons to branching process, matrix exponentiation method, and Monte Carlo approximations. Finally, we apply our method to estimate infection rates and death rates during the plague of Eyam in 1666 within a likelihood-based Bayesian framework in Section 4.

Previous work on computing the transition probabilities

Analytic expressions of the transition probabilities have only been found for some special cases such as linear birth-death processes (see e.g. Novozhilov et al, 2006) and monomolecular reaction systems (Jahnke and Huisinga, 2007). Therefore, matrix exponentiation is still the most common method for computing the transition probabilities of general Markov processes. The state-of-the-art software package for exponentiating sparse matrices is Expokit (Sidje, 1998; Moler and Loan, 2003), which uses Krylov subspace projection method. van den Eshof and Hochbruck (2006) propose a modified version using a simple preconditioned transformation to improve the convergence behavior of this method. Although matrix exponentiation has the advantage of generality in that it can be applied to any Markov process, it is not the most efficient method in many scenarios. Recently, Crawford and Suchard (2012) propose an efficient method for evaluating the transition probabilities of general birth-death processes using Laplace transform and continued fraction. However, efficient methods that extend this result to general bivariate birth-death processes have yet to be found.

2 Birth(death)/birth-death processes

2.1 Birth/birth-death processes

A birth/birth-death process is a bivariate continuous-time Markov process X(t) = (X₁(t), X₂(t)), t ≥ 0, whose state-space is in ℕ × ℕ, the Cartesian product of the non-negative integers. We can describe a birth/birth-death process as governing dynamics of a system consisting two types of particles, where one out of four possible events can happen in infinitesimal time: (1) a new type 1 particle enters the system; (2) a new type 2 particle enters the system; (3) a type 2 particle leaves the system; or (4) a type 2 particle becomes a type 1 particle. In this system, X₁(t) and X₂(t) track the number of type 1 and type 2 particles at time t respectively. Mathematically, there are five possibilities for X(t) during a small time interval (t, t + dt):

\begin{array}{l} Pr {\begin{cases} X_{1} (t + d t) = a + 1 & X_{1} (t) = a \\ X_{2} (t + d t) = b & X_{2} (t) = b \end{cases}} & = λ_{a b}^{(1)} d t + o (d t) \\ Pr {\begin{cases} X_{1} (t + d t) = a & X_{1} (t) = a \\ X_{2} (t + d t) = b + 1 & X_{2} (t) = b \end{cases}} & = λ_{a b}^{(2)} d t + o (d t) \\ Pr {\begin{cases} X_{1} (t + d t) = a & X_{1} (t) = a \\ X_{2} (t + d t) = b - 1 & X_{2} (t) = b \end{cases}} & = μ_{a b}^{(2)} d t + o (d t) \\ Pr {\begin{cases} X_{1} (t + d t) = a + 1 & X_{1} (t) = a \\ X_{2} (t + d t) = b - 1 & X_{2} (t) = b \end{cases}} & = γ_{a b} d t + o (d t) \\ Pr {\begin{cases} X_{1} (t + d t) = a & X_{1} (t) = a \\ X_{2} (t + d t) = b & X_{2} (t) = b \end{cases}} & = 1 - (λ_{a b}^{(1)} + λ_{a b}^{(2)} + μ_{a b}^{(2)} + γ_{a b}) d t + o (d t), \end{array}

(1)

where a, b ∈ ℕ, $λ_{a b}^{(1)} \geq 0$ is the birth rate of type 1 particles given a type 1 particles and b type 2 particles, $λ_{a b}^{(2)} \geq 0$ is the equivalent birth rate of type 2 particles, $μ_{a b}^{(2)} \geq 0$ is the death rate of type 2 particles, and γ_ab is the transition rate from type 2 particles to type 1 particles. We fix $λ_{- 1, b}^{(1)} = λ_{a, - 1}^{(2)} = μ_{a 0}^{(2)} = γ_{- 1, b} = γ_{a 0} = 0$ .

Letting $P_{a b}^{a_{0} b_{0}} (t) = Pr {X (t) = (a, b) ∣ X (0) = (a_{0}, b_{0})}$ , the forward Kolmogorov’s equations for the birth/birth-death process are

\begin{array}{l} \frac{d P_{a b}^{a_{0} b_{0}} (t)}{d t} = λ_{a - 1, b}^{(1)} P_{a - 1, b}^{a_{0} b_{0}} (t) + λ_{a, b - 1}^{(2)} P_{a, b - 1}^{a_{0} b_{0}} (t) + μ_{a, b + 1}^{(2)} P_{a, b + 1}^{a_{0} b_{0}} (t) \\ + γ_{a - 1, b + 1} P_{a - 1, b + 1}^{a_{0} b_{0}} (t) - (λ_{a b}^{(1)} + λ_{a b}^{(2)} + μ_{a b}^{(2)} + γ_{a b}) P_{a b}^{a_{0} b_{0}} (t), \end{array}

(2)

for all (a, b). In practice, we can usually only observe the process discretely. In this scenario, the likelihood function is the product of transition probabilities between consecutive observations. Therefore, computing $P_{a b}^{a_{0} b_{0}} (t)$ is an important step for any direct likelihood-based analysis.

In general, a birth/birth-death process is a special case of a competition process (Reuter, 1961) with rate matrix Q = {q_ij} where i, j ∈ ℕ × ℕ and

Competition process

Birth/birth-death

(a + 1, b)

q₍_a_,_b₎₍_a_+1,_b₎

λ_{a b}^{(1)}

(a − 1, b)

q₍_a_,_b₎₍_a_−1,_b₎

(a, b + 1)

q₍_a_,_b₎₍_a_,_b₊₁₎

λ_{a b}^{(2)}

(a, b − 1)

q₍_a_,_b₎₍_a_,_b₋₁₎

μ_{a b}^{(2)}

(a + 1, b − 1)

q₍_a_,_b₎₍_a_+1,_b₋₁₎

γ_ab

(a − 1, b + 1)

q₍_a_,_b₎₍_a_−1,_b₊₁₎

(a, b)

- \sum_{k, l \in {- 1, 0, 1}}^{k \neq l} q_{(a, b) (a + k, b + l)}

- (λ_{a b}^{(1)} + λ_{a b}^{(2)} + μ_{a b}^{(2)} + γ_{a b})

other

Open in a new tab

for i = (a, b). Competition processes are the most general bivariate Markov processes that only allow transitions between neighboring states. Many practical models in biology are special cases of these processes such as epidemic models (McKendrick, 1926; Kermack and McKendrick, 1927; Griffiths, 1972) and predator-prey models (Hitchcock, 1986; Owen et al, 2015).

2.1.1 Sufficient condition for regularity

Definition 1

A birth/birth-death process is regular if there is a unique set of transition probabilities $P_{a b}^{a_{0} b_{0}} (t)$ satisfying the system of equations (2).

Here, we establish the sufficient condition for regularity of a birth/birth-death process. For k ∈ ℕ, we denote:

\begin{array}{l} D_{k} = {(a, b) : a + b = k} \in ℕ \times ℕ, and \\ λ_{k} = max_{(a, b) \in D_{k}} {λ_{a b}^{(1)} + λ_{a b}^{(2)}} . \end{array}

(3)

Theorem 1

The sufficient condition for regularity of a general birth/birth-death process is $\sum_{k = 1}^{\infty} 1 / λ_{k} = \infty$ .

Proof

We will apply the following Reuter’s condition (Reuter, 1957):

Lemma 1

Let Q = {q_ij} be a conservative matrix, such that −q_ii = Σ_j_≠_i q_ij < ∞. A continuous-time Markov chain associated with Q is regular if and only if for some ζ > 0, the equation Qy = ζy subject to 0 ≤ y_i ≤ 1 has only trivial solution y = 0.

For a general birth/birth-death process, states i and j are in ℕ × ℕ. Let {y_ab}_a_,_b_∈ℕ be a solution of Qy = ζy such that y_ab ∈ [0, 1] for any a and b. Then, we have

(ζ + λ_{a b}^{(1)} + λ_{a b}^{(2)} + μ_{a b}^{(2)} + γ_{a b}) y_{a b} = λ_{a b}^{(1)} y_{a + 1, b} + λ_{a b}^{(2)} y_{a, b + 1} + μ_{a b}^{(2)} y_{a, b - 1} + γ_{a b} y_{a + 1, b - 1} .

(4)

Defining y_k = max_{(a,b)∈D_k}{y_ab} and (a_k, b_k) = argmax_{(a,b)∈D_k}{y_ab}, we deduce that

\begin{array}{l} (ζ + λ_{a_{k} b_{k}}^{(1)} + λ_{a_{k} b_{k}}^{(2)} + μ_{a_{k} b_{k}}^{(2)}) y_{k} & \leq (λ_{a_{k} b_{k}}^{(1)} + λ_{a_{k} b_{k}}^{(2)}) y_{k + 1} + μ_{a_{k} b_{k}}^{(2)} y_{k - 1}, and \\ ζ y_{k} + μ_{a_{k} b_{k}}^{(2)} (y_{k} - y_{k - 1}) & \leq (λ_{a_{k} b_{k}}^{(1)} + λ_{a_{k} b_{k}}^{(2)}) (y_{k + 1} - y_{k}) . \end{array}

(5)

Since $μ_{a_{- 1} b_{- 1}}^{(2)} = 0$ , y_k is an increasing sequence. Thus,

\frac{ζ}{λ_{k}} y_{k} \leq y_{k + 1} - y_{k} .

(6)

Assuming that there exists k₀ such that y_k₀ > 0, we obtain

y_{k} \geq y_{k_{0}} + ζ \sum_{i = k_{0}}^{k - 1} \frac{y_{i}}{λ_{i}} \geq y_{k_{0}} (1 + ζ \sum_{i = k_{0}}^{k - 1} \frac{1}{λ_{i}}),

(7)

that is larger than 1 if k is big enough. Hence y_k = 0 for every k. Then, the theorem is proved by applying Lemma 1.

Note that the condition in Theorem 1 generalizes the classical regularity condition of a pure birth process (Feller, 1968). From now on, we assume that our birth/birth-death processes are regular.

2.1.2 Recursive formula for transition probabilities

In this section, we establish a recursion to calculate the transition probabilities $P_{a b}^{a_{0} b_{0}} (t)$ of a birth/birth-death process. Since we assume that our birth/birth-death process is regular, these transition probabilities are unique.

We first note that $P_{a b}^{a_{0} b_{0}} (t) = 0$ for all a < a₀. Let f_ab(s), s ∈ ℂ, be the Laplace transform of $P_{a b}^{a_{0} b_{0} (t)}$ , that is

f_{a b} (s) = L [P_{a b}^{a_{0} b_{0}} (t)] (s) = \int_{0}^{\infty} e^{- s t} P_{a b}^{a_{0} b_{0}} (t) d t .

(8)

From (2), we have

\begin{array}{l} s f_{a b} (s) - P_{a b}^{a_{0} b_{0}} (0) = λ_{a - 1, b}^{(1)} f_{a - 1, b} (s) + λ_{a, b - 1}^{(2)} f_{a, b - 1} (s) + μ_{a, b + 1}^{(2)} f_{a, b + 1} (s) \\ + γ_{a - 1, b + 1} f_{a - 1, b + 1} (s) - (λ_{a b}^{(1)} + λ_{a b}^{(2)} + μ_{a b}^{(2)} + γ_{a b}) f_{a b} (s), (a, b) \in ℕ^{2} . \end{array}

(9)

Note that f_ab(s) is the unique solution of (9) by the uniqueness of $P_{a b}^{a_{0} b_{0}} (t)$ . We construct the recursive approximation formulae for f_ab(s) using continued fractions. Appendix A provides necessary background on continued fractions and their convergents. Denote

\begin{array}{l} x_{a 1} = - \frac{1}{μ_{a 1}^{(2)}}; x_{a b} = - \frac{λ_{a, b - 2}^{(2)}}{μ_{a b}^{(2)}}, b \geq 2 \\ y_{a b} = - \frac{s + λ_{a, b - 1}^{(1)} + λ_{a, b - 1}^{(2)} + μ_{a, b - 1}^{(2)} + γ_{a, b - 1}}{μ_{a b}^{(2)}}, b \geq 1, \end{array}

(10)

and consider the following continued fraction

ϕ_{a 0}^{(0)} (s) = \frac{x_{a 1}}{y_{a 1} + \frac{x_{a 2}}{y_{a 2} + \frac{x_{a 3}}{y_{a 3} + \dots} .}}

(11)

We can construct the sequence ${ϕ_{a b}^{(0)} (s)}_{b = 0}^{\infty}$ (Definition A3, Appendix A) as follows:

\begin{array}{l} (s + λ_{a 0}^{(1)} + λ_{a 0}^{(2)}) ϕ_{a 0}^{(0)} (s) - μ_{a 1}^{(2)} ϕ_{a 1}^{(0)} (s) = 1, and \\ (s + λ_{a, b - 1}^{(1)} + λ_{a, b - 1}^{(2)} + μ_{a, b - 1}^{(2)} + γ_{a, b - 1}) ϕ_{a, b - 1}^{(0)} (s) - λ_{a, b - 2}^{(2)} ϕ_{a, b - 2}^{(0)} (s) - μ_{a b}^{(2)} ϕ_{a b} (s) = 0, b \geq 2. \end{array}

(12)

Comparing the sequences in (12) with (9), we deduce that $L^{- 1} [ϕ_{a b}^{(0)} (s)] = P_{a b}^{a_{0} 0} (t)$ . Since $P_{a b}^{a_{0} 0} (t)$ is a probability distribution, we have $\sum_{(a, b) \in ℕ \times ℕ} P_{a b}^{a_{0} 0} (t) = 1$ . Taking the Laplace transform of the previous equation, we get $\sum_{(a, b) \in ℕ \times ℕ} ϕ_{a b}^{(0)} (s) = 1 / s$ . Hence, ${lim}_{b \to \infty} ϕ_{a_{0} b}^{(0)} (s) = 0$ for every s > 0. By Lemma A1 (Appendix A), $ϕ_{a 0}^{(0)} (s)$ converges for every s > 0, and

ϕ_{a b}^{(0)} (s) = \prod_{i = 1}^{b} x_{a i} \frac{x_{a, b + 1}}{Y_{a, b + 1} + \frac{x_{a, b + 2} Y_{a b}}{y_{a, b + 2} + \frac{x_{a, b + 3}}{y_{a, b + 3} + \frac{x_{a, b + 4}}{y_{a, b + 4} + \dots},}}}

(13)

where Y_ab is the denominator of the b^th convergent of $ϕ_{a 0}^{(0)} (s)$ .

From (9), we note that

(s + λ_{a_{0} b}^{(1)} + λ_{a_{0} b}^{(2)} + μ_{a_{0} b}^{(2)} + γ_{a_{0} b}) f_{a_{0} b} - λ_{a_{0}, b - 1}^{(2)} f_{a_{0}, b - 1} (s) - μ_{a_{0}, b + 1}^{(2)} f_{a_{0}, b + 1} (s) = 1_{{b = b_{0}}}, b \in ℕ .

(14)

By Lemma A2 (Appendix A), $f_{a_{0} b} (s) = ϕ_{a_{0} b}^{(b_{0})} (s)$ where

ϕ_{a b}^{(m)} (s) = {\begin{cases} \frac{{(- 1)}^{m - b + 1} Y_{a b}}{μ_{a, m + 1}^{(2)} \prod_{i = 1}^{m + 1} x_{a i}} ϕ_{a m}^{(0)} (s), & if b \leq m \\ \frac{- Y_{a m}}{μ_{a, m + 1}^{(2)} \prod_{i = 1}^{m + 1} x_{a i}} ϕ_{a b}^{(0)} (s), & if b \geq m . \end{cases}

(15)

Next, we obtain formulae for approximating f_ab(s) recursively assuming that we already have evaluated f_a_−1,_b(s). Again, from (2), we have

(s + λ_{a b}^{(1)} + λ_{a b}^{(2)} + μ_{a b}^{(2)} + γ_{a b}) f_{a b} (s) - λ_{a, b - 1}^{(2)} f_{a, b - 1} (s) - μ_{a, b + 1}^{(2)} f_{a, b + 1} (s) = λ_{a - 1, b}^{(1)} f_{a - 1, b} (s) + γ_{a - 1, b + 1} f_{a - 1, b + 1} (s),

(16)

for b ∈ ℕ. We approximate f_ab(s) by solving a truncated version of (16) for 0 ≤ b ≤ B, where B is sufficiently large. The intuition of how to choose B follows from the observation that we want $\sum_{a = a_{0}}^{\infty} \sum_{b = B + 1}^{\infty} P_{a b}^{a_{0} b_{0}} (t)$ to be small. By Lemma A2 (Appendix A), we have the following approximation:

f_{a b} (s) \approx \sum_{m = 0}^{B} [λ_{a - 1, m}^{(1)} f_{a - 1, m} (s) + γ_{a - 1, m + 1} f_{a - 1, m + 1} (s)] ϕ_{a b}^{(m)} (s) .

(17)

Therefore, the transition probabilities of a birth/birth-death process can be computed recursively using the following Theorem:

Theorem 2

Let $ϕ_{a b}^{(m)} (s)$ be defined as in (11), (13), and (15). We have

P_{a b}^{a_{0} b_{0}} (t) = {\begin{cases} 0, & i f a < a_{0} \\ L^{- 1} [f_{a b} (s)] (t), & i f a \geq a_{0}, \end{cases}

(18)

where $f_{a_{0} b} (s) = ϕ_{a_{0} b}^{(b_{0})} (s)$ and

f_{a b} (s) \approx \sum_{m = 0}^{B} [λ_{a - 1, m}^{(1)} f_{a - 1, m} (s) + γ_{a - 1, m + 1} f_{a - 1, m + 1} (s)] ϕ_{a b}^{(m)} (s), a > a_{0} .

(19)

Here, ℒ⁻¹(.) denotes the inverse Laplace transform and B is the truncation level.

If the number of type 2 particles is bounded by B^*, we choose B = B^*. In this case, the approximation in Theorem 2 is exact. We prove that the output of our approximation scheme (19) converges to f_ab(s) as B goes to infinity in Appendix C. Further, the transition probability returned by Theorem 2 converges to the true transition probability. This truncation error can be bounded explicitly by extending the coupling argument in Crawford et al (2016) to multivariate processes. However, we leave it as a subject of future work because a complete treatment is beyond the scope of this paper.

2.1.3 Numerical approximation of the transitions probabilities

To approximate $P_{a b}^{a_{0} b_{0}} (t)$ using Theorem 2, we need to compute two quantities: the continued fractions $ϕ_{a b}^{(m)} (s)$ , and the inverse Laplace transform ℒ⁻¹ [f_ab(s)] (t). We efficiently evaluate the continued fractions $ϕ_{a b}^{(m)} (s)$ through the modified Lentz method (Lentz, 1976; Thompson and Barnett, 1986); see Appendix B for more details. This algorithm enables us to control for and limit truncation error. To approximate the inverse Laplace transform ℒ⁻¹ [f_ab(s)] (t), we apply the method proposed in Abate and Whitt (1992) using a Riemann sum:

L^{- 1} [f_{a b} (s)] (t) \approx \frac{e^{H / 2}}{2 t} R [f_{a b} (\frac{H}{2 t})] + \frac{e^{H / 2}}{t} \sum_{k = 1}^{\infty} {(- 1)}^{k} R [f_{a b} (\frac{H + 2 k π i}{2 t})] .

(20)

Here ℛ[z] is the real part of z and H is a positive real number. Abate and Whitt (1992) show that the error that arises in (20) is bounded by 1/(e^H − 1). Moreover, we can use the Levin transform (Levin, 1973) to improve the rate of convergence because the series in (20) is an alternating series when ℛ{f_ab[(H + 2kπi)/(2t)]} have the same sign. These numerical methods have been successfully applied by Crawford and Suchard (2012) to compute the transition probabilities of birth-death processes.

In practice, to handle situations where $μ_{a b}^{(2)}$ can possibly equal to 0 for some (a, b), we re-parametrize x_ab and y_ab as follows:

\begin{array}{l} x_{a 1} = 1; x_{a b} = - λ_{a, b - 2}^{(2)} μ_{a, b - 1}^{(2)}, b \geq 2, and \\ y_{a b} = s + λ_{a, b - 1}^{(1)} + λ_{a, b - 1}^{(2)} + μ_{a, b - 1}^{(2)} + γ_{a, b - 1}, b \geq 1. \end{array}

(21)

With this new parametrization, we obtain

ϕ_{a b}^{(m)} (s) = {\begin{cases} \frac{(\prod_{i = b + 1}^{m} μ_{a i}^{(2)}) Y_{a b}}{Y_{a, m + 1} + \frac{x_{a, m + 2} Y_{a m}}{y_{a, m + 2} + \frac{x_{a, m + 3}}{y_{a, m + 3} + \frac{x_{a, m + 4}}{y_{a, m + 4} + \dots}}}}, & if b \leq m \\ \frac{(\prod_{i = m + 1}^{b} λ_{a i}^{(2)}) Y_{a m}}{Y_{a, b + 1} + \frac{x_{a, b + 2} Y_{a b}}{y_{a, b + 2} + \frac{x_{a, b + 3}}{y_{a, b + 3} + \frac{x_{a, b + 4}}{y_{a, b + 4} + \dots}}}}, & if b \geq m . \end{cases}

(22)

Our complete algorithm to compute the transition probabilities of birth/birth-death processes is implemented in the function bbd_prob in a new R package called MultiBD. The function takes t, a₀, b₀, $λ_{a b}^{(1)}, λ_{a b}^{(2)}, μ_{a b}^{(2)}$ , γ_ab, A, B as inputs and returns the transition probability matrix ${P_{a b}^{a_{0} b_{0}} (t)}_{a_{0} \leq a \leq A, 0 \leq b \leq B}$ . Here, there is no requirement for A while B needs to be large enough such that $\sum_{a = a_{0}}^{A} \sum_{b = B + 1}^{\infty} P_{a b}^{a_{0} b_{0}} (t)$ is small. We can check to see if B is large enough by checking if $\sum_{a = a_{0}}^{A} P_{a B}^{a_{0} b_{0}} (t)$ is sufficiently small.

In practice, the computational complexity of evaluating each term (f_ab(s))_{a₀≤a≤A,0≤b≤B} is 𝒪((A − a₀)B²) because the Lentz algorithm terminates quickly. Let K be the number of iterations required by the Levin acceleration method (Levin, 1973) to achieve a certain error bound for the Riemann sum in (20). Then, the total complexity of our algorithm is 𝒪((A − a₀)B²K). However, evaluation of ${f_{a b} [(H + 2 k π i) / (2 t)]}_{k = 1}^{K}$ can be efficiently parallelized across different values of k, and we exploit this parallelism via multicore processing, delegating most of the computational work to compiled C++ code.

2.2 Death/birth-death processes

Similar to the birth/birth-death process, a death/birth-death process is also a special case of competition processes. The only difference is that the number of type 1 particles is decreasing instead of increasing. Mathematically, possible transitions of a death/birth-death process X(t) = (X₁(t), X₂(t)) during (t, t+dt) are:

\begin{array}{l} Pr {\begin{cases} X_{1} (t + d t) = a - 1 & X_{1} (t) = a \\ X_{2} (t + d t) = b & X_{2} (t) = b \end{cases}} & = μ_{a b}^{(1)} d t + o (d t) \\ Pr {\begin{cases} X_{1} (t + d t) = a & X_{1} (t) = a \\ X_{2} (t + d t) = b + 1 & X_{2} (t) = b \end{cases}} & = λ_{a b}^{(2)} d t + o (d t) \\ Pr {\begin{cases} X_{1} (t + d t) = a & X_{1} (t) = a \\ X_{2} (t + d t) = b - 1 & X_{2} (t) = b \end{cases}} & = μ_{a b}^{(2)} d t + o (d t) \\ Pr {\begin{cases} X_{1} (t + d t) = a - 1 & X_{1} (t) = a \\ X_{2} (t + d t) = b + 1 & X_{2} (t) = b \end{cases}} & = γ_{a b} d t + o (d t) \\ Pr {\begin{cases} X_{1} (t + d t) = a & X_{1} (t) = a \\ X_{2} (t + d t) = b & X_{2} (t) = b \end{cases}} & = 1 - (μ_{a b}^{(1)} + λ_{a b}^{(2)} + μ_{a b}^{(2)} + γ_{a b}) d t + o (d t), \end{array}

(23)

where $μ_{a b}^{(1)} \geq 0$ is the death rate of type 1 particles given a type 1 particles and b type 2 particles, $λ_{a b}^{(2)} \geq 0$ is the birth rate of type 2 particles, $μ_{a b}^{(2)} \geq 0$ is the death rate of type 2 particles, and γ_ab is the transition rate from type 1 particles to type 2 particles. Again, we fix $μ_{0, b}^{(1)} = λ_{a, - 1}^{(2)} = μ_{0, b}^{(2)} = γ_{0, b} = γ_{a, - 1} = 0$ .

Following a similar argument as in Section 2.1.1, we obtain a sufficient condition for regularity of a death/birth-death process. Denote

\begin{array}{l} D_{k} & = {(a, b) : a + b = k, a \leq a_{0}} \in N \times N \\ λ_{k} & = max_{(a, b) \in D_{k}} {λ_{a b}^{(2)}} \\ μ_{k} & = min_{(a, b) \in D_{k}} {μ_{a b}^{(1)} + μ_{a b}^{(2)}} \\ σ_{0} & = 1, σ_{k} = \frac{λ_{0} \dots λ_{k - 1}}{μ_{1} \dots μ_{k}}, \end{array}

(24)

where a₀ is the number of type 1 particles at time t = 0. The following Theorem is a direct application of Theorem 1 in Iglehart (1964)

Theorem 3

A sufficient condition for regularity of a death/birth-death process is

\sum_{k = 0}^{\infty} (\frac{1}{λ_{k} σ_{k}} \sum_{i = 0}^{k} σ_{i}) = \infty .

(25)

We note that if we do a transformation for a death/birth-death process X(t) = (X₁(t), X₂(t)) as follows:

\begin{array}{l} Y_{1} (t) & = a_{0} - X_{1} (t) \\ Y_{2} (t) & = B - X_{2} (t) . \end{array}

(26)

Then, Y(t) = (Y₁(t), Y₂(t)) can be considered as a birth/birth-death process. Therefore, the transition probabilities of a death/birth-death process can also be computed using the R function bbd_prob and the transformation (26). Again, we want to choose B such that $\sum_{a = 0}^{a_{0}} \sum_{b = B + 1}^{\infty} P_{a b}^{a_{0} b_{0}} (t)$ is small. We implement this procedure in the function dbd_prob in our R package MultiBD. The function takes t, a₀, b₀, $μ_{a b}^{(1)}, λ_{a b}^{(2)}, μ_{a b}^{(2)}$ , γ_ab, A, B as inputs and returns the transition probability matrix ${P_{a b}^{a_{0} b_{0}} (t)}_{A \leq a \leq a_{0}, 0 \leq b \leq B}$ . As for birth/birth-death processes, there is no requirement for A.

3 Applications

Birth(death)/birth-death processes are appropriate for modeling two-type populations where the size of the first population is monotonically increasing (decreasing). Here we examine our methods in four applications: a within-host macro-parasite model, a birth-death-shift model for transposable elements, monomolecular reaction systems, and the stochastic SIR epidemiological model. We demonstrate that a birth (death)/birth-death process well captures the dynamics of these common biological problems, and inference using its transition probabilities often outperforms existing approximations. In particular, we emphasize that the birth (death)/birth-death process approach allows us to compute finite-time transition probabilities in the stochastic SIR model that were previously considered unknown or intractable without model simplification (Cauchemez and Ferguson, 2008).

3.1 Monomolecular reaction systems

We illustrate the performance of our computational method by considering the following monomolecular reactions:

\begin{array}{l} Reaction & R_{a b} : A \overset{r_{a b}}{\to} B \\ Reaction & R_{b a} : B \overset{r_{b a}}{\to} A \\ Outflow & O_{b} : B \overset{o_{b}}{\to} * \end{array}

(27)

where r_ab, r_ba is the reaction rates, and o_b is the outflow rate. Denote

Q = (\begin{matrix} - r_{a b} & r_{b a} \\ r_{a b} & - r_{b a} - o_{b} \end{matrix}), p^{(a)} = e^{Q t} (\begin{matrix} 1 \\ 0 \end{matrix}), p^{(b)} = e^{Q t} (\begin{matrix} 0 \\ 1 \end{matrix}) .

By Theorem 1 in Jahnke and Huisinga (2007), the transition probabilities of the reaction system (27) at time t > 0 is

P_{a b}^{a_{0} b_{0}} (t) = M (., a_{0}, p^{(a)}) ★ M (., b_{0}, p^{(b)})

(28)

where ℳ(x, N, p) is the multinomial distribution and ★ denotes the convolution operator. As analytic expressions for transition probabilities exist for this class of reactions, this example serves as a baseline for comparison to assess the accuracy of our method.

To study these processes in our framework, let A(t) denote the total number of particle A at time t and L(t) be the total number of particle B leaving the system up to t. Then, {L(t), A(t)} is a birth/birth-death process with the following possible transitions during (t, t + dt):

\begin{array}{l} Pr {\begin{cases} L (t + d t) = i + 1 & L (t) = i \\ A (t + d t) = j & A (t) = j \end{cases}} & = o_{b} {(a_{0} + b_{0} - i - j)}^{+} d t + o (d t), \\ Pr {\begin{cases} L (t + d t) = i & L (t) = i \\ A (t + d t) = j + 1 & A (t) = j \end{cases}} & = r_{b a} {(a_{0} + b_{0} - i - j)}^{+} d t + o (d t), \\ Pr {\begin{cases} L (t + d t) = i & L (t) = i \\ A (t + d t) = j - 1 & A (t) = j \end{cases}} & = r_{a b} jdt + o (d t), and \\ Pr {\begin{cases} L (t + d t) = i & L (t) = i \\ A (t + d t) = j & A (t) = j \end{cases}} & = 1 - [r_{a b} j + (o b + r_{b a}) {(a_{0} + b_{0} - i - j)}^{+}] d t + o (d t) . \end{array}

Here x⁺ = max(0, x). Therefore, $P_{a b}^{a_{0} b_{0}} (t)$ can be computed using our method implemented in the R function bbd_prob.

We use bbd_prob to calculate ${P_{a b}^{20, 0} (1)}_{0 \leq a \leq 20, 0 \leq b \leq 20}$ of the reaction system (27) with r_ab = 2, r_ba = 0.5 and o_b = 1. The L₁ distance between our result and the analytic result (28) is less than 4.7 × 10⁻⁹, thus confirming the accuracy of our method compared to explicit analytic solutions.

3.2 Birth-death-shift model for transposable elements

Transposable elements or transposons are genomic sequences that can either duplicate, with a new copy moving to a new genomic location, move to a different genomic location, or be deleted from the genome. Rosenberg et al (2003) model the number of copies of a particular transposon using a linear birth-death-shift process; a birth is a duplication event, a death is a deletion event, and shift is a switching position event. Xu et al (2015) propose representing this birth-death-shift process by a linear multi-type branching process X(t) = (X_old(t), X_new(t)) tracking the number of occupied sites where X_old(t) is the number of initially occupied sites and X_new(t) is the number of newly occupied sites. Let λ, μ, and ν be the birth, death, and shift rates respectively. The transitions of X(t) during a small time interval occur with probabilities

\begin{array}{l} Pr {\begin{cases} X_{old} (t + d t) = x_{old} - 1 & X_{old} (t) = x_{old} \\ X_{new} (t + d t) = x_{new} & X_{new} (t) = x_{new} \end{cases}} & = (μ x_{old}) d t + o (d t), \\ Pr {\begin{cases} X_{old} (t + d t) = x_{old} & X_{old} (t) = x_{old} \\ X_{new} (t + d t) = x_{new} - 1 & X_{new} (t) = x_{new} \end{cases}} & = (μ x_{new}) d t + o (d t), \\ Pr {\begin{cases} X_{old} (t + d t) = x_{old} & X_{old} (t) = x_{old} \\ X_{new} (t + d t) = x_{new} + 1 & X_{new} (t) = x_{new} \end{cases}} & = λ (x_{old} + x_{new}) d t + o (d t), \\ Pr {\begin{cases} X_{old} (t + d t) = x_{old} - 1 & X_{old} (t) = x_{old} \\ X_{new} (t + d t) = x_{new} + 1 & X_{new} (t) = x_{new} \end{cases}} & = (ν x_{old}) d t + o (d t), and \\ Pr {\begin{cases} X_{old} (t + d t) = x_{old} & X_{old} (t) = x_{old} \\ X_{new} (t + d t) = x_{new} & X_{new} (t) = x_{new} \end{cases}} & = 1 - (μ + λ + ν) x_{old} - (μ + λ) x_{new} d t + o (d t) . \end{array}

(29)

Equivalent to the branching process representation, notice that in this case X(t) is also a death/birth-death process. Hence, we can effectively compute its transition probabilities. In contrast, Xu et al (2015) consider the probability generating function

Φ_{a_{0} b_{0}} (t, s_{1}, s_{2}) = E (s_{1}^{X_{old} (t)} s_{2}^{X_{new} (t)} ∣ X_{old} (0) = a_{0}, X_{new} (0) = b_{0}) = \sum_{a = 0}^{\infty} \sum_{b = 0}^{\infty} P_{a b}^{a_{0} b_{0}} (t) s_{1}^{a} s_{, 2}^{b}

(30)

where

P_{a b}^{a_{0} b_{0}} (t) = Pr {\begin{cases} X_{old} (t) = a & X_{old} (0) = a_{0} \\ X_{new} (t) = b & X_{new} (0) = b_{0} \end{cases}} .

(31)

Because of the model-specific linearity in terms of a and b of the birth and death rates, one can evaluate Φ_jk(t, s₁, s₂) by solving an ordinary differential equation. Further transforming s₁ = e^2πiw₁, s₂ = e^2πiw₂, the generating function becomes a Fourier series

Φ_{a_{0} b_{0}} (t, e^{2 π i w_{1}}, e^{2 π i w_{2}}) = \sum_{a = 0}^{\infty} \sum_{b = 0}^{\infty} P_{a b}^{a_{0} b_{0}} (t) e^{2 π {iaw}_{1}} e^{2 π {ibw}_{2}}

(32)

Therefore, Xu et al (2015) retrieve the transition probabilities through approximating the integral as a Riemann sum

\begin{array}{l} P_{a b}^{a_{0} b_{0}} (t) = \int_{0}^{1} \int_{0}^{1} Φ_{a_{0} b_{0}} (t, e^{2 π i w_{1}}, e^{2 π i w_{2}}) e^{- 2 π {iaw}_{1}} e^{- 2 π {ibw}_{2}} d w_{1} d w_{2} \\ \approx \frac{1}{H_{2}} \sum_{u = 0}^{H - 1} \sum_{v = 0}^{H - 1} Φ_{j k} (t, e^{2 π i u / H}, e^{2 π i v / H}) e^{- 2 π iau / H} e^{- 2 π ibv / H}, \end{array}

(33)

and show that choosing H as the smallest power of 2 greater than max(a, b) produces accurate estimates of the true transition probabilities of the model. The authors implement this method in the R package bdsem. Using their method, evaluating ${P_{a b}^{a_{0} b_{0}} (t)}_{0 \leq a, b \leq H}$ requires numerically solving H² linear ordinary differential equations (ODEs). We perform a simulation to compare the performance between bdsem and our function dbd_prob. Because Xu et al (2015) already provide a thorough empirical validation that bdsem produces accurate transition probabilities compared to Monte Carlo estimates from the true model, we consider a comparison to their method and omit a complete reproduction of their simulation study. Using both routines to compute the transition probabilities of a birth-death-shift process with rates λ = 0.0188, μ = 0.0147, ν = 0.00268 (estimated from the IS6110 data by Rosenberg et al (2003)) repeatedly over one hundred trials leads to a negligible difference in estimated probabilities. Specifically, we computed ${P_{a b}^{10, 0} (t)}_{0 \leq a \leq 10, 0 \leq b \leq 50}$ at three different observation period lengths t = 1, 5, 10, and found that the L₁ distance between probabilities estimated by each method is less than 4 × 10⁻⁸ across all cases. Here, the L₁ distance between two matrices U = (u_ij) and V = (v_ij) are defined as ||U − V|| = Σ_i_,_j |u_ij − v_ij|.

Having validated the accuracy of our approach, we turn to a runtime comparison. The ratios of CPU time required using bdsem compared to dbd_prob are summarized in Figure 1, and note that this result is obtained using a single-thread option for dbd_prob. We see that dbd_prob is about 15 to 30 times faster than the bdsem implementation, while producing very similar results.

Fig. 1 — CPU compute time ratios of `bdsem` to `dbd_prob` over 100 replications.

While there is a large performance difference in wall clock time, we cannot immediately conclude that our method is faster then the method in Xu et al (2015) because computation time may depend heavily on implementation. Nonetheless, we can make some remarks about the performance of both methods that are platform-independent. Notably, the bdsem implementation grows slower as t increases while dbd_prob does not. This is expected because solving ODEs is slower when the domain increases. However, it is worth mentioning that we can use the solution paths to get the solutions of these ODEs at other time points in the domain. For example, when we solve the ODEs at t = 10, we also get the solutions at t = 1 and 5 for free. This point becomes important in applications where we need to compute the transition probabilities at several time points. Another downside of bdsem is that it computes ${P_{a b}^{10, 0} (t)}_{0 \leq a, b \leq 50}$ instead of evaluating ${P_{a b}^{10, 0} (t)}_{0 \leq a \leq 10, 0 \leq b \leq 50}$ directly as is done by dbd_prob.

3.3 Within-host macro-parasite model

Riley et al (2003) posit a stochastic model to describe a within-host macro-parasite population where Brugia pahangi is the parasite and Felis catus is the host. Brugia pahangi is closely related to Brugia malayi which infects millions of people in South and Southeast Asia. The model tracks the number of B. pahangi larvae L(t), the number of mature parasites M(t), and hosts experience of infection I(t) at time t. The dynamics of {L(t), M(t), I(t)} follow a system of differential equations:

\begin{array}{l} \frac{d L}{d t} (t) & = - μ_{L} L (t) - β I (t) L (t) - γ L (t), \\ \frac{d M}{d t} (t) & = γ L (t) - μ_{M} M (t), and \\ \frac{d I}{d t} (t) & = ν L (t) - μ_{I} I (t) \end{array}

(34)

where μ_L is the natural death rate and γ is the maturation rate of larvae; β is the death rate of larvae due to the immune response from the host; μ_M is the death rate of mature parasites; ν is the acquisition rate and μ_I is the loss rate of immunity.

Drovandi and Pettitt (2011) propose a simplification of this model by applying a pseudoequilibrium assumption for immunity, such that the immunity is constant over time. Under this pseudoequilibrium assumption, the dynamics of {L(t), M(t)} becomes

\begin{array}{l} \frac{d L}{d t} (t) & = - μ_{L} L (t) - η {[L (t)]}^{2} - γ L (t), and \\ \frac{d M}{d t} (t) & = γ L (t) - μ_{M} M (t) \end{array}

(35)

where η = βν/μ_I. We illustrate the dynamic of (35) in Figure 2. The corresponding stochastic formulation of this model is:

\begin{array}{l} Pr {\begin{cases} L (t + d t) = i - 1 & L (t) = i \\ M (t + d t) = j + 1 & M (t) = j \end{cases}} & = (γ i) d t + o (d t), \\ Pr {\begin{cases} L (t + d t) = i - 1 & L (t) = i \\ M (t + d t) = j & M (t) = j \end{cases}} & = (μ_{L} i + η i^{2}) d t + o (d t), \\ Pr {\begin{cases} L (t + d t) = i & L (t) = i \\ M (t + d t) = j - 1 & M (t) = j \end{cases}} & = (μ_{M} j) d t + o (d t), and \\ Pr {\begin{cases} L (t + d t) = i & L (t) = i \\ M (t + d t) = j & M (t) = j \end{cases}} & = 1 - (γ i + μ_{L} i + η i^{2} + μ_{M} j) d t + o (d t) . \end{array}

(36)

Fig. 2 — The dynamic of {L(t), M(t)} under the deterministic model (35) with *μ_L* = 0.0682, *μ_M* = 0.0015, η = 0.0009, γ = 0.04 and {L(0), M(0)} = {100, 0}.

Notably, {L(t), M(t)} follow a death/birth-death process.

For this model, γ and μ_M has been estimated at 0.04 and 0.0015 previously (see Drovandi and Pettitt, 2011, for more details). To estimate the remaining parameters, Drovandi and Pettitt (2011) examine the number of mature parasites at host autopsy time (at most 400 days) of those injected with approximately 100 juveniles, assume a priori μ_L and η are uniform[0,1) and apply ABC to draw inference because the traditional matrix exponentiation method is computationally prohibitive here. The basic idea of ABC involves sampling from an approximate posterior distribution

f (θ, Y ∣ ρ (Y, Y_{s}) \leq ε) \propto f (Y_{s} ∣ θ) π (θ) 1_{ρ (Y, Y_{s}) \leq ε},

(37)

where θ is the vector of unknown parameters, ε > 0 is an ad hoc tolerance, and ρ(Y, Y_s) is a discrepancy measure between summary statistics of the observed data Y and the simulated data Y_s. Because the sufficient statistics are not available for this problem, the authors use a goodness-of-fit statistic. However, the ABC method suffers from loss of information because of non-zero tolerance and non-sufficient summary statistics (Sunnåker et al, 2013). Therefore, credible intervals obtained by the ABC approach are potentially inflated (Csilléry et al, 2010).

In contrast, our method makes direct likelihood computation and in turn evaluation of the posterior density feasible. Figure 3 displays a visualization of the posterior density surface of (log μ_L, log η) computed using our method, given the collection of numbers of mature parasites M(t) at autopsy under this model (see Drovandi and Pettitt, 2011, for more details about the data). Importantly in this example, we are able to efficiently integrate out the unobserved larvae counts L(t) at autopsy. The approximate estimate obtained by Drovandi and Pettitt (2011) using ABC is overlaid on this density surface for comparison, and does not align with the highest density region of our computed posterior. Note that the posterior is flat when η is close to 0, and has an unusual tail toward the region where the ABC estimate lies. This suggests that the previous ABC approach fails to explore the region with high posterior probability well, likely due to loss of information incurred by the method, resulting in a poor estimate from the data.

Fig. 3 — Posterior density surface of (log *μ_L*, log η) under within-host macro-parasite model. The “×” symbol represents the estimate from Drovandi and Pettitt (2011) using the ABC method.

Finally, we consider this example toward a second runtime comparison between our method and Expokit, a state-of-the-art matrix exponentiation package with efficient implementation. In particular, we compute the transition probability matrix ${P_{i j}^{100, 0} (t)}_{0 \leq i \leq 100, 0 \leq j \leq 100}$ of {L(t),M(t)} with μ_L = 0.0682, μ_M = 0.0015, η = 0.0009, γ = 0.04 at t = 100, 200, 400 using our function dbd_prob and the function expv in expoRkit, an R-interface to the Fortran package Expokit. Both methods produce similar results: the L₁ distance between the two estimated transition probability matrices is less than 3 × 10⁻⁹ across all cases. In terms of speed, we see that dbd_prob is roughly twice as fast as expv when t = 100, 200, but about 9-fold faster when t = 400 (Figure 4). It is worth mentioning that dbd_prob can be further accelerated via parallelization.

Fig. 4 — CPU compute time ratios of `expv` to `dbd_prob` over 100 replications.

3.4 Stochastic SIR model in epidemiology

McKendrick (1926) models the spread of an infectious disease in a closed population by dividing the population into three categories: susceptible persons (S), infectious persons (I) and removed persons (R). Since the population is closed, the total population size N obeys the conservation equation N = S(t)+I(t)+R(t) for all time t. The deterministic dynamics of these three subpopulations follow a system of nonlinear ordinary differential equations (Kermack and McKendrick, 1927):

\begin{array}{l} \frac{d S}{d t} (t) & = - β S (t) I (t), \\ \frac{d I}{d t} (t) & = β S (t) I (t) - α I (t), and \\ \frac{d R}{d t} (t) & = α I (t), \end{array}

(38)

where α > 0 is the removal rate and β > 0 is the infection rate of the disease. This system of equations cannot be solved analytically, but we can obtain its solution numerically. An important quantity for the SIR model is the basic reproduction number R₀ = βN/α (Earn, 2008). This quantity determines whether a spread of an infectious disease becomes an epidemic. In particular, an epidemic can only occur when R₀ > 1.

Unfortunately, the deterministic model is not suitable when the community is small (Britton, 2010). In these situations, the original stochastic SIR model (McKendrick, 1926) becomes more appropriate. Moreover, Andersson and Britton (2000) argue that stochastic epidemic models are preferable when their analysis is possible because (1) stochastics are the most natural way to describe a spread of diseases, (2) some phenomena do not satisfy the law of large numbers and can only be analyzed in the stochastic setting (for example, the extinction of endemic diseases only occurs when the epidemic process deviates from its expected value), and (3) quantifying the uncertainty in estimates requires stochastic models. Nonetheless, one can bypass Andersson and Britton’s third argument by imposing random sampling errors around the deterministic compartments. Therefore, it is important to distinguish between the deterministic SIR model with sampling errors and the stochastic SIR model.

Without loss of generality, the stochastic SIR model needs only track S(t) and I(t) because S(t) + I(t)+R(t) remains constant. All possible transitions of {S(t), I(t)} during a small time interval (t, t+dt) occur with probabilities

\begin{array}{l} Pr {\begin{cases} S (t + d t) = s & S (t) = s \\ I (t + d t) = i - 1 & I (t) = i \end{cases}} & = (α i) d t + o (d t), \\ Pr {\begin{cases} S (t + d t) = s - 1 & S (t) = s \\ I (t + d t) = i + 1 & I (t) = i \end{cases}} & = (β s i) d t + o (d t), and \\ Pr {\begin{cases} S (t + d t) = s & S (t) = s \\ I (t + d t) = i & I (t) = i \end{cases}} & = 1 - (α i + β s i) d t + o (d t) . \end{array}

(39)

We see that {S(t), I(t)} is a death/birth-death process with $μ_{s i}^{(1)} = λ_{s, i}^{(2)} = 0, μ_{s,}^{(2)} = α i$ , γ_si = βsi.

Due to the interaction between populations and nonlinear nature of the model, mechanistic analysis of the stochastic SIR model is difficult, and the lack of an expression for transition probabilities has been a bottleneck for statistical inference. Renshaw (2011) remarks that while one can write out the Kolmogorov forward equation for the system, the “associated mathematical manipulations required to generate solutions can only be described as heroic.” Instead, the majority of efforts involve either simulation based methods or simplifications and tractable approximations to the SIR model. For instance, the stochastic SIR model can be analyzed using ABC (McKinley et al, 2009), but we have already mentioned limitations of this approach. Particle filter methods can be used to analyze SIR models within maximum likelihood (Ionides et al, 2006, 2015) and Bayesian frameworks (Andrieu et al, 2010; Dukic et al, 2012), but these methods are computationally very demanding and often suffer from convergence problems. When examining large epidemics, to make the likelihood tractable it is reasonable to apply a continuous approximation to the large populations, modeled as a diffusion process with exact solutions (Cauchemez and Ferguson, 2008). However, such an approach is a poor proxy for the SIR model when observed counts are low. When data are collected at regular intervals and coincide with disease generation timescales, it is also possible to study discrete-time epidemic models— the time-series SIR (TSIR) model is one well-known example (Finkenstädt and Grenfell, 2000). However, these simplifications also have their shortcomings, relying on the relatively strong assumption that populations are constant over each interval between observation times.

In the death/birth-death framework, our method enables practical computation of these quantities without any simplifying model assumptions. In Section 4, we will apply our method to analyze the population of Eyam during the plague of 1666 (Raggett, 1982) to estimate the infection and the death rates of this disease, using the death/birth-death transition probabilities within a Metropolis-Hastings algorithm. Here, we first examine the accuracy of these transition probabilities themselves. We compare the continued fraction method to empirical transition probabilities obtained via simulation from the true model as ground-truth, and to a new two-type branching approximation to the SIR model introduced below. The branching process approximation is appropriate when transition probabilities need to be computed for short time intervals, and its simple expressions for transition probabilities enable much more efficient computation. However, we show that as transition time intervals increase, the branching approximation becomes less accurate, while the transition probabilities computed under the death/birth-death model remain very accurate.

While branching processes fundamentally rely on independence of each member of the population, we can nonetheless make a fair approximation by mimicking the interaction effect of infection over short time intervals. In the branching model, let X₁(t) denote the susceptible population and X₂(t) denote the infected population at time t, with details and derivation included in Appendix D. Over any time interval [t₀, t₁), we use the initial population X₂(0) as a constant scalar for the instantaneous rates. This branching process model has instantaneous infection rate βX₂(0)X₁(t) and recovery rate αX₂(t) for all t ∈ [t₀, t₁), closely resembling the true SIR model rates, with the exception of fixing X₂(0) in place of X₂(t) in the rate of infection. This constant initial population fixes a piecewise homogeneous per-particle birth rate to satisfy particle independence while mimicking interactions, but notice that both populations can change over the interval, offering much more flexibility than models such as TSIR that assume constant populations and rates between discrete observations.

This branching model admits closed-form solutions to the transition probabilities that can be evaluated quickly and accurately. The transition probabilities of the two-type branching approximation to the SIR model over any time interval of length t are given by

Pr {X (t + τ) = (k, l) ∣ X (τ) = (m, n)} : = P_{k l}^{m n} (t) = \sum_{i = 0}^{l} (\begin{matrix} l \\ i \end{matrix}) A (l - i) B (i),

(40)

where

\begin{array}{l} B (i) & = 0 for all i \geq n, otherwise, \\ B (i) & = \frac{n!}{(n - i)!} {(1 - e^{- α t})}^{n - i} e^{- i α t} \end{array}

(41)

and

\begin{array}{l} A (l - i) & = 0 for all (l - i) \geq (m - k), otherwise, \\ A (l - i) & = \frac{m!}{(m - k - (l - i))!} e^{- k β n t} {[1 - \frac{β n}{β n - α} e^{- α t} - (1 - \frac{β n}{β n - α}) e^{- β n t}]}^{m - k - (l - i)} \times {[\frac{β n}{β n - α} (e^{- α t} - e^{- β n t})]}^{l - i} . \end{array}

(42)

The sum over products of expressions (41) and (42) in equation (40) may look unwieldy, but this sum is computed extremely quickly with a vectorized implementation, and with high degrees of numerical stability. In settings when such a model is appropriate and (X₁(t),X₂(t)) ≈ (S(t), I(t)), the branching approximation can offer a much more computationally efficient alternative to the continued fraction method.

3.5 Transition probabilities of the SIR model

Figure 5 provides a comparison between methods of computing transition probabilities. Included are transition probabilities corresponding to the nine pairs of system states {(m, n), (k, l)}_j, j = 1, …, 9, such that $P_{k l}^{m n} (0.5)$ is largest. Fixing these indices, we plot the set of probabilities { $P_{k l}^{m n} (t)$ } while varying t between 0.1 and 1.0. We see that transition probabilities computed using the continued fraction method under the death/birth-death model very closely match those computed empirically via simulation from the model, taken to be the ground truth. Almost all such probabilities in Figure 5 fall within the 95% confidence interval, while the branching process transitions follow a similar shape over time, but fall outside of the confidence intervals for many observation intervals. An additional heatmap visualization comparing the support of transition probabilities is included in the Appendix, and shows that the branching approximation is accurate with similar support to the empirical transition probabilities for a shorter time interval of length t = 0.5, but becomes visibly further from the truth when we increase the observation length to t = 1.0.

Fig. 5 — The plot above displays the values of the nine largest transition probabilities when t = 0.5 as we vary t from 0.1, …, 1.0. Parameters used to generate data are initialized at I₀ = 15, S₀ = 110, α = 3.2, β = 0.025. Empirical Monte Carlo 95% confidence intervals over 150, 000 simulations from the true model are depicted in orange. Probabilities computed using the continued fraction expansion are depicted by purple triangles, while probabilities computed under the branching approximation are denoted by green squares.

4 The Plague in Eyam revisited

We revisit the outbreak of plague in Eyam, a village in the Derbyshire Dales district, England, over the period from June 18th to October 20th, 1666. This plague outbreak is widely accepted to originate from the Great Plague of London, that killed about 15% of London’s population at that time. To prevent further spread of the plague after infestation, the Eyam villagers did not escape the village, instead isolating themselves from the outside world. At the end of this horrific event, only 83 people had survived out of an initial population of 350. We summarize data recording the spread of the disease (Raggett, 1982) in Table 1. As mentioned in Raggett (1982), this data are obtained by counting the number of deaths from the dead list and estimating the infective population from the list of future deaths assuming a fixed length of illness prior to death. Then, the susceptible population can be computed easily because the the town is isolated.

Table 1.

Susceptible and infectious population size in Eyam from June 18th to October 20th, 1666.


	Time (months)
	0	0.5	1	1.5	2	2.5	3	4
Susceptible population	254	235	201	153	121	110	97	83
Infective population	7	14	22	29	20	8	8	0

Open in a new tab

Raggett (1982) analyzes these data using the stochastic SIR model (39). In this model, α is the unknown death rate of infective people and β is the unknown infection rate of the plague. The author uses a simple approximation method for the forward differential equation and comes up with a point estimate (α̂, β̂) = (3.39, 0.0212). We take a Bayesian approach to re-analyze these data.

With n observations ${(s_{k}, i_{k})}_{k = 1}^{n}$ at time ${t_{k}}_{k = 1}^{n}$ , the log of the likelihood function is:

log l (α, β ∣ {(s_{k}, i_{k})}_{k = 1}^{n}) = \sum_{k = 1}^{n - 1} log Pr {\begin{cases} S (t_{k + 1}) = s_{k + 1} & S (t_{k}) = s_{k} \\ I (t_{k + 1}) = i_{k + 1} & I (t_{k}) = i_{k} \end{cases}} .

(43)

Because {S(t), I(t)} is a death/birth-death process, the individual transition probabilities can be computed efficiently using our continued fraction method. Hence, the log of the likelihood (43) can be computed easily. Since α and β are non-negative, we opt to use log α and log β as our model parameters and assume a priori that log α ~ 𝒩(μ = 0, σ = 100) and log β ~ 𝒩(μ = 0, σ = 100). We explore the posterior distribution of (log α, log β) using a random-walk Metropolis algorithm implemented in the R function MCMCmetrop1R from package MCMCpack (Martin et al, 2011). We start the chain from Raggett’s estimated value (log(3.39), log(0.0212)) and run it for 100000 iterations. We discard the first 20000 iterations and summarize the posterior distribution of (α, β) using the remaining iterations. We illustrate the density of this posterior distribution in Figure 6(a). The posterior mean of α is 3.22 and the 95% Bayesian credible interval for α lies in (2.69, 3.82). Those corresponding quantities for β are 0.0197 and (0.0164, 0.0234). Notice that our credible intervals include the point estimate (α̂, β̂) = (2.73, 0.0178) from Brauer (2008) using the deterministic SIR model and Raggett’s point estimate (α̂, β̂) = (3.39, 0.0212).

Fig. 6 — Posterior distributions (log scale) of the death rate α and the infection rate β during the plague of Eyam in 1666. The “+” symbol represents the estimate from Brauer (2008) using the deterministic SIR model, and the “×” symbol represents the Raggett’s point estimate.

We also apply the two-type branching approximation to compute the log of the likelihood (43). Using the same random-walk Metropolis algorithm as before, we explore the posterior distribution of (α, β) and visualize it in Figure 6(b). The posterior mean of α is 3.237 and the 95% Bayesian credible interval for α is (2.7, 3.84), while those quantities for β are 0.02 and (0.0171, 0.023). Although the posterior means and the 95% Bayesian credible intervals are similar to ones from the continued fraction method, we see in Figure 6(b) that this method fails to fully capture the posterior correlation structure between α and β.

The posterior distribution of the basic reproduction number R₀ from the continued fraction method and from the branching approximation method are similar (Figure 7). The posterior mean of R₀ from the continued fraction method is 1.61 and from the branching approximation method is 1.62. The estimate for R₀ from Brauer (2008) is 1.7, from Raggett (1982) is 1.63. These estimates are similar, and in particular the branching approximation estimate is very close to that under the continued fraction method, offering a very efficient way to provide reasonable estimates of quantities such as R₀ despite being less accurate than the continued fraction approach.

Fig. 7 — Posterior distribution of the basic reproduction number R₀ (solid line: continued fraction method, dashed line: branching approximation method). The “+”, and the “×” symbols represent the estimate of R₀ from Brauer (2008), and from Raggett (1982) respectively.

From the results, we can see that estimates of R₀ from different methods are roughly the same while estimates of α and β are different. Although the basic reproduction number R₀ is an important quantity in the SIR model, it is not the only parameter driving the dynamic of the epidemic. Correia-Gomes et al (2014) demonstrated the important of accurately estimating the transmission parameters between compartments of the SIR model for Salmonella Typhimurium in pigs.

5 Discussion

Likelihood-based inference for bivariate continuous-time Markov processes is usually restricted to very small state spaces due to the computational bottleneck of transition probability calculation. In this paper, we provide tools for likelihood-based inference for birth(death)/birth-death processes by developing an efficient method to compute their transition probabilities. We provide a complete implementation of the algorithms to compute these transition probabilities in a new R package called MultiBD. Our functions employ sophisticated tools including continued fractions, the modified Lentz method, the method of Abate and Whitt for approximate inverse Laplace transforms, and the Levin acceleration method. Moreover, these methods are naturally amenable to parallelization, and we exploit multicore processing to speed up the algorithm. We remark that birth(death)/birth-death processes remain a limited subclass of general multivariate birth-death processes. For example, many population biology problems require a full bivariate birth-death process including predator-prey models (Hitchcock, 1986; Owen et al, 2015) and the SIR model with vital dynamics (Earn, 2008). Unfortunately, efficiently computing the transition probabilities of multivariate birth-death processes remains an open problem. Solving this problem will enable numerically stable statistical inference under birth-death processes and will be worth the “heroic” effort (Renshaw, 2011).

Fig. 8 — Heatmap visualizations of transition probabilities near the region of support across methods for t = 0.5, 1. We see that the branching approximation is noticeably different from the Monte Carlo ground truth when we increase t to 1, while the continued fraction approach remains accurate.

Acknowledgments

This work was partially supported by the National Institutes of Health (R01 HG006139, R01 AI107034, and U54 GM111274) and the National Science Foundation (IIS 1251151, DMS 1264153, DMS 1606177). We thank Christopher Drovandi, Edwin Michael, and David Denham for access to the Brugia pahangi count data.

A Continued fractions

In this section, we give some basic definitions and properties related to continued fractions.

Definition A1

A continued fraction ϕ₀ is a scalar quantity expressed in

ϕ_{0} = \frac{x_{1}}{y_{1} + \frac{x_{2}}{y_{2} + \frac{x_{3}}{y_{3} + \dots},}}

(A.1)

where ${x_{i}}_{i = 1}^{\infty}$ and ${y_{i}}_{i = 1}^{\infty}$ are infinite sequences of complex numbers.

Definition A2

The n^th convergent of ϕ₀ is

\frac{X_{n}}{Y_{n}} = \frac{x_{1}}{y_{1} + \frac{x_{2}}{y_{2} + \frac{x_{3}}{y_{3} + \dots + \frac{x_{n}}{y_{n}} .}}}

(A.2)

Definition A3

We define the corresponding sequence ${ϕ_{n}}_{n = 0}^{\infty}$ of a continued fraction (A.1) by the following recurrence formulae

\begin{array}{l} ϕ_{1} = x_{1} - y_{1} ϕ_{0}, and \\ ϕ_{n} = x_{n} ϕ_{n - 2} - y_{n} ϕ_{n - 1} for n \geq 2. \end{array}

(A.3)

Murphy and O’Donohoe (1975) provided the following sufficient condition for the convergence of (A.1):

Lemma A1

Assume that there exists N such that inf_n_>_N |Y_n| > 0 and lim_n_→∞ ϕ_n = 0. Then, the continued fraction (A.1) is convergent. Moreover,

ϕ_{n} = \prod_{i = 1}^{n} x_{i} \frac{x_{n + 1}}{Y_{n + 1} + \frac{x_{n + 2} Y_{n}}{y_{n + 2} + \frac{x_{n + 3}}{y_{n + 3} + \frac{x_{n + 4}}{y_{n + 4} + \dots} .}}}

(A.4)

Now, if we consider a more general recurrence formulae

\begin{array}{l} ϕ_{1}^{(m)} & = - y_{1} ϕ_{0}^{(m)} + k_{1} 1_{{m = 0}} \\ ϕ_{n}^{(m)} & = x_{n} ϕ_{n - 2}^{(m)} - y_{n} ϕ_{n - 1}^{(m)} + k_{m + 1} 1_{{m = n - 1}} for n \geq 2, \end{array}

(A.5)

then under the assumption of Lemma A1, we have the following lemma:

Lemma A2

The solution for (A.5) is

ϕ_{n}^{(m)} = {\begin{cases} \frac{{(- 1)}^{m - n} k_{m + 1}}{\prod_{i = 1}^{m + 1} x_{i}} Y_{n} ϕ_{m}, & i f n \leq m \\ \frac{k_{m + 1}}{\prod_{i = 1}^{m + 1} x_{i}} Y_{m} ϕ_{n}, & i f n \geq m . \end{cases}

(A.6)

B Modified Lentz method

Modified Lentz method (Lentz, 1976; Thompson and Barnett, 1986) is an efficient algorithm to finitely approximate the infinite expression of the continued fraction ϕ₀ in (A.1) to within a prescribed error tolerance. Let $ϕ_{0}^{(n)}$ be the n^th convergence of ϕ₀, that is $ϕ_{0}^{(n)} = X_{n} / Y_{n}$ . The main idea of Lentz’s algorithm lies in using the ratios

A_{n} = \frac{X_{n}}{X_{n - 1}} and B_{n} = \frac{Y_{n - 1}}{Y_{n}}

(B.1)

to stabilize the computation of $ϕ_{0}^{(n)}$ . We can calculate A_n, B_n, and $ϕ_{0}^{(n)}$ recursively as follows:

\begin{array}{l} A_{n} & = y_{n} + \frac{x_{n}}{A_{n - 1}} \\ B_{n} & = \frac{1}{y_{n} + x_{n} B_{n - 1}} \\ ϕ_{0}^{(n)} & = ϕ_{0}^{(n - 1)} A_{n} B_{n} . \end{array}

(B.2)

If $ϕ_{0}^{(n)}$ converges to ϕ₀, then Craviotto et al (1993) show that

| ϕ_{0}^{(n)} - ϕ_{0} | \leq \frac{∣ Y_{n} / Y_{n - 1} ∣}{I [Y_{n} / Y_{n - 1}]} | ϕ_{0}^{(n)} - ϕ_{0}^{(n - 1)} | = \frac{∣ 1 / B_{n} ∣}{I [1 / B_{n}]} | ϕ_{0}^{(n)} - ϕ_{0}^{(n - 1)} |,

(B.3)

where ℐ[Y_n/Y_n−₁] is the imaginary part of Y_n/Y_n−₁ and is assumed to be non-zero. Hence, the Lentz’s algorithm terminates when

\frac{∣ 1 / B_{n} ∣}{I [1 / B_{n}]} | ϕ_{0}^{(n)} - ϕ_{0}^{(n - 1)} |

(B.4)

is small enough. However, A_n and B_n can equal zero themselves and cause problem. Hence, Thompson and Barnett (1986) propose a modification for Lentz’s algorithm by setting A_n and B_n to a very small number, such as 10⁻¹⁶, whenever they equal zero. In practice, the algorithm often terminates after small number of iterations. However, in some rare cases where the numerical computation is unstable, it might take too long before the algorithm terminates. So, we set a predefined maximum number of iterations H as a fallback for these cases.

C Convergence results of increasing the truncation level

Let $f_{a b}^{(B)} (s)$ be the output of the approximation scheme (19) in Theorem 2. In this section, we prove that $f_{a b}^{(B)} (s)$ converges to f_ab(s) as B goes to infinity. To do so, let us consider a truncated birth/birth-death process $X^{(B)} (t) = (X_{1}^{(B)} (t), X_{2}^{(B)} (t))$ at truncation level B such that it executes the same process as X(t) on the state {a₀, a₀ + 1, a₀ + 2,…} × {0, 1, 2, …, B} except that $λ_{a B}^{(2)} = 0$ . Define $P_{a b}^{a_{0} b_{0}, (B)} (t)$ be the transition probabilities of X⁽^B⁾(t) and T_B be the hitting time at which X₂(t) first reach state B + 1. For any set S ⊂ ℕ², we have

\begin{array}{l} Pr (X (t) \in S) = Pr (X (t) \in S ∣ T_{B} > t) Pr (T > t) + Pr (X (t) \in S ∣ T_{B} \leq t) Pr (T_{B} \leq t) \\ = Pr (X^{(B)} (t) \in S) Pr (T_{B} > t) + Pr (X (t) \in S ∣ T_{B} \leq t) Pr (T_{B} \leq t) \\ = Pr (X^{(B)} (t) \in S) + [Pr (X (t) \in S ∣ T_{B} \leq t) - Pr (X^{(B)} (t) \in S)] Pr (T_{B} \leq t) \end{array}

Therefore | Pr(X(t) ∈ S) − Pr(X⁽^B⁾(t) ∈ S)| ≤ Pr(T_B ≤ t). Note that $f_{a b}^{(B)} (s)$ is the Laplace transform of $P_{a b}^{a_{0} b_{0}, (B)} (t)$ . Hence

∣ f_{a b}^{(B)} (s) - f_{a b} (s) ∣ \leq \int_{0}^{\infty} ∣ P_{a b}^{a_{0} b_{0}, (B)} (t) - P_{a b}^{a_{0} b_{0}} (t) ∣ e^{- s t} d t \leq \int_{0}^{\infty} P r (T_{B} \leq t) e^{- s t} d t

By Dominated convergence theorem and the fact that lim_B_→∞ Pr(T_B ≤ t) = 0, we deduce that ${lim}_{B \to \infty} f_{a b}^{(B)} (s) = f_{a b} (s)$ .

D Branching SIR approximation

Here we derive and solve the Kolmogorov backward equations of the two-type branching process necessary for evaluating the probability generating functions (PGFs) whose coefficients yield transition probabilities.

D.1 Deriving the PGF

Our two-type branching process is represented by a vector (X₁(t),X₂(t)) that denotes the numbers of particles of two types at time t. Let the quantities a₁(k, l) denote the rates of producing k type 1 particles and l type 2 particles, starting with one type 1 particle, and a₂(k, l) be analogously defined but beginning with one type 2 particle. Given a two-type branching process defined by instantaneous rates a_i(k, l), denote the following pseudo-generating functions for i = 1, 2 as

u_{i} (s_{1}, s_{2}) = \sum_{k} \sum_{l} a_{i} (k, l) s_{1}^{k} s_{2}^{l} .

(D.1)

We may expand the probability generating functions in the following form:

\begin{array}{l} ϕ_{10} (t, s_{1}, s_{2}) = E (s_{1}^{X_{1} (t)} s_{2}^{X_{2} (t)} ∣ X_{1} (0) = 1, X_{2} (0) = 0) \\ = \sum_{k = 0}^{\infty} \sum_{l = 0}^{\infty} P_{1, 0}^{k l} (t) s_{1}^{k} s_{2}^{l} \\ = \sum_{k = 0}^{\infty} \sum_{l = 0}^{\infty} (1_{k = 1, l = 0} + a_{1} (k, l) t + o (t)) s_{1}^{k} s_{2}^{l} \\ = s_{1} + u_{1} (s_{1}, s_{2}) t + o (t) . \end{array}

(D.2)

We have an analogous expression for ϕ₀₁(t, s₁, s₂) beginning with one particle of type 2 instead of type 1. For short, we will write ϕ₁₀ := ϕ₁, ϕ₀₁ := ϕ₂. Thus, we have the following relation between the functions ϕ and u:

\begin{array}{l} \frac{d ϕ_{1}}{d t} (t, s_{1}, s_{2}) ∣_{t = 0} = u_{1} (s_{1}, s_{2}) and \\ \frac{d ϕ_{2}}{d t} (t, s_{1}, s_{2}) ∣_{t = 0} = u_{2} (s_{1}, s_{2}) . \end{array}

(D.3)

To derive the backwards and forward equations, Chapman-Kolmogorov arguments yield the symmetric relations

\begin{array}{l} ϕ_{1} (t + h, s_{1}, s_{2}) = ϕ_{1} (t, ϕ_{1} (h, s_{1}, s_{2}), ϕ_{2} (h, s_{1}, s_{2})) \\ = ϕ_{1} (h, ϕ_{1} (t, s_{1}, s_{2}), ϕ_{2} (t, s_{1}, s_{2})) . \end{array}

(D.4)

First, we derive the backward equations by expanding around t and applying (D.3):

\begin{array}{l} ϕ_{1} (t + h, s_{1}, s_{2}) = ϕ_{1} (t, s_{1}, s_{2}) + \frac{d ϕ_{1}}{d h} (t + h, s_{1}, s_{2}) ∣_{h = 0} h + o (h) \\ = ϕ_{1} (t, s_{1}, s_{2}) + \frac{d ϕ_{1}}{d h} (h, ϕ_{1} (t, s_{1}, s_{2}), ϕ_{2} (t, s_{1}, s_{2}) ∣_{h = 0} h + o (h)) \\ = ϕ_{1} (t, s_{1}, s_{2}) + u_{1} (ϕ_{1} (t, s_{1}, s_{2}), ϕ_{2} (t, s_{1}, s_{2}) h + o (h)) . \end{array}

(D.5)

Since an analogous argument applies for ϕ₂, we arrive at the system

\begin{array}{l} \frac{d}{d t} ϕ_{1} (t, s_{1}, s_{2}) = u_{1} (ϕ_{1} (t, s_{1}, s_{2}), ϕ_{2} (t, s_{1}, s_{2})) and \\ \frac{d}{d t} ϕ_{2} (t, s_{1}, s_{2}) = u_{2} (ϕ_{1} (t, s_{1}, s_{2}), ϕ_{2} (t, s_{1}, s_{2})), \end{array}

(D.6)

with initial conditions ϕ₁(0, s₁, s₂) = s₁, ϕ₂(0, s₁, s₂) = s₂.

Recall in our SIR approximation, we use the initial population X₂(0) as a constant that scales the instantaneous rates over any time interval [t₀, t₁). The only nonzero rates specifying this proposed model, in the notation above, are

a_{1} (0, 1) = β X_{2} (0), a_{1} (1, 0) = - β X_{2} (0), a_{2} (0, 1) = - α, a_{2} (0, 0) = α .

(D.7)

For simplicity, call X₂(0) := I₀, the constant representing the infected population at the beginning of the time interval. Thus, the corresponding pseudo-generating functions have a simple form:

\begin{array}{r} u_{1} (s_{1}, s_{2}) = β I_{0} s_{2} - β I_{0} s_{1} and \\ u_{2} (s_{1}, s_{2}) = α - α s_{2} = α (1 - s_{2}) . \end{array}

(D.8)

Plugging into the backward equations, we obtain

\begin{array}{l} \frac{d}{d t} ϕ_{1} (t, s_{1}, s_{2}) = β I_{0} (ϕ_{2} (t, s_{1}, s_{2}) - ϕ_{1} (t, s_{1}, s_{2})) and \\ \frac{d}{d t} ϕ_{2} (t, s_{1}, s_{2}) = α - α ϕ_{2} (t, s_{1}, s_{2}) . \end{array}

(D.9)

The ϕ₂ differential equation corresponds to a pure death process and is immediately solvable; suppressing the arguments of ϕ₂ for notational convenience, we obtain

\begin{array}{l} \frac{d}{d t} ϕ_{2} & = α - α ϕ_{2} \\ \frac{d}{d t} ϕ_{2} (\frac{1}{1 - ϕ_{2}}) & = α \\ ln (1 - ϕ_{2}) & = - α t + C \\ ϕ_{2} & = 1 - exp (- α t + C) . \end{array}

(D.10)

Plugging in ϕ₂(0, s₁, s₂) = s₂, we obtain C = ln(1 − s₂), and we arrive at

ϕ_{2} (t, s_{1}, s_{2}) = 1 + (s_{2} - 1) exp (- α t)

(D.11)

Substituting this solution into the first differential equation and applying the integrating factor method provides

\begin{array}{l} ϕ_{1} e^{β I_{0} t} = \int β I_{0} e^{β I_{0} t} (1 + \frac{s_{2} - 1}{e^{α t}}) d t = e^{β I_{0} t} + β I_{0} (s_{2} - 1) \int e^{(β I_{0} - α) t} d t \\ = e^{β I_{0} t} + β I_{0} (s_{2} - 1) \frac{e^{(β I_{0} - α) t}}{β I_{0} - α} + C . \end{array}

(D.12)

Plugging in the initial condition ϕ₁(0, s₁, s₂) = s₁ and rearranging yields

ϕ_{1} = 1 + \frac{β I_{0} (s_{2} - 1)}{β I_{0} - α} e^{- α t} + e^{- β I_{0} t} (s_{1} - 1 - \frac{β I_{0} (s_{2} - 1)}{β I_{0} - α}) .

(D.13)

D.2 Transition probability expressions

Transition probabilities are related to the PGF via repeated partial differentiation; note that

\begin{array}{l} P_{k l}^{m n} (t) = {\frac{1}{k!} \frac{1}{l!} \frac{\partial^{k}}{\partial s_{1}^{k}} \frac{\partial^{l}}{\partial s_{2}^{l}} ϕ_{m n} (t, s_{1}, s_{2}) |}_{s_{1} = s_{2} = 0} \\ = {\frac{1}{k!} \frac{1}{l!} \frac{\partial^{k}}{\partial s_{1}^{k}} \frac{\partial^{l}}{\partial s_{2}^{l}} ϕ_{1}^{m} (t, s_{1}, s_{2}) ϕ_{2}^{n} (t, s_{1}, s_{2}) |}_{s_{1} = s_{2} = 0} \\ = {\frac{\partial^{l}}{\partial s_{2}^{l}} \sum_{i = 0}^{k} (\begin{matrix} k \\ i \end{matrix}) \frac{\partial^{k - i}}{\partial s_{1}^{k - i}} ϕ_{1}^{m} (t, s_{1}, s_{2}) \frac{\partial^{i}}{\partial s_{1}^{i}} ϕ_{2}^{n} (t, s_{1}, s_{2}) |}_{s_{1} = s_{2} = 0} . \end{array}

(D.14)

This expression is generally unwieldy, but notice ${\frac{\partial^{i}}{\partial s_{1}^{i}} ϕ_{2}^{n} (t, s_{1}, s_{2}) |}_{s_{1} = 0} = 0$ for all i > 0 in our model. Remarkably, this allows us to further simplify and ultimately arrive at closed-form expressions. Continuing, we see

\begin{array}{l} P_{k l}^{m n} (t) = {\frac{\partial^{l}}{\partial s_{2}^{l}} [(\begin{matrix} k \\ 0 \end{matrix}) ϕ_{2}^{n} (t, s_{1}, s_{2}) \frac{\partial^{k}}{\partial s_{1}^{k}} ϕ_{1}^{m} (t, s_{1}, s_{2})] |}_{s_{1} = s_{2} = 0} \\ = {\frac{\partial^{l}}{\partial s_{2}^{l}} {ϕ_{2}^{n} (t, s_{1}, s_{2}) \cdot \frac{m!}{(m - k)!} e^{- k β I_{0} t} {[1 + \frac{β I_{0} (s_{2} - 1)}{β I_{0} - α} e^{- α t} - e^{- β I_{0} t} (1 + \frac{β I_{0} (s_{2} - 1)}{β I_{0} - α})]}^{m - k}} |}_{s_{1} = s_{2} = 0} \\ : = {\frac{\partial^{l}}{\partial s_{2}^{l}} [ϕ_{2}^{n} (t, s_{1}, s_{2}) \cdot h (t, s_{1}, s_{2})] |}_{s_{1} = s_{2} = 0} \\ = \sum_{i = 0}^{l} (\begin{matrix} l \\ i \end{matrix}) \frac{\partial^{l - i}}{\partial s_{2}^{l - i}} h (t, s_{1}, s_{2}) \frac{\partial^{i}}{\partial s_{2}^{i}} ϕ_{2}^{n} (t, s_{1}, s_{2}) \\ : = \sum_{i = 0}^{l} (\begin{matrix} l \\ i \end{matrix}) A (l - i) B (i) . \end{array}

(D.15)

From here, it is straightforward to take partial derivatives of h(t, s₁, s₂) and our closed-form expression of $ϕ_{2}^{n} (t, s_{1}, s_{2})$ to arrive at Conditions (41) and (42). A heatmap visualization of the difference between transition probabilities under the branching approximation and those computed using the continued fraction method for the SIR model is included below.

Contributor Information

Lam Si Tung Ho, Department of Biostatistics, University of California, Los Angeles.

Jason Xu, Department of Biomathematics, University of California, Los Angeles.

Forrest W. Crawford, Department of Biostatistics, Yale University

Vladimir N. Minin, Departments of Statistics and Biology, University of Washington

Marc A. Suchard, Departments of Biomathematics, Biostatistics and Human Genetics, University of California, Los Angeles

References

Abate J, Whitt W. The Fourier-series method for inverting transforms of probability distributions. Queueing Systems. 1992;10(1–2):5–87. [Google Scholar]
Andersson H, Britton T. Stochastic epidemic models and their statistical analysis. Vol. 4. Springer; New York: 2000. [Google Scholar]
Andrieu C, Doucet A, Holenstein R. Particle Markov chain Monte Carlo methods. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2010;72(3):269–342. [Google Scholar]
Blum MG, Tran VC. HIV with contact tracing: a case study in approximate Bayesian computation. Biostatistics. 2010;11(4):644–660. doi: 10.1093/biostatistics/kxq022. [DOI] [PubMed] [Google Scholar]
Brauer F. Mathematical epidemiology. Springer; 2008. Compartmental models in epidemiology; pp. 19–79. [Google Scholar]
Britton T. Stochastic epidemic models: a survey. Mathematical Biosciences. 2010;225(1):24–35. doi: 10.1016/j.mbs.2010.01.006. [DOI] [PubMed] [Google Scholar]
Cauchemez S, Ferguson N. Likelihood-based estimation of continuous-time epidemic models from time-series data: application to measles transmission in London. Journal of the Royal Society Interface. 2008;5(25):885–897. doi: 10.1098/rsif.2007.1292. [DOI] [PMC free article] [PubMed] [Google Scholar]
Correia-Gomes C, Economou T, Bailey T, Brazdil P, Alban L, Niza-Ribeiro J. Transmission parameters estimated for salmonella typhimurium in swine using susceptible-infectious-resistant models and a bayesian approach. BMC veterinary research. 2014;10(1):101. doi: 10.1186/1746-6148-10-101. [DOI] [PMC free article] [PubMed] [Google Scholar]
Craviotto C, Jones WB, Thron W. A survey of truncation error analysis for Padé and continued fraction approximants. Acta Applicandae Mathematica. 1993;33(2–3):211–272. [Google Scholar]
Crawford FW, Suchard MA. Transition probabilities for general birth–death processes with applications in ecology, genetics, and evolution. Journal of Mathematical Biology. 2012;65(3):553–580. doi: 10.1007/s00285-011-0471-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Crawford FW, Minin VN, Suchard MA. Estimation for general birth-death processes. Journal of the American Statistical Association. 2014;109(506):730–747. doi: 10.1080/01621459.2013.866565. [DOI] [PMC free article] [PubMed] [Google Scholar]
Crawford FW, Weiss RE, Suchard MA. Sex, lies, and self-reported counts: Bayesian mixture models for longitudinal heaped count data via birth-death processes. Annals of Applied Statistics. 2015;9:572–596. doi: 10.1214/15-AOAS809. [DOI] [PMC free article] [PubMed] [Google Scholar]
Crawford FW, Stutz TC, Lange K. Coupling bounds for approximating birth-death processes by truncation. Statistics & probability letters. 2016;109:30–38. doi: 10.1016/j.spl.2015.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
Csilléry K, Blum MG, Gaggiotti OE, François O. Approximate Bayesian computation (ABC) in practice. Trends in Ecology & Evolution. 2010;25(7):410–418. doi: 10.1016/j.tree.2010.04.001. [DOI] [PubMed] [Google Scholar]
Doss CR, Suchard MA, Holmes I, Kato-Maeda M, Minin VN. Fitting birth–death processes to panel data with applications to bacterial DNA fingerprinting. The Annals of Applied Statistics. 2013;7(4):2315–2335. doi: 10.1214/13-AOAS673. [DOI] [PMC free article] [PubMed] [Google Scholar]
Drovandi CC, Pettitt AN. Estimation of parameters for macroparasite population evolution using approximate Bayesian computation. Biometrics. 2011;67(1):225–233. doi: 10.1111/j.1541-0420.2010.01410.x. [DOI] [PubMed] [Google Scholar]
Dukic V, Lopes HF, Polson NG. Tracking epidemics with Google flu trends data and a state-space SEIR model. Journal of the American Statistical Association. 2012;107(500):1410–1426. doi: 10.1080/01621459.2012.713876. [DOI] [PMC free article] [PubMed] [Google Scholar]
Earn DJ. Mathematical epidemiology. Springer; 2008. A light introduction to modelling recurrent epidemics; pp. 3–17. [Google Scholar]
Ephraim Y, Mark BL. Bivariate Markov processes and their estimation. Foundations and Trends in Signal Processing. 2012;6(1):1–95. [Google Scholar]
van den Eshof J, Hochbruck M. Preconditioning lanczos approximations to the matrix exponential. SIAM Journal on Scientific Computing. 2006;27(4):1438–1457. [Google Scholar]
Feller W. An Introduction to Probability Theory and its Applications. Vol. 1. John Wiley & Sons; 1968. [Google Scholar]
Finkenstädt B, Grenfell B. Time series modelling of childhood diseases: a dynamical systems approach. Journal of the Royal Statistical Society: Series C (Applied Statistics) 2000;49(2):187–205. [Google Scholar]
Golightly A, Wilkinson DJ. Bayesian inference for stochastic kinetic models using a diffusion approximation. Biometrics. 2005;61(3):781–788. doi: 10.1111/j.1541-0420.2005.00345.x. [DOI] [PubMed] [Google Scholar]
Griffiths D. A bivariate birth-death process which approximates to the spread of a disease involving a vector. Journal of Applied Probability. 1972;9(1):65–75. [Google Scholar]
Hitchcock S. Extinction probabilities in predator-prey models. Journal of Applied Probability. 1986;23(1):1–13. [Google Scholar]
Iglehart DL. Multivariate competition processes. The Annals of Mathematical Statistics. 1964;35(1):350–361. [Google Scholar]
Ionides E, Bretó C, King A. Inference for nonlinear dynamical systems. Proceedings of the National Academy of Sciences, USA. 2006;103(49):18,438–18,443. doi: 10.1073/pnas.0603181103. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ionides EL, Nguyen D, Atchadé Y, Stoev S, King AA. Inference for dynamic and latent variable models via iterated, perturbed Bayes maps. Proceedings of the National Academy of Sciences, USA. 2015;112(3):719–724. doi: 10.1073/pnas.1410597112. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jahnke T, Huisinga W. Solving the chemical master equation for monomolecular reaction systems analytically. Journal of Mathematical Biology. 2007;54(1):1–26. doi: 10.1007/s00285-006-0034-x. [DOI] [PubMed] [Google Scholar]
Karev GP, Berezovskaya FS, Koonin EV. Modeling genome evolution with a diffusion approximation of a birth-and-death process. Bioinformatics. 2005;21(Suppl 3):iii12–iii19. doi: 10.1093/bioinformatics/bti1202. [DOI] [PubMed] [Google Scholar]
Keeling M, Ross J. On methods for studying stochastic disease dynamics. Journal of The Royal Society Interface. 2008;5(19):171–181. doi: 10.1098/rsif.2007.1106. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kermack W, McKendrick A. A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society of London Series A. 1927;115(772):700–721. [Google Scholar]
Lentz WJ. Generating Bessel functions in Mie scattering calculations using continued fractions. Applied Optics. 1976;15(3):668–671. doi: 10.1364/AO.15.000668. [DOI] [PubMed] [Google Scholar]
Levin D. Development of non-linear transformations for improving convergence of sequences. International Journal of Computer Mathematics. 1973;3(1–4):371–388. [Google Scholar]
Martin AD, Quinn KM, Park JH. MCMCpack: Markov chain Monte Carlo in R. Journal of Statistical Software. 2011;42(9):22. URL http://www.jstatsoft.org/v42/i09/ [Google Scholar]
McKendrick A. Applications of mathematics to medical problems. Proceedings of the Edinburgh Mathematics Society. 1926;44:98–130. [Google Scholar]
McKinley T, Cook AR, Deardon R. Inference in epidemic models without likelihoods. The International Journal of Biostatistics. 2009;5(1):1557–4679. [Google Scholar]
Moler C, Loan C. Nineteen dubious ways to compute the exponential of a matrix, twenty-five years later. SIAM Review. 2003;45:3–49. [Google Scholar]
Murphy J, O’Donohoe M. Some properties of continued fractions with applications in Markov processes. IMA Journal of Applied Mathematics. 1975;16(1):57–71. [Google Scholar]
Novozhilov AS, Karev GP, Koonin EV. Biological applications of the theory of birth-and-death processes. Briefings in Bioinformatics. 2006;7(1):70–85. doi: 10.1093/bib/bbk006. [DOI] [PubMed] [Google Scholar]
Owen J, Wilkinson DJ, Gillespie CS. Scalable inference for Markov processes with intractable likelihoods. Statistics and Computing. 2015;25(1):145–156. [Google Scholar]
Rabier CE, Ta T, Ané C. Detecting and locating whole genome duplications on a phylogeny: a probabilistic approach. Molecular Biology and Evolution. 2014;31(3):750–762. doi: 10.1093/molbev/mst263. [DOI] [PMC free article] [PubMed] [Google Scholar]
Raggett G. A stochastic model of the Eyam plague. Journal of Applied Statistics. 1982;9(2):212–225. [Google Scholar]
Renshaw E. Stochastic Population Processes: Analysis, Approximations, Simulations. Oxford University Press; Oxford, UK: 2011. [Google Scholar]
Reuter GEH. Denumerable Markov processes and the associated contraction semigroups on l. Acta Mathematica. 1957;97(1):1–46. [Google Scholar]
Reuter GEH. Competition processes. Proc 4th Berkeley Symp Math Statist Prob. 1961;2:421–430. [Google Scholar]
Riley S, Donnelly CA, Ferguson NM. Robust parameter estimation techniques for stochastic within-host macroparasite models. Journal of Theoretical Biology. 2003;225(4):419–430. doi: 10.1016/s0022-5193(03)00266-2. [DOI] [PubMed] [Google Scholar]
Robert CP, Cornuet JM, Marin JM, Pillai NS. Lack of confidence in approximate Bayesian computation model choice. Proceedings of the National Academy of Sciences. 2011;108(37):15,112–15,117. doi: 10.1073/pnas.1102900108. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rosenberg NA, Tsolaki AG, Tanaka MM. Estimating change rates of genetic markers using serial samples: applications to the transposon IS6110 in Mycobacterium tuberculosis. Theoretical Population Biology. 2003;63(4):347–363. doi: 10.1016/s0040-5809(03)00010-8. [DOI] [PubMed] [Google Scholar]
Schranz HW, Yap VB, Easteal S, Knight R, Huttley GA. Pathological rate matrices: from primates to pathogens. BMC Bioinformatics. 2008;9(1):550. doi: 10.1186/1471-2105-9-550. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sidje RB. Expokit: a software package for computing matrix exponentials. ACM Transactions on Mathematical Software (TOMS) 1998;24(1):130–156. [Google Scholar]
Sunnåker M, Busetto AG, Numminen E, Corander J, Foll M, Dessimoz C. Approximate Bayesian computation. PLoS Computational Biology. 2013;9(1):e1002,803. doi: 10.1371/journal.pcbi.1002803. [DOI] [PMC free article] [PubMed] [Google Scholar]
Thompson I, Barnett A. Coulomb and Bessel functions of complex arguments and order. Journal of Computational Physics. 1986;64(2):490–509. [Google Scholar]
Xu J, Guttorp P, Kato-Maeda M, Minin VN. Likelihood-based inference for discretely observed birth–death-shift processes, with applications to evolution of mobile genetic elements. Biometrics. 2015;71(4):1009–1021. doi: 10.1111/biom.12352. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] Abate J, Whitt W. The Fourier-series method for inverting transforms of probability distributions. Queueing Systems. 1992;10(1–2):5–87. [Google Scholar]

[R2] Andersson H, Britton T. Stochastic epidemic models and their statistical analysis. Vol. 4. Springer; New York: 2000. [Google Scholar]

[R3] Andrieu C, Doucet A, Holenstein R. Particle Markov chain Monte Carlo methods. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2010;72(3):269–342. [Google Scholar]

[R4] Blum MG, Tran VC. HIV with contact tracing: a case study in approximate Bayesian computation. Biostatistics. 2010;11(4):644–660. doi: 10.1093/biostatistics/kxq022. [DOI] [PubMed] [Google Scholar]

[R5] Brauer F. Mathematical epidemiology. Springer; 2008. Compartmental models in epidemiology; pp. 19–79. [Google Scholar]

[R6] Britton T. Stochastic epidemic models: a survey. Mathematical Biosciences. 2010;225(1):24–35. doi: 10.1016/j.mbs.2010.01.006. [DOI] [PubMed] [Google Scholar]

[R7] Cauchemez S, Ferguson N. Likelihood-based estimation of continuous-time epidemic models from time-series data: application to measles transmission in London. Journal of the Royal Society Interface. 2008;5(25):885–897. doi: 10.1098/rsif.2007.1292. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] Correia-Gomes C, Economou T, Bailey T, Brazdil P, Alban L, Niza-Ribeiro J. Transmission parameters estimated for salmonella typhimurium in swine using susceptible-infectious-resistant models and a bayesian approach. BMC veterinary research. 2014;10(1):101. doi: 10.1186/1746-6148-10-101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] Craviotto C, Jones WB, Thron W. A survey of truncation error analysis for Padé and continued fraction approximants. Acta Applicandae Mathematica. 1993;33(2–3):211–272. [Google Scholar]

[R10] Crawford FW, Suchard MA. Transition probabilities for general birth–death processes with applications in ecology, genetics, and evolution. Journal of Mathematical Biology. 2012;65(3):553–580. doi: 10.1007/s00285-011-0471-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Crawford FW, Minin VN, Suchard MA. Estimation for general birth-death processes. Journal of the American Statistical Association. 2014;109(506):730–747. doi: 10.1080/01621459.2013.866565. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] Crawford FW, Weiss RE, Suchard MA. Sex, lies, and self-reported counts: Bayesian mixture models for longitudinal heaped count data via birth-death processes. Annals of Applied Statistics. 2015;9:572–596. doi: 10.1214/15-AOAS809. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] Crawford FW, Stutz TC, Lange K. Coupling bounds for approximating birth-death processes by truncation. Statistics & probability letters. 2016;109:30–38. doi: 10.1016/j.spl.2015.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] Csilléry K, Blum MG, Gaggiotti OE, François O. Approximate Bayesian computation (ABC) in practice. Trends in Ecology & Evolution. 2010;25(7):410–418. doi: 10.1016/j.tree.2010.04.001. [DOI] [PubMed] [Google Scholar]

[R15] Doss CR, Suchard MA, Holmes I, Kato-Maeda M, Minin VN. Fitting birth–death processes to panel data with applications to bacterial DNA fingerprinting. The Annals of Applied Statistics. 2013;7(4):2315–2335. doi: 10.1214/13-AOAS673. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] Drovandi CC, Pettitt AN. Estimation of parameters for macroparasite population evolution using approximate Bayesian computation. Biometrics. 2011;67(1):225–233. doi: 10.1111/j.1541-0420.2010.01410.x. [DOI] [PubMed] [Google Scholar]

[R17] Dukic V, Lopes HF, Polson NG. Tracking epidemics with Google flu trends data and a state-space SEIR model. Journal of the American Statistical Association. 2012;107(500):1410–1426. doi: 10.1080/01621459.2012.713876. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] Earn DJ. Mathematical epidemiology. Springer; 2008. A light introduction to modelling recurrent epidemics; pp. 3–17. [Google Scholar]

[R19] Ephraim Y, Mark BL. Bivariate Markov processes and their estimation. Foundations and Trends in Signal Processing. 2012;6(1):1–95. [Google Scholar]

[R20] van den Eshof J, Hochbruck M. Preconditioning lanczos approximations to the matrix exponential. SIAM Journal on Scientific Computing. 2006;27(4):1438–1457. [Google Scholar]

[R21] Feller W. An Introduction to Probability Theory and its Applications. Vol. 1. John Wiley & Sons; 1968. [Google Scholar]

[R22] Finkenstädt B, Grenfell B. Time series modelling of childhood diseases: a dynamical systems approach. Journal of the Royal Statistical Society: Series C (Applied Statistics) 2000;49(2):187–205. [Google Scholar]

[R23] Golightly A, Wilkinson DJ. Bayesian inference for stochastic kinetic models using a diffusion approximation. Biometrics. 2005;61(3):781–788. doi: 10.1111/j.1541-0420.2005.00345.x. [DOI] [PubMed] [Google Scholar]

[R24] Griffiths D. A bivariate birth-death process which approximates to the spread of a disease involving a vector. Journal of Applied Probability. 1972;9(1):65–75. [Google Scholar]

[R25] Hitchcock S. Extinction probabilities in predator-prey models. Journal of Applied Probability. 1986;23(1):1–13. [Google Scholar]

[R26] Iglehart DL. Multivariate competition processes. The Annals of Mathematical Statistics. 1964;35(1):350–361. [Google Scholar]

[R27] Ionides E, Bretó C, King A. Inference for nonlinear dynamical systems. Proceedings of the National Academy of Sciences, USA. 2006;103(49):18,438–18,443. doi: 10.1073/pnas.0603181103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] Ionides EL, Nguyen D, Atchadé Y, Stoev S, King AA. Inference for dynamic and latent variable models via iterated, perturbed Bayes maps. Proceedings of the National Academy of Sciences, USA. 2015;112(3):719–724. doi: 10.1073/pnas.1410597112. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] Jahnke T, Huisinga W. Solving the chemical master equation for monomolecular reaction systems analytically. Journal of Mathematical Biology. 2007;54(1):1–26. doi: 10.1007/s00285-006-0034-x. [DOI] [PubMed] [Google Scholar]

[R30] Karev GP, Berezovskaya FS, Koonin EV. Modeling genome evolution with a diffusion approximation of a birth-and-death process. Bioinformatics. 2005;21(Suppl 3):iii12–iii19. doi: 10.1093/bioinformatics/bti1202. [DOI] [PubMed] [Google Scholar]

[R31] Keeling M, Ross J. On methods for studying stochastic disease dynamics. Journal of The Royal Society Interface. 2008;5(19):171–181. doi: 10.1098/rsif.2007.1106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] Kermack W, McKendrick A. A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society of London Series A. 1927;115(772):700–721. [Google Scholar]

[R33] Lentz WJ. Generating Bessel functions in Mie scattering calculations using continued fractions. Applied Optics. 1976;15(3):668–671. doi: 10.1364/AO.15.000668. [DOI] [PubMed] [Google Scholar]

[R34] Levin D. Development of non-linear transformations for improving convergence of sequences. International Journal of Computer Mathematics. 1973;3(1–4):371–388. [Google Scholar]

[R35] Martin AD, Quinn KM, Park JH. MCMCpack: Markov chain Monte Carlo in R. Journal of Statistical Software. 2011;42(9):22. URL http://www.jstatsoft.org/v42/i09/ [Google Scholar]

[R36] McKendrick A. Applications of mathematics to medical problems. Proceedings of the Edinburgh Mathematics Society. 1926;44:98–130. [Google Scholar]

[R37] McKinley T, Cook AR, Deardon R. Inference in epidemic models without likelihoods. The International Journal of Biostatistics. 2009;5(1):1557–4679. [Google Scholar]

[R38] Moler C, Loan C. Nineteen dubious ways to compute the exponential of a matrix, twenty-five years later. SIAM Review. 2003;45:3–49. [Google Scholar]

[R39] Murphy J, O’Donohoe M. Some properties of continued fractions with applications in Markov processes. IMA Journal of Applied Mathematics. 1975;16(1):57–71. [Google Scholar]

[R40] Novozhilov AS, Karev GP, Koonin EV. Biological applications of the theory of birth-and-death processes. Briefings in Bioinformatics. 2006;7(1):70–85. doi: 10.1093/bib/bbk006. [DOI] [PubMed] [Google Scholar]

[R41] Owen J, Wilkinson DJ, Gillespie CS. Scalable inference for Markov processes with intractable likelihoods. Statistics and Computing. 2015;25(1):145–156. [Google Scholar]

[R42] Rabier CE, Ta T, Ané C. Detecting and locating whole genome duplications on a phylogeny: a probabilistic approach. Molecular Biology and Evolution. 2014;31(3):750–762. doi: 10.1093/molbev/mst263. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R43] Raggett G. A stochastic model of the Eyam plague. Journal of Applied Statistics. 1982;9(2):212–225. [Google Scholar]

[R44] Renshaw E. Stochastic Population Processes: Analysis, Approximations, Simulations. Oxford University Press; Oxford, UK: 2011. [Google Scholar]

[R45] Reuter GEH. Denumerable Markov processes and the associated contraction semigroups on l. Acta Mathematica. 1957;97(1):1–46. [Google Scholar]

[R46] Reuter GEH. Competition processes. Proc 4th Berkeley Symp Math Statist Prob. 1961;2:421–430. [Google Scholar]

[R47] Riley S, Donnelly CA, Ferguson NM. Robust parameter estimation techniques for stochastic within-host macroparasite models. Journal of Theoretical Biology. 2003;225(4):419–430. doi: 10.1016/s0022-5193(03)00266-2. [DOI] [PubMed] [Google Scholar]

[R48] Robert CP, Cornuet JM, Marin JM, Pillai NS. Lack of confidence in approximate Bayesian computation model choice. Proceedings of the National Academy of Sciences. 2011;108(37):15,112–15,117. doi: 10.1073/pnas.1102900108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R49] Rosenberg NA, Tsolaki AG, Tanaka MM. Estimating change rates of genetic markers using serial samples: applications to the transposon IS6110 in Mycobacterium tuberculosis. Theoretical Population Biology. 2003;63(4):347–363. doi: 10.1016/s0040-5809(03)00010-8. [DOI] [PubMed] [Google Scholar]

[R50] Schranz HW, Yap VB, Easteal S, Knight R, Huttley GA. Pathological rate matrices: from primates to pathogens. BMC Bioinformatics. 2008;9(1):550. doi: 10.1186/1471-2105-9-550. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R51] Sidje RB. Expokit: a software package for computing matrix exponentials. ACM Transactions on Mathematical Software (TOMS) 1998;24(1):130–156. [Google Scholar]

[R52] Sunnåker M, Busetto AG, Numminen E, Corander J, Foll M, Dessimoz C. Approximate Bayesian computation. PLoS Computational Biology. 2013;9(1):e1002,803. doi: 10.1371/journal.pcbi.1002803. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R53] Thompson I, Barnett A. Coulomb and Bessel functions of complex arguments and order. Journal of Computational Physics. 1986;64(2):490–509. [Google Scholar]

[R54] Xu J, Guttorp P, Kato-Maeda M, Minin VN. Likelihood-based inference for discretely observed birth–death-shift processes, with applications to evolution of mobile genetic elements. Biometrics. 2015;71(4):1009–1021. doi: 10.1111/biom.12352. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Birth/birth-death processes and their computable transition probabilities with biological applications

Lam Si Tung Ho

Jason Xu

Forrest W Crawford

Vladimir N Minin

Marc A Suchard

Abstract

1 Introduction

Previous work on computing the transition probabilities

2 Birth(death)/birth-death processes

2.1 Birth/birth-death processes

2.1.1 Sufficient condition for regularity

Definition 1

Theorem 1

Proof

Lemma 1

2.1.2 Recursive formula for transition probabilities

Theorem 2

2.1.3 Numerical approximation of the transitions probabilities

2.2 Death/birth-death processes

Theorem 3

3 Applications

3.1 Monomolecular reaction systems

3.2 Birth-death-shift model for transposable elements

Fig. 1.

3.3 Within-host macro-parasite model

Fig. 2.

Fig. 3.

Fig. 4.

3.4 Stochastic SIR model in epidemiology

3.5 Transition probabilities of the SIR model

Fig. 5.

4 The Plague in Eyam revisited

Table 1.

Fig. 6.

Fig. 7.

5 Discussion

Fig. 8.

Acknowledgments

A Continued fractions

Definition A1

Definition A2

Definition A3

Lemma A1

Lemma A2

B Modified Lentz method

C Convergence results of increasing the truncation level

D Branching SIR approximation

D.1 Deriving the PGF

D.2 Transition probability expressions

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases