Transition probabilities for general birth-death processes with applications in ecology, genetics, and evolution

Forrest W Crawford; Marc A Suchard

doi:10.1007/s00285-011-0471-z

. Author manuscript; available in PMC: 2012 Sep 1.

Published in final edited form as: J Math Biol. 2011 Oct 9;65(3):553–580. doi: 10.1007/s00285-011-0471-z

Transition probabilities for general birth-death processes with applications in ecology, genetics, and evolution

Forrest W Crawford ¹, Marc A Suchard ²

PMCID: PMC3310285 NIHMSID: NIHMS341412 PMID: 21984359

Abstract

A birth-death process is a continuous-time Markov chain that counts the number of particles in a system over time. In the general process with n current particles, a new particle is born with instantaneous rate λ_n and a particle dies with instantaneous rate μ_n. Currently no robust and efficient method exists to evaluate the finite-time transition probabilities in a general birth-death process with arbitrary birth and death rates. In this paper, we first revisit the theory of continued fractions to obtain expressions for the Laplace transforms of these transition probabilities and make explicit an important derivation connecting transition probabilities and continued fractions. We then develop an efficient algorithm for computing these probabilities that analyzes the error associated with approximations in the method. We demonstrate that this error-controlled method agrees with known solutions and outperforms previous approaches to computing these probabilities. Finally, we apply our novel method to several important problems in ecology, evolution, and genetics.

Keywords: General birth-death process, Continuous-time Markov chain, Transition probabilities, Population genetics, Ecology, Evolution

1 Introduction

Birth-death processes (BDPs) have a rich history in probabilistic modeling, including applications in ecology, genetics, and evolution (Thorne et al 1991; Krone and Neuhauser 1997; Novozhilov et al 2006). Traditionally, BDPs have been used to model the number of organisms or particles in a system, each of which reproduce and die in continuous time. A general BDP is a continuous-time Markov chain on the non-negative integers in which instantaneous transitions from state n ≥ 0 to either n+1 or n−1 are possible. These transitions are called “births” and “deaths”. Starting at state n, jumps to n + 1 occur with instantaneous rate λ_n and jumps to n − 1 with instantaneous rate μ_n. The simplest BDP has linear rates λ_n = nλ and μ_n = nμ with no state-independent terms (Kendall 1948; Feller 1971). This model is the most widely-used BDP since there exist closed-form expressions for its transition probabilities (Bailey 1964; Novozhilov et al 2006). Many applications of BDPs require convenient methods for computing the probability P_m,n(t) that the system moves from state m to state n in finite time t ≥ 0. These probabilities exhibit their usefulness in many modeling applications since the probabilities do not depend on the possibly unobserved path taken by the process from m to n and hence make possible analyses of discretely sampled or partially observed processes. Despite the relative simplicity of specifying the rates of a general BDP, it can be remarkably difficult to find closed-form solutions for the transition probabilities even for simple models (Renshaw 1993; Mederer 2003; Novozhilov et al 2006).

In a pioneering series of papers, Karlin and McGregor develop a formal theory of general BDPs that expresses their transition probabilities in terms of a sequence of orthogonal polynomials and a spectral measure (Karlin and McGregor 1957a,b, 1958b). While the work of Karlin and McGregor yields valuable theoretical insights regarding the existence of unique solutions and properties of recurrence and transience for a given process, there remains no clear recipe for determining the orthogonal polynomials and measure corresponding to an arbitrary set of birth and death rates. Additionally, even when the polynomials and measure are known, the transition probabilities may not have an analytic representation or a convenient computational form.

Possibly due to the difficulty of finding computationally useful formulas for transition probabilities in general BDPs, many applied researchers resort to easier analyses using moments, first passage times, equilibrium probabilities, and other tractable quantities of interest. Referring to the system of Kolmogorov forward differential equations for transition probabilities that we give below, Novozhilov et al (2006, page 73) write,

“The problem with exact solutions of system (1) is that, in many cases, the expressions for the state probabilities, although explicit, are intractable for analysis and include special polynomials. In such cases, it may be sensible to solve more modest problems concerning the birth-and-death process under consideration, without the knowledge of the time-dependent behavior of state probabilities p_n(t).”

Indeed, closed-form analytic expressions for transition probabilities of general BDPs are only known for a few types of processes. Some examples include constant birth and death rates (Bailey 1964), zero birth or death rates (pure-death and pure-birth) (Yule 1925; Taylor and Karlin 1998), and certain linear rates (Karlin and McGregor 1958a). As a seemingly straightforward example, in the BDP with linear birth and death rates λ_n = nλ + ν and μ_n = nμ + γ including state-independent terms, Ismail et al (1988) offer the orthogonal polynomials and associated measure, but still no closed form is available for the transition probabilities.

Despite the difficulty in obtaining analytic expressions, several authors have made progress in approximate numerical methods for solution of transition probabilities in general BDPs. Murphy and O’Donohoe (1975) develop an appealing numerical method for the transition probabilities based on a continued fraction representation of Laplace-transformed transition probabilities. They invert these transformed probabilities by first truncating the continued fraction. Several other authors give similar expressions derived from truncation of the state space (Grassmann 1977b,a; Rosenlund 1978; Sharma and Dass 1988; Mohanty et al 1993). However, Klar et al (2010) find that methods based on continued fraction truncation and then subsequent analytical transformation can suffer from instability. As an alternative, Parthasarathy and Sudhesh (2006a) express the infinite continued fraction representation given by Murphy and O’Donohoe as a power series. Unfortunately, the small radius of convergence of this series makes it less useful for numerical computation.

We also note that for general BDPs that take values on a finite state space (usually n ∈ {0, 1,…, N}), it is possible to write a finite-dimensional stochastic transition rate matrix and solve for the matrix of transition probabilities. If the rate matrix is diagonal-izable, computation of transition probabilities in this manner can be computationally straightforward. To illustrate, let Q be a finite-dimensional stochastic rate matrix with Q = UΛU⁻¹ where U is an orthogonal matrix and Λ is diagonal. The matrix of transition probabilities P satisfies the matrix differential equation P′ = PQ with initial condition P(0) = I. The solution is P(t) = exp[Qt] = U diag(e^z₁t, e^z₂t,…, e^z_Nt) U⁻¹, where z₁,…, z_N are the eigenvalues of Q. However, it is possible to specify reasonable rate parameters in a general BDP that satisfy requirements for the existence of a unique solution, but do not result in a diagonalizable rate matrix. Also, if the state space over which the BDP takes values is large, numerical eigendecomposition of Q may be computationally expensive and could introduce serious roundoff errors.

To our knowledge, no robust computational method currently exists for finding the finite-time transition probabilities of general BDPs with arbitrary rates. Such a technique would allow rapid development of rich and sophisticated ecological, genetic, and evolutionary models. Additionally, in statistical applications, transition probabilities can serve as observed data likelihoods, and are thus often useful in estimating transition rate parameters from partially observed BDPs. We believe more sophisticated BDPs can be very useful for applied researchers. In spite of the numerical difficulties presented by approximant methods, we are surprised that continued fraction methods like that of Murphy and O’Donohoe (1975) are not more widely explored. This may be due to omission of important details in their derivation of continued fraction expressions for the Laplace transform of the transition probabilities.

In this paper, we build on continued fraction expressions for the Laplace transforms of the transition probabilities of a general BDP using techniques similar to those introduced by Murphy and O’Donohoe, and we fill in the missing details in the proof of this representation. We then apply the Laplace inversion formulae of Abate and Whitt (1992a,b) to obtain an efficient and robust method for computation of transition probabilities in general BDPs. Our method relies on three observations: 1) it is possible to find exact expressions for Laplace transforms of the transition probabilities of a general BDP using continued fractions (Murphy and O’Donohoe 1975); 2) evaluation of continued fractions is typically very fast, requires far fewer evaluations than equivalent power series, and there exist robust algorithms for evaluating them efficiently (Bankier and Leighton 1942; Wall 1948; Blanch 1964; Lorentzen and Waadeland 1992; Craviotto et al 1993; Abate and Whitt 1999; Cuyt et al 2008); and 3) recovery of probability distributions by Laplace inversion using a Riemann sum approximation is often more computationally stable than analytical methods of inversion (Abate and Whitt 1992a,b, 1995). Finally, we demonstrate the advantages of our error-controlled method through its application to several birth-death models in ecology, genetics, and evolution whose solution remains unavailable by other means.

2 Transition probabilities

2.1 Background

A general birth-death process is a continuous-time Markov process X = {X(t), t ≥ 0} counting the number of arbitrarily defined “particles” in existence at time t ≥ 0, with X(0) = m ≥ 0. To characterize the process, we define non-negative instantaneous birth rates λ_n and death rates μ_n for n ≥ 0, with μ₀ = 0 and transition probabilities P_m,n(t) = Pr(X(t) = n | X(0) = m). While λ_n and μ_n are time-homogeneous constants, they may depend on n. We refer to the classical linear BDP in which λ_n = nλ and μ_n = nμ as the “simple birth-death process” (Kendall 1948; Feller 1971). The general BDP transition probabilities satisfy the infinite system of ordinary differential equations

\begin{array}{l} \frac{{d P}_{m, 0} (t)}{d t} = μ_{1} P_{m, 1} (t) - λ_{0} P_{m, 0} (t), and \\ \frac{{d P}_{m, n} (t)}{d t} = λ_{n - 1} P_{m, n - 1} (t) + μ_{n + 1} P_{m, n + 1} (t) - (λ_{n} + μ_{n}) P_{m, n} (t) for n \geq 1, \end{array}

(1)

with boundary conditions P_m,m(0) = 1 and P_m,n(0) = 0 for n ≠ m (Feller 1971).

Karlin and McGregor (1957b) show that for arbitrary starting state m, transition probabilities can be represented in the form

P_{m, n} (t) = π_{n} \int_{0}^{\infty} e^{- x t} Q_{m} (x) Q_{n} (x) ψ (d x),

(2)

where π₀ = 1 and π_n = (λ₀ ··· λ_n₋₁)/(μ₁ ··· μ_n) for n ≥ 1. Here, {Q_n(x)} is a sequence of polynomials satisfying the three-term recurrence relation

\begin{array}{l} λ_{0} Q_{1} (x) = λ_{0} + μ_{0} - x, and \\ λ_{n} Q_{n + 1} (x) = (λ_{n} + μ_{n} - x) Q_{n} (x) - μ_{n} Q_{n - 1} (x), \end{array}

(3)

and ψ is the spectral measure of X with respect to which the polynomials {Q_n(x)} are orthogonal. The system (1) has a unique solution if and only if

\sum_{k = 0}^{\infty} (π_{k} + \frac{1}{λ_{k} π_{k}}) = \infty .

(4)

In what follows, we assume that the rate parameters {λ_n} and {μ_n} satisfy (4). Closed-form solutions to (1) are available for a surprisingly small number of choices of {λ_n} and {μ_n}. We therefore need another approach to find useful formulae for computation of the transition probabilities.

2.2 Continued fraction representation of Laplace transform

To find an expression that is useful for computing P_m,n(t) for an arbitrary general BDP, a fruitful approach is often to Laplace transform each equation of the system (1) and form a recurrence relationship relating back to the Laplace transform of P_m,n(t). We base our presentation on that of Murphy and O’Donohoe (1975). Denote the Laplace transform of P_n,m(t) as

f_{m, n} (s) = L [P_{m, n} (t)] (s) = \int_{0}^{\infty} e^{- s t} P_{m, n} (t) d t .

(5)

Applying the Laplace transform to (1), with the starting state m = 0, we arrive at

\begin{array}{l} {s f}_{0, 0} (s) - P_{0, 0} (0) = μ_{1} f_{0, 1} (s) - λ_{0} f_{0, 0} (s), and \\ {s f}_{0, n} (s) - P_{0, n} (0) = λ_{n - 1} f_{0, n - 1} (s) + μ_{n + 1} f_{0, n + 1} (s) - (λ_{n} + μ_{n}) f_{0, n} (s) \end{array}

(6)

for n ≥ 1. Rearranging and recalling that P₀_,₀(0) = 1 and P₀_,n(0) = 0 for n ≥ 1, we simplify (6) to

\begin{array}{l} f_{0, 1} (s) = \frac{1}{μ_{1}} [(s + λ_{0}) f_{0, 0} (s) - 1], and \\ f_{0, n} (s) = \frac{1}{μ_{n}} [(s + λ_{n - 1} + μ_{n - 1}) f_{0, n - 1} (s) - λ_{n - 2} f_{0, n - 2} (s)] for n \geq 2. \end{array}

(7)

Some rearranging of (7) yields the forward system of recurrence relations

\begin{array}{l} f_{0, 0} (s) = \frac{1}{s + λ_{0} - μ_{1} (\frac{f_{0, 1} (s)}{f_{0, 0} (s)})}, and \\ \frac{f_{0, n} (s)}{f_{0, n - 1} (s)} = \frac{λ_{n - 1}}{s + μ_{n} + λ_{n} - μ_{n + 1} (\frac{f_{0, n + 1} (s)}{f_{0, n} (s)})} . \end{array}

(8)

Then combining these expressions, we arrive at the generalized continued fraction

f_{0, 0} (s) = \frac{1}{s + λ_{0} - \frac{λ_{0} μ_{1}}{s + λ_{1} + μ_{1} - \frac{λ_{1} μ_{2}}{s + λ_{2} + μ_{2} - \dots}}} .

(9)

This is an exact expression for the Laplace transform of the transition probability P₀_,₀(t). Let the partial numerators in (9) be a₁ = 1 and a_n = −λ_n−₂μ_n−₁, and the partial denominators b₁ = s + λ₀ and b_n = s + λ_n−₁ + μ_n−₁ for n ≥ 2. Then (9) becomes

f_{0, 0} (s) = \frac{a_{1}}{b_{1} + \frac{a_{2}}{b_{2} + \frac{a_{3}}{b_{3} + \dots}}} .

(10)

To express (10) in more typographically economical notation, we write

f_{0, 0} (s) = \frac{a_{1}}{b_{1} +} \frac{a_{2}}{b_{2} +} \frac{a_{3}}{b_{3} +} \dots .

(11)

We denote the kth convergent (approximant) of f₀_,₀(s) as

f_{0, 0}^{(k)} (s) = \frac{a_{1}}{b_{1} +} \frac{a_{2}}{b_{2} +} \dots \frac{a_{k}}{b_{k}} = \frac{A_{k} (s)}{B_{k} (s)} .

(12)

There are deep connections between the orthogonal polynomial representation (3), Laplace transforms (7), and continued fractions of the form (9) that are beyond the scope of this paper (Karlin and McGregor 1957b; Bordes and Roehner 1983; Guillemin and Pinchon 1999). Interestingly, Flajolet and Guillemin (2000) demonstrate a close relationship between the Laplace transforms of transition probabilities and state paths of the underlying Markov chain.

Before stating a theorem supporting this representation, we give two lemmas that will be useful in what follows.

Lemma 1

Both the numerator A_k and denominator B_k of (12) satisfy the same recurrence, due to Wallis (1695):

\begin{array}{l} A_{k} = b_{k} A_{k - 1} + a_{k} A_{k - 2}, and \\ B_{k} = b_{k} B_{k - 1} + a_{k} B_{k - 2}, \end{array}

(13)

with A₀ = 0, A₁ = a₁, B₀ = 1, and B₁ = b₁.

Lemma 2

By repeated application of Lemma 1, we arrive at the determinant formula

\begin{array}{l} A_{k} B_{k - 1} - A_{k - 1} B_{k} = (b_{k} A_{k - 1} + a_{k} A_{k - 2}) B_{k - 1} - A_{k - 1} (b_{k} B_{k - 1} + a_{k} B_{k - 2}) \\ = - a_{k} (A_{k - 1} B_{k - 2} - A_{k - 2} B_{k - 1}) \\ = {(- 1)}^{k - 1} \prod_{i = 1}^{k} a_{i} . \end{array}

(14)

Now we state and prove a theorem giving expressions for the Laplace transform of P_m,n(t). Although Murphy and O’Donohoe (1975) first report this result, they do not provide a detailed derivation in their paper.

Theorem 1

The Laplace transform of the transition probability P_m,n(t) is given by

f_{m, n} (s) = {\begin{array}{l} (\prod_{j = n + 1}^{m} μ_{j}) \frac{B_{n} (s)}{B_{m + 1} (s) +} \frac{B_{m} (s) a_{m + 2}}{b_{m + 2} +} \frac{a_{m + 3}}{b_{m + 3} +} \dots & for n \leq m, \\ (\prod_{j = m}^{n - 1} λ_{j}) \frac{B_{m} (s)}{B_{n + 1} (s) +} \frac{B_{n} (s) a_{n + 2}}{b_{n + 2} +} \frac{a_{n + 3}}{b_{n + 3} +} \dots & for m \leq n, \end{array}

(15)

where a_n, b_n, and B_n are as defined above.

Proof

Proof To simplify notation, we sometimes omit the dependence of f_k, A_k, and B_k on the Laplace variable s. Suppose the process starts at X(0) = m. We can re-write the Laplace-transformed equations (6) with P_m,m(0) = 1 and P_m,n(0) = 0 for all n ≠ m as

{s f}_{m, 0} (s) - δ_{m 0} = μ_{1} f_{m, 1} (s) - λ_{0} f_{m, 0} (s),

(16a)

{s f}_{m, n} (s) - δ_{m n} = λ_{n - 1} f_{m, n - 1} (s) + μ_{n + 1} f_{m, n + 1} (s) - (λ_{n} + μ_{n}) f_{m, n} (s),

(16b)

where δ_mn = 1 if m = n and zero otherwise. We first derive the expression for n ≤ m. If m = 0, f₀_,₀(s) is given by (11), so we assume in what follows when n ≤ m, that m ≥ 1. Rearranging (16a), we see that since B₀ = 1 and s + λ₀ = b₁ = B₁,

f_{m, 0} = \frac{B_{0}}{B_{1}} μ_{1} f_{m, 1} .

(17)

Now, to show the general case by induction, assume that for n ≤ m,

f_{m, n - 1} = \frac{B_{n - 1}}{B_{n}} μ_{n} f_{m, n} .

(18)

Substituting (18) into (16b) when n < m, we have

b_{n + 1} f_{m, n} = λ_{n - 1} \frac{B_{n - 1}}{B_{n}} μ_{n} f_{m, n} + μ_{n + 1} f_{m, n + 1}

(19)

(b_{n + 1} + a_{n + 1} \frac{B_{n - 1}}{B_{n}}) f_{m, n} = μ_{n + 1} f_{m, n + 1}

(20)

f_{m, n} = \frac{B_{n}}{B_{n + 1}} μ_{n + 1} f_{m, n + 1}

(21)

and so (18) is true for any n < m. Letting n = m, we have by (18) and (16b),

b_{m + 1} f_{m, m} = 1 + λ_{m - 1} (\frac{B_{m - 1}}{B_{m}} μ_{m} f_{m, m}) + μ_{m + 1} f_{m, m + 1} .

(22)

Recalling that s + λ_m + μ_m = b_m₊₁ and using Lemma 1,

μ_{m + 1} f_{m, m + 1} = 1 - \frac{B_{m + 1}}{B_{m}} f_{m, m} .

(23)

Rearranging the previous equation, we find that

f_{m, m} = \frac{1}{\frac{B_{m + 1}}{B_{m}} + μ_{m + 1} \frac{f_{m, m + 1}}{f_{m, m}}} .

(24)

Likewise, we can write (16b) as a continued fraction recurrence:

\frac{f_{m, n}}{f_{m, n - 1}} = \frac{λ_{n - 1}}{s + μ_{n} + λ_{n} + μ_{n + 1} \frac{f_{m, n + 1}}{f_{m, n}}} .

(25)

Then plugging (25) into (24) and iterating, we obtain the continued fraction for f_m,m:

\begin{array}{l} f_{m, m} = \frac{1}{\frac{B_{m + 1}}{B_{m}} +} \frac{a_{m + 2}}{b_{m + 2} +} \frac{a_{m + 3}}{b_{m + 3} +} \dots \\ = \frac{B_{m}}{B_{m + 1} +} \frac{B_{m} a_{m + 2}}{b_{m + 2} +} \frac{a_{m + 3}}{b_{m + 3} +} \dots . \end{array}

(26)

This is an exact formula for the Laplace transform of P_m,m(t), and proves the case m = n. For n ≤ m, we iterate (18) to get

\begin{array}{l} f_{m, n} = \frac{B_{n}}{B_{n + 1}} μ_{n + 1} f_{m, n + 1} \\ = \frac{B_{n}}{B_{n + 1}} \frac{B_{n + 1}}{B_{n + 2}} μ_{n + 1} μ_{n + 2} f_{m, n + 2} \\ = \frac{B_{n}}{B_{n + 1}} \frac{B_{n + 1}}{B_{n + 2}} \dots \frac{B_{m - 1}}{B_{m}} μ_{n + 1} μ_{n + 2} \dots μ_{m} f_{m, m} \\ = (\prod_{j = n + 1}^{m} μ_{j}) \frac{B_{n}}{B_{m}} f_{m, m} . \end{array}

(27)

Substituting (26) for f_m,m completes the proof for n ≤ m.

To find the formula for f_m,n when n > m, we adopt a similar approach. From (24) we arrive at

B_{m + 1} f_{m, m} = B_{m} - B_{m} μ_{m + 1} f_{m, m + 1} .

(28)

We proceed inductively. Assume that for n > m,

B_{n + 1} f_{m, n} = (\prod_{j = m}^{n - 1} λ_{j}) B_{m} + μ_{n + 1} B_{n} f_{m, n + 1} .

(29)

From (16b), we have

b_{n + 2} f_{m, n + 1} = λ_{n} f_{m, n} + μ_{n + 2} f_{m, n + 2} .

(30)

Solving for f_m,n in (29) and plugging this into the above equation, we have

b_{n + 2} f_{m, n + 1} = λ_{n} (\prod_{j = m}^{n - 1} λ_{j}) \frac{B_{m}}{B_{n + 1}} + λ_{n} μ_{n + 1} \frac{B_{n}}{B_{n + 1}} f_{m, n + 1} + μ_{n + 2} f_{m, n + 2} .

(31)

Recalling that −λ_nμ_n₊₁ = a_n₊₂,

(b_{n + 2} B_{n + 1} + a_{n + 2} B_{m}) f_{m, n + 1} = (\prod_{j = m}^{n} λ_{j}) B_{n} + μ_{n + 2} B_{n + 1} f_{m, n + 2},

(32)

and by Lemma 1,

B_{n + 2} f_{m, n + 1} = (\prod_{j = m}^{n} λ_{j}) B_{m} + μ_{m + 2} B_{n + 1} f_{m, m + 2} .

(33)

This establishes the recurrence (29). Then for any n ≥ m, we can rearrange (29) to obtain

f_{m, n} = (\prod_{j = m}^{n - 1} λ_{j}) \frac{B_{m}}{B_{n + 1} - B_{n} μ_{n + 1} \frac{f_{m, n + 1}}{f_{m, n}}} .

(34)

This completes the proof.

2.3 Obtaining transition probabilities

Murphy and O’Donohoe (1975) find transition probabilities by truncating (15) at a pre-specified depth, forming a partial fractions sum, and inverse transforming. Parthasarathy and Sudhesh (2006a) give a series solution for transition probabilities based on an equivalence between continued fractions like (15) and power series. However, both of these approaches suffer from serious drawbacks, as we explore in detail in the Appendix.

We instead seek an efficient and robust numerical method for evaluating and inverting (15). We first note that continued fractions typically converge rapidly, and in our experience, evaluation of (15) is very fast and stable using the Lentz algorithm and its subsequent improvements (Lentz 1976; Thompson and Barnett 1986; Press 2007). We therefore invert (15) numerically by a summation formula.

To do this, we treat the continued fraction representation (15) of the Laplace transform of P_m,n(t) as an unknown but computable function of the complex Laplace variable s. We base our presentation on that of Abate and Whitt (1992a). If ε is a positive real number such that all singularities of f_m,n(s) lie to the left of ε in the complex plane, the inverse Laplace transform of f_m,n(s) is given by the Bromwich integral

P_{m, n} (t) = L^{- 1} (f_{m, n} (s)) = \frac{1}{2 π i} \int_{ε - i \infty}^{ε + i \infty} e^{s t} f_{m, n} (s) d s .

(35)

Letting s = ε + iu,

\begin{array}{l} P_{m, n} (t) = \frac{1}{2 π} \int_{- \infty}^{\infty} e^{(ε + i u) t} f_{m, n} (ε + i u) d u \\ = \frac{e^{ε t}}{2 π} \int_{- \infty}^{\infty} [cos (u t) + i sin (u t)] f_{m, n} (ε + i u) d u \\ = \frac{e^{ε t}}{2 π} [\int_{- \infty}^{\infty} [Re (f_{m, n} (ε + i u)) cos (u t) - Im (f_{m, n} (ε + i u)) sin (u t)] d u + i \int_{- \infty}^{\infty} [Im (f_{m, n} (ε + i u)) cos (u t) + Re (f_{m, n} (ε + i u)) sin (u t)] d u], \end{array}

(36)

but P_m,n(t) is real-valued, so the imaginary part of the last equality in (36) is zero.

Then

P_{m, n} (t) = \frac{e^{ε t}}{2 π} \int_{- \infty}^{\infty} [Re (f_{m, n} (ε + i u)) cos (u t) - Im (f_{m, n} (ε + i u)) sin (u t)] d u .

(37)

But since P_m,n(t) = 0 for t < 0, we also have that

\int_{- \infty}^{\infty} [Re (f_{m, n} (ε + i u)) cos (u t) + Im (f_{m, n} (ε + i u)) sin (u t)] d u = 0.

(38)

Then applying (38) to (37), we obtain

P_{m, n} (t) = \frac{e^{ε t}}{π} \int_{- \infty}^{\infty} Re (f_{m, n} (ε + i u)) cos (u t) d u .

(39)

Finally, we note that since

Re (f (ε - i u)) = \int_{0}^{\infty} e^{- ε t} cos (u t) P_{m, n} (t) d t = Re (f (ε + i u)),

(40)

it must be the case that Re f_m,n(ε + iu) is even in u for every ε. Therefore,

P_{m, n} (t) = \frac{2 e^{ε t}}{π} \int_{0}^{\infty} Re (f_{m, n} (ε + i u)) cos (u t) d u .

(41)

Following Abate and Whitt (1992a), we approximate the integral above by a discrete Riemann sum via the trapezoidal rule with step size h:

\begin{array}{l} P_{m, n} (t) \approx \frac{{h e}^{ε t}}{π} Re (f_{m, n} (ε)) + \frac{2 {h e}^{ε t}}{π} \sum_{k = 1}^{\infty} Re (f_{m, n} (ε + ikh)) cos (kht) \\ = \frac{e^{A / 2}}{2 t} Re (f_{m, n} (\frac{A}{2 t})) + \frac{e^{A / 2}}{t} \sum_{k = 1}^{\infty} {(- 1)}^{k} Re (f_{m, n} (\frac{A + 2 k π i}{2 t})), \end{array}

(42)

where the second line is obtained by setting h = π/(2t) and ε = A/(2t); this change of variables eliminates the cosine term.

2.4 Numerical considerations

While (42) presents a method for numerical solution of the transition probabilities P_m,n(t) for a BDP with arbitrary birth and death rates, it is not yet an algorithm for reliable evaluation of these probabilities. In order to develop a reliable numerical method, we must: 1) characterize the error introduced by discretization of the integral in (41); 2) determine a suitable method to evaluate this nearly alternating sum while controlling the error; and 3) accurately and rapidly evaluate the infinite continued fraction in (15).

Abate and Whitt show that the discretization error that arises in (42) is

e_{d} = \sum_{k = 1}^{\infty} e^{- k A} P_{m, n} ((2 k + 1) t),

(43)

and when P_m,n(t) ≤ 1,

e_{d} \leq \sum_{k = 1}^{\infty} e^{- k A} = \frac{e^{- A}}{1 - e^{- A}} \approx e^{- A},

(44)

when e^−A is small. Then to obtain e_d ≤ 10^−γ, we set A = γ log(10). As Abate and Whitt point out, the terms of the series (42) alternate in sign when

Re (f_{m, n} (\frac{A + 2 k π i}{2 t}))

(45)

has constant sign. This suggests that a series acceleration method may be helpful in keeping the terms of the sum manageable and avoiding roundoff error due to summands of alternating sign. We opt to use the Levin transform for this purpose (Levin 1973; Press 2007; Numerical Recipes Software 2007).

Evaluation of rational approximations to continued fractions by repeated application of Lemma 1 is appealing, but suffers from roundoff error when denominators are small (Press 2007). To evaluate the infinite continued fraction in the summand of (42), we use the modified Lentz method (Lentz 1976; Thompson and Barnett 1986; Press 2007). To demonstrate, suppose we wish to approximate the value of f₀_,₀(s), given by (9) by truncating at depth k. Then

f_{0, 0}^{(k)} (s) = \frac{A_{k} (s)}{B_{k} (s)}

(46)

is the kth rational approximant to the infinite continued fraction f₀_,₀(s). In the modified Lentz method, we stabilize the computation by finding the ratios

C_{k} = \frac{A_{k}}{A_{k - 1}} and D_{k} = \frac{B_{k - 1}}{B_{k}}

(47)

so that $f_{0, 0}^{(k)}$ can be found iteratively by

f_{0, 0}^{(k)} = f_{0, 0}^{(k - 1)} C_{k} D_{k} .

(48)

Using Lemma 1, we can iteratively compute C_k and D_k via the updates

C_{k} = b_{k} + \frac{a_{k}}{C_{k - 1}} and D_{k} = \frac{1}{b_{k} + a_{k} D_{k - 1}} .

(49)

In practice, we must evaluate the continued fraction to only a finite depth, but we must evaluate to a depth sufficient to control the error. Suppose we wish to evaluate the infinite continued fraction f₀_,₀(s) given by (9) at some complex number s. Intuitively, we wish to terminate the Lentz algorithm when the difference between successive convergents is small. However, it is not immediately clear how the difference between convergents $f_{0, 0}^{(k)} (s) - f_{0, 0}^{(k - 1)} (s)$ is related to the absolute error $f_{0, 0} (s) = f_{0, 0}^{(k)}$ .Craviotto et al (1993) make this relationship clear by furnishing an a posteriori truncation error bound for Jacobi fractions of the same form as (9) in this paper. Assuming that $f_{0, 0}^{(k)} (s) = A_{k} (s) / B_{k} (s)$ converges to f0,0(s) as k → ∞, Craviotto et al (1993) give the bound

| f_{0, 0} (s) - f_{0, 0}^{(k)} (s) | \leq \frac{| \frac{B_{k} (s)}{B_{k - 1} (s)} |}{| Im (\frac{B_{k} (s)}{B_{k - 1} (s)}) |} | f_{0, 0}^{(k)} (s) - f_{0, 0}^{(k - 1)} (s) |,

(50)

that is valid when Im(s) is nonzero. Note that B_k(s)/B_k−₁(s) = 1/D_k(s), so (50) is easy to evaluate during iteration under the Lentz algorithm. Therefore, we stop at depth k in the Lentz algorithm when

\frac{∣ 1 / D_{k} (s) ∣}{∣ Im (1 / D_{k} (s)) ∣} | f_{0, 0}^{(k)} (s) - f_{0, 0}^{(k - 1)} (s) |

(51)

is small.

2.5 Numerical results

Although our error-controlled method is designed to be used when an analytic solution cannot be found, we seek to validate our numerical results by comparison to available analytic and numerical solutions. For the simple BDP with λ_n = nλ and μ_n = nμ, our numerical results agree with the values from the well-known closed-form solution given explicitly in Bailey (1964) as

\begin{array}{l} P_{m, n} (t) = \sum_{j = 0}^{min (m, n)} (\begin{matrix} m \\ j \end{matrix}) (\begin{matrix} m + n - j - 1 \\ m - 1 \end{matrix}) α^{m - j} β^{n - j} {(1 - α - β)}^{j} \\ P_{m, 0} (t) = α^{m} \end{array}

(52)

where

α = \frac{μ (e^{(λ - μ) t} - 1)}{λ e^{(λ - μ) t} - μ} and β = \frac{λ (e^{(λ - μ) t} - 1)}{λ e^{(λ - μ) t} - μ} .

(53)

Murphy and O’Donohoe (1975) give numerical probabilities for four general birth-death models: a) immigration-death with λ_n = 0.2 and μ_n = 0.4n; b) immigration-emigration with μ_n = 0.3, μ₀ = 0, and μ_n = 0.1; c) queue with λ_n = 0.6, μ₀ = 0, μ₁ = μ₂ = 0.2, μ₃ = μ₄ = 0.4, and μ_n = 0.6 for n ≥ 5; and d) λ_n = 0.4, $μ_{n} = 0.1 \sqrt{n}$ Our results agree with those computed by Murphy and O’Donohoe for each of the four models given in Tables 2 through 7 in their paper (Murphy and O’Donohoe 1975). We note that Murphy and O’Donohoe did not report probabilities for m > 2 or n > 5 in any of their four models. In our experience, their method performs poorly when n + m + k is greater than approximately 20.

As a demonstration of the instability of the approximant method, we contrast the numerical results given by our error-controlled method with those obtained using the approximant method, that we implemented as described in Murphy and O’Donohoe (1975), except for some rescaling of intermediate quantities to avoid obvious sources of roundoff error. Figure 1 shows this comparison, using model (a) above, for three values of the truncation index k. Note that increasing the truncation depth k in the approximant method does not improve the error.

Fig. 1 — Comparison of transition probabilities P₁₀*_;n*(t = 1) computed by our error-controlled method and that of Murphy and O’Donohoe (1975) for the immigration-death model with *λ_n* = 0=2 and *μ_n* = 0.4n. The open circles are the values given by our method. The solid line corresponds with the approximant method of Murphy and O’Donohoe with k = 2 (solid line), k = 3 (dashed line), and k = 4 (dotted line). In our experience, the approximant method fails whenever n +m+ k is greater than approximately 20. It is interesting to note that increasing the depth of truncation k in the approximant method actually worsens the approximation.

3 Applications

Drawing on the robustness and generality of our error-controlled method, we conclude with four models in ecology, genetics, and evolution whose analytic solutions remain elusive and where past numerical approaches have fallen short. Using our approach, computation of transition probabilities is straightforward, and the techniques outlined above may be used without modification. Some of the examples are well-known models, and others are novel. In some cases, the orthogonal polynomials satisfying (3) are known, and hence a solution could be numerically computed using (2), provided there are good ways of evaluating the polynomials. Often, a severe drawback of using known orthogonal polynomials to compute a solution based on (2) is that the polynomials are model-specific. This makes experimentation and model selection difficult, since computation of transition probabilities depends on a priori analytic information about the polynomials and measure associated with the BDP. Our method does not rely on a priori information about the process, other than the birth and death rates for each state.

3.1 Immigration and emigration

Consider a population model for the number of organisms in an area, and suppose new immigrants arrive at rate ν, and emigrants leave at rate γ. Organisms living in the area reproduce with per-capita birth rate λ and die with rate μ. Define the linear rates

λ_{n} + n λ + ν and μ_{n} = n μ + γ .

(54)

For the case γ = 0, an analytic expression for the orthogonal polynomials is known (Karlin and McGregor 1958a). For nonzero γ, orthogonal polynomials are available from which a solution of the form (2) may be computed (Karlin and McGregor 1958a; Ismail et al 1988). However, using our error-controlled method, we can easily find the transition probabilities without additional analytic information. Figure 2 shows an example of the time-evolution of P_10,n(t) for various times t and states n, with the parameters λ = 0.5, ν = 0.2, μ = 0.3, and γ= 0.1. The approximant method method of Murphy and O’Donohoe fails to produce useful probabilities for n > 10 (not shown).

Fig. 2 — Transition probabilities for the immigration/emigration model with λ = 0.5, ν = 0.2, μ = 0.3, and γ= 0.1. The top panel shows P_10,_n(t) with t = 1 (solid line), t = 2 (dashed line), t = 3 (dotted line), and t = 4 (dash-dotted line) for n = 0,…, 50. The bottom panel shows P_10,_n(t) with n = 15 (solid line), n = 20 (dashed line), n = 25 (dotted line), and n = 30 (dash-dotted line) for t ∈ (0, 20).

3.2 Logistic growth with Allee effects

Populations of organisms that occupy a finite space may be subject to various constraints on their growth. The per-capita birth rate may decline when there are more organisms than the ecosystem can sustain (Tan and Piantadosi 1991). This can happen when there are too many organisms competing for the same food supply. The decay of population size above some carrying capacity is usually called logistic growth by ecologists. Another density-dependent constraint is known as the Allee effect, in which per-capita birth rate increases superlinearly with n once a small population has been established, due to favorable consequences of density, such as cooperation and mutual protection from predators (Allee et al 1949). As a realistic example of a general BDP that has no obvious solution by orthogonal polynomials, we seek a model that both transiently supports growth above the carrying capacity, and reflects these two density-dependent constraints, similar in spirit to models described by Tan and Piantadosi (1991) and Dennis (2002).

Qualitatively, if the per-capita birth rate with no density effects is λ, then the total birth rate should rise faster than nλ when n is small, slower than nλ for intermediate n near the carrying capacity, and should decay toward zero for n greater than the carrying capacity. Tan and Piantadosi introduce a logistic birth rate $λ_{n} = n λ (1 - \frac{n}{N})$ for a finite state space model that takes values {0; 1; : : : ;N}. However, to allow for temporary growth beyond the carrying capacity, we choose λ_n α λn²e^−&^agr;n for intermediate and large n. To achieve attenuated growth for small n as well, we scale this rate by a logistic function, yielding

λ_{n} = \frac{λ n^{2} e^{- α n}}{1 + e^{β (n - M)}} and μ_{n} = n μ,

(55)

where M is the population size with highest birth rate, and the death rate is assumed to be proportional only to the number of existing individuals. Figure 3 shows the resulting rates for various states n, with the different phases of population change shaded. To illustrate that the model produces the desired behavior, several realizations of the process are given in the lower panel for various starting values. The shaded regions correspond with the three phases of growth. Note that most paths in the lower panel of Figure 3 center near n = 27, where the birth rate and death rate are equal. The lower panel corresponds with Figure 1 in Dennis (2002). Figure 4 demonstrates the success of the error-controlled method in computing time-dependent extinction probabilities P_m,₀ for various starting values with λ = 1, α = 0.2, β = 0.3, M = 20, and μ = 0.1.

Fig. 3 — Behavior of logistic/Allee model. The upper panel shows a plot of birth (solid line) and death (dashed line) rates for states n = 0; : : : ; 60, and parameters λ = 1, μ = 0.1, M = 20, α = 0.2, and β = 0.3. The different phases of growth are labeled in the shaded regions. The lower panel shows stochastic realizations of the logistic/Allee model for various starting values. The shaded regions correspond with the shaded phases of growth in the upper panel.

Fig. 4 — Logistic/Allee model probabilities of extinction *P_m;*₀(t) for initial population sizes m = 1 (solid line), m = 5 (dashed line), m = 10 (dotted line), and m = 15 (dash-dotted line). The full model parametrization is found in the text.

3.3 Moran models with mutation and selection

The probability of fixation or extinction of an allele in finite populations is frequently of interest to researchers in genetics. However, publications often rely on the probability of eventual extinction P_m,₀(t → ∞), or the probability of fixation of a novel mutation in a population of constant size N, P₁_,N (t → ∞). While these asymptotic probabilities do reveal important properties of the underlying models, the information they provide about the distribution of time to fixation/extinction is incomplete. In practice, researchers may observe that m organisms in a sample exhibit a certain trait at a certain time. Then P_m,₀(t), the probability of extinction of that trait at finite times t in the future should presumably be of great interest, since researchers cannot reliably observe the process for infinitely long times. Additionally, the finite-time probability of fixation/extinction may exhibit threshold effects or unexpected dynamics that are not revealed by the asymptotic probability of such an event.

Moran (1958) introduces a model for the time-evolution of a biallelic locus when the population size is constant through time. A biallelic locus is a location in an organism’s genome in which two different genetic variants or alleles exist in a population. We are interested in how the number of individuals carrying each allele changes from generation to generation. Krone and Neuhauser (1997) exploit the Moran model to derive a BDP counting the number of individuals with a certain allele in the context of ancestral genealogy reconstruction in which one allele offers a selective advantage to individuals that carry it. Selection greatly complicates the problem and remains an active area of research. In a limiting case, this process corresponds to Kingman’s coalescent process when there is no mutation or selection (Kingman 1982a,b).

To construct the Moran process with mutation and selection, suppose a finite population of N haploid organisms has 2 alleles at a certain locus: A₁ and A₂. Individuals that carry A₁ reproduce at rate α and A₂ individuals reproduce at rate β. Suppose further that individuals carrying the A₁ allele have a selective advantage over individuals carrying A₂, so α > β. When an individual dies, it is replaced by the offspring of a random parent chosen from all N individuals, including the one that dies. This parent contributes a gamete carrying its allele that is also subject to mutation. Mutation from A₁ to A₂ happens with probability u and in reverse with rate υ. The new offspring receives the possibly mutated haplotype and the process continues.

Let X(t) be a BDP counting the number of A₁ individuals on the state space n ε {0,…,N}. To construct the transition rates of the process, suppose there are currently n individuals of type A₁. We first consider the addition of a new individual of type A₁, so that n → n + 1. For this to happen, the individual that dies must be of type A₂. If the parent of the replacement is one of the n of type A₁, the parent contributes its allele without mutation, and this happens with probability 1 − u. If the parent of the replacement is one of the N − n of type A₂, the parent contributes its allele, which then mutates with probability υ. Therefore, the total rate of addition is

λ_{n} = \frac{N - n}{N} [α \frac{n}{N} (1 - u) + β \frac{N - n}{N} v],

(56)

for n = 0,…,N with λ_n = 0 when n > N. Likewise, the removal of an individual of type A₁ can happen when one of the n individuals of type A₁ is chosen for replacement. If the parent of the replacement is one of the N − n of type A₂, the parent contributes A₂ without mutation, with probability 1 − υ. If the parent is one of the n of type A₁, the allele must mutate to A₂ with probability u. The total rate of removals becomes

μ_{n} = \frac{n}{N} [β \frac{N - n}{N} (1 - v) + α \frac{n}{N} u],

(57)

for n = 1,…,N with μ₀ = λ_N = 0 and μ_n = 0 when n > N. Note that if υ > 0, then λ₀ > 0 so the A₁ allele cannot go extinct. Also, if u > 0, then μ_N > 0, so the A₁ allele cannot be fixed in the population.

Karlin and McGregor (1962) derive the relevant polynomials and measure for the Moran process described above, but without selection, so that α = β. Donnelly (1984) gives expressions for the transition probabilities in the case where α = β = 1, noting that when selection is introduced (via differing α and α), his approach is no longer fruitful. Using our technique, computation of the transition probabilities under selection is straightforward. The upper panel of Figure 5 shows the probability of fixation by time t. The lower panel shows the finite-time fixation probability of A₁, P_m,₁₀₀(t), with u = 0 so the state n = 100 is absorbing.

Fig. 5 — Transition probabilities for the Moran model with selection. The upper panel shows the probability of n individuals having allele A₁ at time t, P₅₀*_;n*(t) for the Moran model with N = 100, starting from m = 50 with u = 0:02, v = 0:01, α = 60, and β = 10. We show the probabilities for t = 1 (solid line), t = 3 (dashed line), t = 5 (dotted line), t = 8 (dash-dotted line). Note that although the states 0 and 100 are not absorbing, the mutation rates u and v are small enough that probability accumulates significantly in these end states. Note also the asymmetry in the distribution at longer times. The lower panel reports the probability of fixation by time t, *P_m;*₁₀₀(t), for the same model, but with u = 0 so the state n = 100 is absorbing. The probabilities shown are for m = 70 (solid line), m = 50 (dashed line), m = 20 (dotted line), and m = 1 (dash-dotted line). Note the starkly different time-dynamics for different starting values.

Since the state space in the Moran model is finite, it is natural to consider the matrix exponentiation method discussed in the Introduction. We write the stochastic transition matrix as

Q = [\begin{matrix} - λ_{0} & λ_{0} \\ μ_{1} & - (λ_{1} + μ_{1}) & λ_{1} \\ μ_{2} & - (λ_{2} + μ_{2}) & λ_{2} \\ ⋱ & ⋱ & ⋱ \\ μ_{N} & - (λ_{N} + μ_{N}) & λ_{N} \end{matrix}]

(58)

where λ_n and μ_n are defined by (56) and (57), respectively. In our experience, the matrix exponentiation method often works well, and its computational cost is similar to that of our error-controlled method. However, it is highly sensitive to rate matrix conditioning. For example, Figure 6 shows a comparison of transition probabilities from the error-controlled method and the matrix exponentiation method for the Moran model with N = 100, α = 210, β = 20, u = 0:002, and υ = 0. In evolutionary terms, this means that mutation from A₁ to A₂ is impossible, and the A₂ haplotype suffers from low fitness. Computationally, this has the effect of making μ_n small for most n, and hence the rate matrix grows ill-conditioned.

Fig. 6 — Comparison of Moran model transition probabilities P₅₀*_;n*(t = 0.2) computed by two methods with N = 100, α = 210, β = 20, u = 0:002, and v = 0. The open circles correspond with our error-controlled method, and the solid line corresponds with the matrix exponentiation method. This choice of parameters causes wild fluctuations in probabilities reported by the matrix exponentiation method since the stochastic rate matrix becomes nearly singular.

Although the rate matrix in this example is nearly defective, this choice of parameter values is not unreasonably extreme. For example, researchers in population genetics often wish to test the hypothesis that selection occurs in a dataset. They fit parameters for models with selection (full model) and without selection (restricted model) and perform a likelihood ratio test of this hypothesis. If the estimates of β and u in the full model are small, they may be unable to reliably compute the probability (likelihood) of the data, given the estimated parameter values under the full model.

3.4 A frameshift-aware indel model

Thorne et al (1991) introduce a BDP modeling insertion and deletion of nucleotides in DNA for applications in molecular evolution. The authors model the process of sequence length evolution by assuming that a new nucleotide can be inserted adjacent to every existing nucleotide, and every existing nucleotide is subject to deletion, at a constant per-nucleotide rate. This corresponds to the simple BDP with λ_n = nλ and μ_n = nμ. If a sequence has m nucleotides at time 0 and there are n nucleotides at time t later, the probability of this event is P_m,n(t).

However, an important aspect of biological sequence evolution is conservation of the structure and biophysical properties of proteins that result from transcription and translation of DNA sequences. After coding DNA is transcribed into RNA, ribosomes translate 3-nucleotide chunks (codons) of the RNA into a single amino acid residue, that is then joined to the end of a growing protein polymer. Insertions or deletions (indels) in a DNA sequence that result in a shift in this triplet code are called “frame-shift” mutations. It is likely that a frame-shift indel occurring in a protein-coding DNA sequence results in a protein that is prematurely terminated or possesses structural and chemical characteristics unlike the ancestral protein. Insertions or deletions whose length is a multiple of three should be more common. We seek to model this behavior in a novel way: suppose the indel process is a BDP similar in spirit to the one presented by Thorne et al (1991), and the rate of insertion and deletion of nucleotides depends on the number of nucleotides already inserted, modulo (mod) 3:

λ_{n} = {\begin{array}{l} n β_{0} & if n - 1 = 0 & mod 3 \\ n β_{1} & if n - 1 = 1 & mod 3 \\ n β_{2} & if n - 1 = 2 & mod 3 \end{array} and μ_{n} = {\begin{array}{l} n γ_{0} & if n - 1 = 0 & mod 3 \\ n γ_{1} & if n - 1 = 1 & mod 3 \\ n γ_{2} & if n - 1 = 2 & mod 3 \end{array} .

(59)

Here we assume that β₂ > β_0, β₁, and γ₁ > γ_0, γ₂ so that transitions to state n such that n − 1 = 0 mod 3 occur at a faster rate per nucleotide. The linear-periodic nature of these birth and death rates make solution of the orthogonal polynomials and measure corresponding with this BDP difficult. The approximant method of Murphy and O’Donohoe also fails here for large n. However, using our error-controlled method, numerical results are readily available. Figure 7 shows P₁_,n(t) for n = 0,…,50 at various times t. Note that the distribution of the number of inserted bases has peaks at the integers mod three. Finally, it is worth noting that the dearth of tractable BDPs for indel events has been a major deterrent in statistical sequence alignment and we are actively exploring solutions to this problem using our error-controlled method.

Fig. 7 — Frameshift-aware indel model probability of observing n inserted DNA bases, given starting at m = 1. The transition probability P₁*_;n*(t) is shown for t = 5 (solid line), t = 7 (dashed line), t = 9 (dotted line), and t = 11 (dash-dotted line), with parameters β₀ = 0.3, β₁ = 1, β₂ = 4, γ₀ = 2, γ₁ = 0.2, and γ₂ = 0.2.

4 Conclusion

Traditionally the simple BDP with linear rates has dominated modeling applications, since its transition probabilities and other quantities of interest find analytic expressions. However, increasingly sophisticated models in ecology, genetics, and evolution, among other fields, may necessitate more advanced computational methods to handle processes whose birth and death rates do not easily yield analytic solutions. We have demonstrated a flexible method for finding transition probabilities of general BDPs that works for arbitrary sets of birth and death rates {λ_n} and {μ_n}, and does not require additional analytic information. This should prove useful for rapid development and testing of new models in applications. For simple models whose solution is available, we find that our method agrees with known solutions and remains robust for large starting and ending states and long times t. It is our hope that the method presented here will assist researchers in understanding the properties of increasingly rich and realistic models.

Acknowledgments

We are grateful to Ken Lange for helpful comments. This work was supported by National Institutes of Health grants GM086887 and T32GM008185, and National Science Foundation grant DMS0856099. A software implementation of all methods in this paper is available from FWC.

5 Appendix

5.1 Approximant method

Murphy and O’Donohoe (1975) approximate the inverse Laplace transform of (15) by first truncating the continued fraction as a rational approximant through a partial fractions sum. To illustrate the pitfalls of this approach, we derive the inversion expressions presented by Murphy and O’Donohoe and analyze their properties. We provide an example to show that this technique can become numerically unstable. We first seek to uncover the truncation error in the time domain of the transition probabilities. If we truncate the continued fractions (15) at depth k, we have

\begin{array}{l} f_{m, n}^{(k)} (s) = (\prod_{j = n + 1}^{m} μ_{j}) \frac{B_{n}}{B_{m + 1} +} \frac{B_{m} a_{m + 2}}{b_{m + 2} +} \frac{a_{m + 3}}{b_{m + 3} +} \dots \frac{a_{m + k}}{b_{m + k}} for n \leq m, and \\ f_{m, n}^{(k)} (s) = (\prod_{j = m}^{n - 1} λ_{j}) \frac{B_{m}}{B_{n + 1} +} \frac{B_{n} a_{n + 2}}{b_{n + 2} +} \frac{a_{n + 3}}{b_{n + 3} +} \dots \frac{a_{n + k}}{b_{n + k}} for n \geq m . \end{array}

(60)

For concreteness, suppose in what follows that n ≥ m. Note that the denominator of the second equation is simply B_n₊_k. Let $A_{k}^{(n)}$ be the numerator of the continued fraction in the second equation in (60), so

f_{m, n}^{(k)} = (\prod_{j = m}^{n - 1} λ_{j}) \frac{A_{k}^{(n)}}{B_{n + k}},

(61)

where $A_{k}^{(n)}$ satisfies $A_{k}^{(0)} = A_{k}, A_{1}^{(n)} = \prod_{j = 1}^{n + 1} a_{j}$ , and

A_{k}^{(n)} = a_{n + k} A_{k - 2}^{(n)} + b_{n + k} A_{k - 1}^{(n)} .

(62)

Note also that the difference between truncated estimates in the Laplace domain (s) is

\begin{array}{l} \frac{A_{n + k}}{B_{n + k}} - \frac{A_{n}}{B_{n}} = \frac{A_{n + k} B_{n} - A_{n} B_{n + k}}{B_{n + k} B_{n}} \\ = \frac{{(- 1)}^{n} A_{k}^{(n)}}{B_{n + k} B_{n}} . \end{array}

(63)

This yields the generalized determinant formula

A_{n + k} B_{n} - A_{n} B_{n + k} = {(- 1)}^{n} A_{k}^{(n)},

(64)

and at a root s_i of B_n₊_k(s), we have

A_{k}^{(n)} (s_{i}) = {(- 1)}^{n} A_{n + k} (s_{i}) B_{n} (s_{i}) .

(65)

Now if s₁, s_2,…,s_n are the roots of B_n(s), we have, using the previous line and a partial fractions decomposition of (60), the formula for the Laplace transform of the transition probability P_m,n(t), truncated at k,

\begin{array}{l} f_{m, n}^{(k)} (s) = (\prod_{j = m}^{n - 1} λ_{j}) \frac{B_{m} (s) A_{k}^{(n)} (s)}{B_{n + k} (s)} \\ = (\prod_{j = m}^{n - 1} λ_{j}) \frac{B_{m} (s) A_{k}^{(n)} (s)}{\prod_{i = 1}^{n + k} (s - s_{i})} \\ = (\prod_{j = m}^{n - 1} λ_{j}) \sum_{i = 1}^{n + k} \frac{B_{m} (s) B_{n} (s_{i}) A_{n + k} (s_{i})}{\prod_{j \neq i} (s_{j} - s_{i})} (\frac{1}{s - s_{i}}), \end{array}

(66)

since we only require the values of A_n₊_k(s) and B_n(s) at the zeros of B_n₊_k(s). Then inverse transforming, an approximate formula for the transition probability P_m,n(t) is

P_{m, n}^{(k)} (t) \approx (\prod_{j = m}^{n - 1} λ_{j}) \sum_{i = 1}^{n + k} \frac{B_{m} (s_{i}) B_{n} (s_{i}) A_{n + k} (s_{i})}{\prod_{j \neq i} (s_{j} - s_{i})} e^{- s_{i} t} .

(67)

The roots of B_n(s), used in (66) and (67), are often found numerically as follows. Consider the characteristic polynomial det (B̃_n + sI) of the matrix

{\tilde{B}}_{n} = (\begin{matrix} λ_{0} & 1 \\ λ_{0} μ_{1} & λ_{1} + μ_{1} & 1 \\ λ_{1} μ_{2} & λ_{2} + μ_{2} & 1 \\ ⋱ & ⋱ & ⋱ \\ λ_{n - 3} μ_{n - 2} & λ_{n - 2} + μ_{n - 2} & 1 \\ λ_{n - 2} μ_{n - 1} & λ_{n - 1} + μ_{n - 1} \end{matrix}) .

(68)

It is clear that the nth partial denominator B_n(s) = det(B̃_n + sI), and this quantity is zero when −s is an eigenvalue of the matrix B̃_n. Therefore, the negatives of the eigenvalues of B̃_n are the roots of B_n(s). Furthermore, B̃_n can be transformed into a real symmetric matrix via a similarity transform and hence B_n(s) has precisely n roots, all of which are simple, real, and negative. One usually finds these eigenvalues via the QR algorithm or similar numerical techniques (Press 2007). However, the iterative eigendecomposition of (68) generates small errors in the eigenvalues for large n + k. These errors are amplified in the product in the denominator of each summand in (67), resulting in a sum with both positive and negative terms that may be very large. Klar et al (2010) encounter similar instability in this algorithm. Their solution is to find the roots of the terms in the numerator and compute each summand as a product of individual numerators and denominators in an attempt to keep roundoff error in the product from accumulating. So, if z_1,…_,z_n₊_k are the roots of A_n₊_k then (67) becomes

P_{m, n}^{(k)} (t) \approx (\prod_{j = m}^{n - 1} λ_{j}) \sum_{i = 1}^{n + k} B_{m} (s_{i}) B_{n} (s_{i}) (z_{i} - s_{i}) \prod_{j \neq i} (\frac{z_{j} - s_{i}}{s_{j} - s_{i}}) e^{- s_{i} t} .

(69)

This procedure does improve the numerical stability of the computation, but requires two eigendecompositions of possibly large matrices for every evaluation of P_m,n(t), increasing the computational cost and, for large m and n, the roundoff error. In our opinion, it is more advantageous to avoid truncation of the continued fraction (9) at a pre-specified index, and instead evaluate the continued fraction until convergence during numerical inversion. Figure 1 shows how approximant methods fail for large n.

5.2 A power series method

Parthasarathy and Sudhesh (2006a) present exact solutions by transforming continued fractions such as (9) into an equivalent power series. Wall (1948) shows that Jacobi fractions of this type can always be represented by an equivalent power series. However, the small radius of convergence of power series expressions for transition probabilities can limit their usefulness for long times or large birth or death rates. Parthasarathy and Sudhesh show that P_0,_n(t) has a power series representation given by

P_{m, n} (t) = (\prod_{k = 0}^{n - 1} a_{2 k}) \sum_{m = 0}^{\infty} {(- 1)}^{m} A (m, 2 n) \frac{t^{m + n}}{(m + n)!},

(70)

where

A (m, n) = \sum_{i_{1} = 0}^{n} a_{i_{1}} \sum_{i_{2} = 0}^{i_{1} + 1} a_{i_{2}} \sum_{i_{3} = 0}^{i_{2} + 1} a_{i_{3}} \dots \sum_{i_{m} = 0}^{i_{m - 1} + 1} a_{i_{m}},

(71)

with A(0, n) = 1 (Parthasarathy and Sudhesh 2006a,b). Here, a₂_n = λ_n and a₂_n₊₁ = μ_n in the notation used in their papers. This approach is unique because it yields an exact analytic expression for the transition probabilities of a general BDP. However, the radius of convergence of the power series depends on the specified rates, and this radius may be quite small. To illustrate the pitfalls of this approach, consider a_n = (n + 1) λ, corresponding to the BDP with λ_n = (2n + 1) λ and μ_n = 2nλ (Parthasarathy and Sudhesh 2006a, Example 4.6). The power series for the transition probability in this process becomes

P_{0, n} (t) = \sum_{m = 0}^{\infty} {(- 1)}^{m} \frac{(2 n + 2 m)!}{m! n!} \frac{{(λ t / 2)}^{n + m}}{(n + m)!} .

(72)

Then the radius of convergence R of the power series is given by

\begin{array}{l} 1 / R = lim_{m \to \infty} | \frac{(2 n + 2 m + 2)! {(\frac{λ}{2})}^{n + m + 1}}{(m + 1)! n! (n + m + 1)!} \times \frac{m! n! (n + m)!}{(2 n + 2 m)! {(\frac{λ}{2})}^{n + m}} | \\ = lim_{m \to \infty} \frac{(2 m + 2 n + 1) (2 n + 2 m + 2)}{(m + 1) (n + m + 1)} (\frac{λ}{2}) \\ = lim_{m \to \infty} \frac{2 m + 2 n + 1}{m + 1} λ \\ = 2 λ . \end{array}

(73)

And so the series diverges when 2λt > 1. To illustrate the limitations of the power series approach, note that in this process, the transition intensity from 0 to 1 is λ, so the expected first-passage time from 0 to 1 is Inline graphic (T_0,1) = 1/λ. Therefore, we cannot evaluate (72) when t is greater than (T_0,1)=2. If n is much greater than 1, we may be unable to reliably evaluate P_0,_n(t) for times near (T_0,_n).

Contributor Information

Forrest W. Crawford, Email: fcrawford@ucla.edu, Department of Biomathematics, University of California Los Angeles Los Angeles, CA 90095-1766 USA

Marc A. Suchard, Email: msuchard@ucla.edu, Departments of Biomathematics, Biostatistics and Human Genetics, University of California Los Angeles Los Angeles, CA 90095-1766 USA

References

Abate J, Whitt W. The Fourier-series method for inverting transforms of probability distributions. Queueing Syst. 1992a;10:5–87. [Google Scholar]
Abate J, Whitt W. Numerical inversion of probability generating functions. Oper Res Lett. 1992b;12:245–251. [Google Scholar]
Abate J, Whitt W. Numerical inversion of Laplace transforms of probability distributions. ORS J Comput. 1995;7(1):36–43. [Google Scholar]
Abate J, Whitt W. Computing Laplace transforms for numerical inversion via continued fractions. INFORMS J Comput. 1999;11(4):394–405. [Google Scholar]
Allee WC, Emerson AE, Park O. Principles of Animal Ecology. Saunders; Philadelphia: 1949. [Google Scholar]
Bailey NTJ. The Elements of Stochastic Processes with Applications to the Natural Sciences. Wiley; New York: 1964. [Google Scholar]
Bankier JD, Leighton W. Numerical continued fractions. Am J Math. 1942;64(1):653–668. [Google Scholar]
Blanch G. Numerical evaluation of continued fractions. SIAM Rev. 1964;6(4):383–421. [Google Scholar]
Bordes G, Roehner B. Application of stieltjes theory for s-fractions to birth and death processes. Adv Appl Probab. 1983;15(3):507–530. [Google Scholar]
Craviotto C, Jones WB, Thron WJ. A survey of truncation error analysis for Padé and continued fraction approximants. Acta Appl Math. 1993;33:211–272. [Google Scholar]
Cuyt A, Petersen V, Verdonk B, Waadeland H, Jones W. Handbook of Continued Fractions for Special Functions. Springer; Berlin/Heidelberg: 2008. [Google Scholar]
Dennis B. Allee effects in stochastic populations. Oikos. 2002;96(3):389–401. [Google Scholar]
Donnelly P. The transient behaviour of the Moran model in population genetics. Math Proc Cambridge. 1984;95(02):349–358. [Google Scholar]
Feller W. Wiley series in probability and mathematical statistics. Wiley; New York: 1971. An Introduction to Probability Theory and its Applications. [Google Scholar]
Flajolet P, Guillemin F. The formal theory of birth-and-death processes, lattice path combinatorics and continued fractions. Adv Appl Probab. 2000;32(3):750–778. [Google Scholar]
Grassmann W. Transient solutions in Markovian queues : An algorithm for finding them and determining their waiting-time distributions. Eur J Oper Res. 1977a;1(6):396–402. [Google Scholar]
Grassmann WK. Transient solutions in Markovian queueing systems. Comput Oper Res. 1977b;4(1):47–53. [Google Scholar]
Guillemin F, Pinchon D. Excursions of birth and death processes, orthogonal polynomials, and continued fractions. J Appl Probab. 1999;36(3):752–770. [Google Scholar]
Ismail MEH, Letessier J, Valent G. Linear birth and death models and associated La-guerre and Meixner polynomials. J Approx Theory. 1988;55(3):337–348. [Google Scholar]
Karlin S, McGregor J. The classification of birth and death processes. Trans Am Math Soc. 1957a;86(2):366–400. [Google Scholar]
Karlin S, McGregor J. The differential equations of birth-and-death processes, and the Stieltjes moment problem. Trans Am Math Soc. 1957b;85(2):589–646. [Google Scholar]
Karlin S, McGregor J. Linear growth, birth and death processes. J Math Mech. 1958a;7(4):643–662. [Google Scholar]
Karlin S, McGregor J. Many server queueing processes with Poisson input and exponential service times. Pacific J Math. 1958b;8(1):87–118. [Google Scholar]
Karlin S, McGregor J. On a genetics model of Moran. Math Proc Cambridge. 1962;58(02):299–311. [Google Scholar]
Kendall DG. On the generalized “birth-and-death” process. Ann Math Stat. 1948;19(1):1–15. [Google Scholar]
Kingman JFC. The coalescent. Stat Proc Appl. 1982a;13(3):235–248. [Google Scholar]
Kingman JFC. On the genealogy of large populations. J Appl Probab. 1982b;19:27–43. [Google Scholar]
Klar B, Parthasarathy PR, Henze N. Zipf and Lerch limit of birth and death processes. Probab Eng Inform Sc. 2010;24(01):129–144. [Google Scholar]
Krone SM, Neuhauser C. Ancestral processes with selection. Theor Popul Biol. 1997;51:210–237. doi: 10.1006/tpbi.1997.1299. [DOI] [PubMed] [Google Scholar]
Lentz WJ. Generating Bessel functions in Mie scattering calculations using continued fractions. Appl Opt. 1976;15(3):668–671. doi: 10.1364/AO.15.000668. [DOI] [PubMed] [Google Scholar]
Levin D. Development of non-linear transformations for improving convergence of sequences. Int J Comput Math. 1973;3(B):371–388. [Google Scholar]
Lorentzen L, Waadeland H. Continued Fractions with Applications. North-Holland, Amsterdam: 1992. [Google Scholar]
Mederer M. Transient solutions of Markov processes and generalized continued fractions. IMA J Appl Math. 2003;68(1):99–118. [Google Scholar]
Mohanty S, Montazer-Haghighi A, Trueblood R. On the transient behavior of a finite birth-death process with an application. Comput Oper Res. 1993;20(3):239–248. [Google Scholar]
Moran PAP. Random processes in genetics. Math Proc Cambridge. 1958;54(01):60–71. [Google Scholar]
Murphy JA, O’Donohoe MR. Some properties of continued fractions with applications in Markov processes. IMA J Appl Math. 1975;16(1):57–71. [Google Scholar]
Novozhilov AS, Karev GP, Koonin EV. Biological applications of the theory of birth-and-death processes. Brief Bioinform. 2006;7(1):70–85. doi: 10.1093/bib/bbk006. [DOI] [PubMed] [Google Scholar]
Numerical Recipes Software. Derivation of the Levin transformation. Numerical recipes webnote No 6 URL. 2007 http://www.nr.com/webnotes?6.
Parthasarathy PR, Sudhesh R. Exact transient solution of a state-dependent birth-death process. J Appl Math Stoch Anal. 2006a;82(6):1–16. [Google Scholar]
Parthasarathy PR, Sudhesh R. A formula for the coefficients of orthogonal polynomials from the three-term recurrence relations. Appl Math Lett. 2006b;19(10):1083–1089. [Google Scholar]
Press WH. Numerical Recipes: the Art of Scientific Computing. Cambridge University Press; New York: 2007. [Google Scholar]
Renshaw E. Cambridge Studies in Mathematical Biology. Cambridge University Press; 1993. Modelling Biological Populations in Space and Time. [Google Scholar]
Rosenlund SI. Transition probabilities for a truncated birth-death process. Scand J Stat. 1978;5(2):119–122. [Google Scholar]
Sharma OP, Dass J. Multi-server Markovian queue with finite waiting space. Sankhya Ser B. 1988;50(3):428–431. [Google Scholar]
Tan WY, Piantadosi S. On stochastic growth processes with application to stochastic logistic growth. Stat Sinica. 1991;1:527–540. [Google Scholar]
Taylor H, Karlin S. An Introduction to Stochastic Modeling. Academic Press; San Diego: 1998. [Google Scholar]
Thompson IJ, Barnett AR. Coulomb and Bessel functions of complex arguments and order. J Comput Phys. 1986;64:490–509. [Google Scholar]
Thorne J, Kishino H, Felsenstein J. An evolutionary model for maximum likelihood alignment of DNA sequences. J Mol Evol. 1991;33(2):114–124. doi: 10.1007/BF02193625. [DOI] [PubMed] [Google Scholar]
Wall HS. University Series in Higher Mathematics, D. Van Nostrand Company, Inc; New York: 1948. Analytic Theory of Continued Fractions. [Google Scholar]
Wallis J. Oxoniae e Theatro Shedoniano. Vol. 1. Georg Olms Verlag; Hildeshein, New York: 1695. Opera Mathematica. 1972. [Google Scholar]
Yule A mathematical theory of evolution, based on the conclusions of Dr. J. C. Willis, F.R.S. Philos T R Soc Lon B. 1925;213:21–87. [Google Scholar]

[R1] Abate J, Whitt W. The Fourier-series method for inverting transforms of probability distributions. Queueing Syst. 1992a;10:5–87. [Google Scholar]

[R2] Abate J, Whitt W. Numerical inversion of probability generating functions. Oper Res Lett. 1992b;12:245–251. [Google Scholar]

[R3] Abate J, Whitt W. Numerical inversion of Laplace transforms of probability distributions. ORS J Comput. 1995;7(1):36–43. [Google Scholar]

[R4] Abate J, Whitt W. Computing Laplace transforms for numerical inversion via continued fractions. INFORMS J Comput. 1999;11(4):394–405. [Google Scholar]

[R5] Allee WC, Emerson AE, Park O. Principles of Animal Ecology. Saunders; Philadelphia: 1949. [Google Scholar]

[R6] Bailey NTJ. The Elements of Stochastic Processes with Applications to the Natural Sciences. Wiley; New York: 1964. [Google Scholar]

[R7] Bankier JD, Leighton W. Numerical continued fractions. Am J Math. 1942;64(1):653–668. [Google Scholar]

[R8] Blanch G. Numerical evaluation of continued fractions. SIAM Rev. 1964;6(4):383–421. [Google Scholar]

[R9] Bordes G, Roehner B. Application of stieltjes theory for s-fractions to birth and death processes. Adv Appl Probab. 1983;15(3):507–530. [Google Scholar]

[R10] Craviotto C, Jones WB, Thron WJ. A survey of truncation error analysis for Padé and continued fraction approximants. Acta Appl Math. 1993;33:211–272. [Google Scholar]

[R11] Cuyt A, Petersen V, Verdonk B, Waadeland H, Jones W. Handbook of Continued Fractions for Special Functions. Springer; Berlin/Heidelberg: 2008. [Google Scholar]

[R12] Dennis B. Allee effects in stochastic populations. Oikos. 2002;96(3):389–401. [Google Scholar]

[R13] Donnelly P. The transient behaviour of the Moran model in population genetics. Math Proc Cambridge. 1984;95(02):349–358. [Google Scholar]

[R14] Feller W. Wiley series in probability and mathematical statistics. Wiley; New York: 1971. An Introduction to Probability Theory and its Applications. [Google Scholar]

[R15] Flajolet P, Guillemin F. The formal theory of birth-and-death processes, lattice path combinatorics and continued fractions. Adv Appl Probab. 2000;32(3):750–778. [Google Scholar]

[R16] Grassmann W. Transient solutions in Markovian queues : An algorithm for finding them and determining their waiting-time distributions. Eur J Oper Res. 1977a;1(6):396–402. [Google Scholar]

[R17] Grassmann WK. Transient solutions in Markovian queueing systems. Comput Oper Res. 1977b;4(1):47–53. [Google Scholar]

[R18] Guillemin F, Pinchon D. Excursions of birth and death processes, orthogonal polynomials, and continued fractions. J Appl Probab. 1999;36(3):752–770. [Google Scholar]

[R19] Ismail MEH, Letessier J, Valent G. Linear birth and death models and associated La-guerre and Meixner polynomials. J Approx Theory. 1988;55(3):337–348. [Google Scholar]

[R20] Karlin S, McGregor J. The classification of birth and death processes. Trans Am Math Soc. 1957a;86(2):366–400. [Google Scholar]

[R21] Karlin S, McGregor J. The differential equations of birth-and-death processes, and the Stieltjes moment problem. Trans Am Math Soc. 1957b;85(2):589–646. [Google Scholar]

[R22] Karlin S, McGregor J. Linear growth, birth and death processes. J Math Mech. 1958a;7(4):643–662. [Google Scholar]

[R23] Karlin S, McGregor J. Many server queueing processes with Poisson input and exponential service times. Pacific J Math. 1958b;8(1):87–118. [Google Scholar]

[R24] Karlin S, McGregor J. On a genetics model of Moran. Math Proc Cambridge. 1962;58(02):299–311. [Google Scholar]

[R25] Kendall DG. On the generalized “birth-and-death” process. Ann Math Stat. 1948;19(1):1–15. [Google Scholar]

[R26] Kingman JFC. The coalescent. Stat Proc Appl. 1982a;13(3):235–248. [Google Scholar]

[R27] Kingman JFC. On the genealogy of large populations. J Appl Probab. 1982b;19:27–43. [Google Scholar]

[R28] Klar B, Parthasarathy PR, Henze N. Zipf and Lerch limit of birth and death processes. Probab Eng Inform Sc. 2010;24(01):129–144. [Google Scholar]

[R29] Krone SM, Neuhauser C. Ancestral processes with selection. Theor Popul Biol. 1997;51:210–237. doi: 10.1006/tpbi.1997.1299. [DOI] [PubMed] [Google Scholar]

[R30] Lentz WJ. Generating Bessel functions in Mie scattering calculations using continued fractions. Appl Opt. 1976;15(3):668–671. doi: 10.1364/AO.15.000668. [DOI] [PubMed] [Google Scholar]

[R31] Levin D. Development of non-linear transformations for improving convergence of sequences. Int J Comput Math. 1973;3(B):371–388. [Google Scholar]

[R32] Lorentzen L, Waadeland H. Continued Fractions with Applications. North-Holland, Amsterdam: 1992. [Google Scholar]

[R33] Mederer M. Transient solutions of Markov processes and generalized continued fractions. IMA J Appl Math. 2003;68(1):99–118. [Google Scholar]

[R34] Mohanty S, Montazer-Haghighi A, Trueblood R. On the transient behavior of a finite birth-death process with an application. Comput Oper Res. 1993;20(3):239–248. [Google Scholar]

[R35] Moran PAP. Random processes in genetics. Math Proc Cambridge. 1958;54(01):60–71. [Google Scholar]

[R36] Murphy JA, O’Donohoe MR. Some properties of continued fractions with applications in Markov processes. IMA J Appl Math. 1975;16(1):57–71. [Google Scholar]

[R37] Novozhilov AS, Karev GP, Koonin EV. Biological applications of the theory of birth-and-death processes. Brief Bioinform. 2006;7(1):70–85. doi: 10.1093/bib/bbk006. [DOI] [PubMed] [Google Scholar]

[R38] Numerical Recipes Software. Derivation of the Levin transformation. Numerical recipes webnote No 6 URL. 2007 http://www.nr.com/webnotes?6.

[R39] Parthasarathy PR, Sudhesh R. Exact transient solution of a state-dependent birth-death process. J Appl Math Stoch Anal. 2006a;82(6):1–16. [Google Scholar]

[R40] Parthasarathy PR, Sudhesh R. A formula for the coefficients of orthogonal polynomials from the three-term recurrence relations. Appl Math Lett. 2006b;19(10):1083–1089. [Google Scholar]

[R41] Press WH. Numerical Recipes: the Art of Scientific Computing. Cambridge University Press; New York: 2007. [Google Scholar]

[R42] Renshaw E. Cambridge Studies in Mathematical Biology. Cambridge University Press; 1993. Modelling Biological Populations in Space and Time. [Google Scholar]

[R43] Rosenlund SI. Transition probabilities for a truncated birth-death process. Scand J Stat. 1978;5(2):119–122. [Google Scholar]

[R44] Sharma OP, Dass J. Multi-server Markovian queue with finite waiting space. Sankhya Ser B. 1988;50(3):428–431. [Google Scholar]

[R45] Tan WY, Piantadosi S. On stochastic growth processes with application to stochastic logistic growth. Stat Sinica. 1991;1:527–540. [Google Scholar]

[R46] Taylor H, Karlin S. An Introduction to Stochastic Modeling. Academic Press; San Diego: 1998. [Google Scholar]

[R47] Thompson IJ, Barnett AR. Coulomb and Bessel functions of complex arguments and order. J Comput Phys. 1986;64:490–509. [Google Scholar]

[R48] Thorne J, Kishino H, Felsenstein J. An evolutionary model for maximum likelihood alignment of DNA sequences. J Mol Evol. 1991;33(2):114–124. doi: 10.1007/BF02193625. [DOI] [PubMed] [Google Scholar]

[R49] Wall HS. University Series in Higher Mathematics, D. Van Nostrand Company, Inc; New York: 1948. Analytic Theory of Continued Fractions. [Google Scholar]

[R50] Wallis J. Oxoniae e Theatro Shedoniano. Vol. 1. Georg Olms Verlag; Hildeshein, New York: 1695. Opera Mathematica. 1972. [Google Scholar]

[R51] Yule A mathematical theory of evolution, based on the conclusions of Dr. J. C. Willis, F.R.S. Philos T R Soc Lon B. 1925;213:21–87. [Google Scholar]

PERMALINK

Transition probabilities for general birth-death processes with applications in ecology, genetics, and evolution

Forrest W Crawford

Marc A Suchard

Abstract

1 Introduction

2 Transition probabilities

2.1 Background

2.2 Continued fraction representation of Laplace transform

Lemma 1

Lemma 2

Theorem 1

Proof

2.3 Obtaining transition probabilities

2.4 Numerical considerations

2.5 Numerical results

Fig. 1.

3 Applications

3.1 Immigration and emigration

Fig. 2.

3.2 Logistic growth with Allee effects

Fig. 3.

Fig. 4.

3.3 Moran models with mutation and selection

Fig. 5.

Fig. 6.

3.4 A frameshift-aware indel model

Fig. 7.

4 Conclusion

Acknowledgments

5 Appendix

5.1 Approximant method

5.2 A power series method

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Transition probabilities for general birth-death processes with applications in ecology, genetics, and evolution

Forrest W Crawford

Marc A Suchard

Abstract

1 Introduction

2 Transition probabilities

2.1 Background

2.2 Continued fraction representation of Laplace transform

Lemma 1

Lemma 2

Theorem 1

Proof

2.3 Obtaining transition probabilities

2.4 Numerical considerations

2.5 Numerical results

Fig. 1.

3 Applications

3.1 Immigration and emigration

Fig. 2.

3.2 Logistic growth with Allee effects

Fig. 3.

Fig. 4.

3.3 Moran models with mutation and selection

Fig. 5.

Fig. 6.

3.4 A frameshift-aware indel model

Fig. 7.

4 Conclusion

Acknowledgments

5 Appendix

5.1 Approximant method

5.2 A power series method

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases