Abstract
Some mathematical methods for formulation and numerical simulation of stochastic epidemic models are presented. Specifically, models are formulated for continuous-time Markov chains and stochastic differential equations. Some well-known examples are used for illustration such as an SIR epidemic model and a host-vector malaria model. Analytical methods for approximating the probability of a disease outbreak are also discussed.
Keywords: Branching process, Continuous-time Markov chain, Minor outbreak, Stochastic differential equation
2000 MSC: 60H10, 60J28, 92D30
1. Introduction
The intent of this primer is to provide a brief introduction to the formulation, numerical simulation, and analysis of stochastic epidemic models for a newcomer to this field. A background in modeling with ordinary differential equations (ODEs) is assumed. The ODE epidemic models serve as a framework for formulating analogous stochastic models and as a source of comparison with the stochastic models. This primer is restricted to two types of stochastic settings, continuous-time Markov chains (CTMCs) and stochastic differential equations (SDEs). Some well-known examples are used for illustration such as an SIR epidemic model and a host-vector malaria model. For additional examples and information on stochastic epidemic models and stochastic modeling in general, consult the textbooks and papers listed in the references, e.g., (Allen, 2007, Allen, 2008, Allen, 2010, Allen, 2015, Allen et al., 2008, Andersson and Britton, 2000, Bailey, 1975, Britton, 2010, Daley and Gani, 1999, Durrett, 1999, Greenwood et al., 2009, Isham et al., 2005, Jagers, 1975, Karlin and Taylor, 1975, Karlin and Taylor, 1981).
Stochastic modeling of epidemics is important when the number of infectious individuals is small or when the variability in transmission, recovery, births, deaths, or the environment impacts the epidemic outcome. The variability associated with individual dynamics such as transmission, recovery, births or deaths is often referred to as demographic variability. The variability associated with the environment such as conditions related to terrestrial or aquatic settings is referred to as environmental variability. Environmental variability is especially important in modeling zoonotic infectious diseases, vector-borne diseases, and waterborne diseases (e.g., Ebola, avian influenza, malaria, and cholera) (Altizer et al., 2013, Jutla et al., 2013, Wu et al., 2016). In this primer, the emphasis is on demographic variability.
In CTMCs and SDEs, the time variable is continuous, , but the state variables are either discrete (CTMC) or continuous (SDEs). In the following sections, these two stochastic processes are formulated for the well-known SIR (Susceptible-Infectious-Recovered) epidemic model and the Ross malaria host-vector model. The Gillespie algorithm and the Euler-Maruyama numerical method are described for the two types of stochastic processes. In addition, some analytical methods from branching processes that are related to the CTMC models are used to approximate the probability of an outbreak. In the last section, some stochastic methods for modeling environmental variability are presented.
2. SIR deterministic epidemic model
In the SIR deterministic model, , , and are the number of susceptible, infectious, and recovered individuals, respectively. In the simplest model, there are no births and deaths, only infection and recovery:
(1) |
where the total population size is constant, . The disease-free equilibrium is and . The basic reproduction number which is equal to the ratio of the transmission rate β and the recovery rate γ, determines the epidemic outcome when . If and , then the number of infectious individuals increases, an outbreak, and if , the number of infectious individuals decrease. As , system (1) can be simplified to two equations for and
The stochastic formulation of the CTMC and SDE models requires defining two random variables for S and I whose dynamics depend on the probabilities of the two events: infection and recovery. For simplicity, the same notation is used in the stochastic and the deterministic formulations.
3. SIR continuous time Markov chain
3.1. Formulation
The discrete random variables for the SIR CTMC model satisfy
where . The lower case s and i denote the values of the discrete random variables from the set . The transition probabilities associated with the stochastic process are defined for a small period of time :
The transition probabilities depend on the time between events but not on the specific time t, a time-homogeneous process. In addition, given the current state of the process at time t, the future state of the process at time , for any , does not depend on times prior to t, known as the Markov property. For comparison purposes, the transition probabilities are defined in terms of the rates in the SIR ODE model:
(2) |
Summarized in Table 1 are the changes, and , associated with the two events, infection and recovery.
Table 1.
Event | Change | Probability |
---|---|---|
Infection | ||
Recovery |
Given and , the epidemic ends at time t, when . The states , where are referred to as absorbing states; the epidemic stops when an absorbing state is reached. The absorbing states are the states with .
3.2. Kolmogorov differential equations
Differential equations for the transition probabilities can be derived from (2). These differential equations are often referred to as the forward or the backward Kolmogorov differential equations. The forward equations, also referred to as the “master equations”, are used to predict the future dynamics, whereas the backward equations are used to study the end of the epidemic, such as estimating the probability of reaching an absorbing state.
Note that there are ordered pairs of states , e.g., , where . The general form of the Kolmogorov differential equations can be expressed in a simple form, if the transition probabilities are denoted as , where a and b are two ordered pairs from the set of ordered pairs. The general form for the forward Kolmogorov differential equations are
(3) |
and the backward Kolmogorov differential equations are
(4) |
where the values of , , and are defined from the transition rates in Equation (2).
In the forward equations, the transition rates depend on the future state . If a is any state, for the process to be in state at time , one of the following events occurs: (1) the process transitions from a to in time t and an infection occurs with transition probability , or (2) the process transitions from a to in time t and a recovery occurs with transition probability , or (3) the process transitions from a to in time t and no change occurs with probability . That is,
where all terms of order are included in the last term . Subtracting from both sides, dividing by , and letting , leads to the forward Kolmogorov differential equations:
A similar derivation applies to the backward equations. However, the transition rates depend on the initial state . A transition occurs in time , either infection, recovery or no change, and in the remaining time t, there is a transition to state b. The backward equations (4) are
For a more thorough derivation of these equations, consult the references, e.g., (Allen, 2010, Karlin and Taylor, 1975, Ross, 2014).
If the ordered pairs are labeled in a specific order from to , then a matrix of transition rates Q can be defined. Matrix Q has dimension and is known as the infinitesimal generator matrix. Matrix Q is straightforward to define for a single random variable whose states are already linearly ordered from 0 to N (e.g., (Allen, 2010, Karlin and Taylor, 1975, Karlin and Taylor, 1981, Ross, 2014)). But for a bivariate process with states the form of matrix Q depends on how the set of ordered pairs are linearly ordered. In general, matrix Q has negative diagonal entries and nonnegative off-diagonal entries. In addition, matrix Q has the property that the row sums are zero. It follows from Equations (3), (B.1), (B.2), (B.3), (B.4), (C.1), (4) that the forward and the backward Kolmogorov differential equations can be written as systems of matrix differential equations, and , respectively, where is the matrix of transition probabilities. Matrix has diagonal entries , where a is one of the linearly ordered states. The formal solution of these equations is , where is the identity matrix. Note that sometimes the transition probabilities are written in the reverse order, . In this case, the transpose of these equations is applied with matrix instead of Q (Allen, 2008, Allen, 2010).
3.3. Branching process approximation
In this brief introduction, we study the stochastic behavior near the disease-free equilibrium to determine whether an epidemic (major outbreak) occurs when a few infectious individuals are introduced into the population. The probability of no major outbreak (a minor outbreak) for the CTMC model near the disease-free equilibrium is approximated by applying branching process theory and techniques from probability generating functions (pgfs). Most important is the fact that when I hits zero, it stays in state zero. The state I has reached an absorbing state and disease transmission stops. In the limit, of course, the infectious individuals in the stochastic and deterministic models always approach zero, as . But finite time extinction of I occurs in the stochastic model. We are interested in the stochastic dynamics at the initiation of an epidemic, when almost everyone in the population is susceptible. (The duration of the epidemic, once initiated, is another interesting stochastic problem, not considered here, see e.g., (Barbour, 1975, Daley and Gani, 1999)).
The branching process is the linear approximation of the SIR stochastic process near the disease-free equilibrium. For a few initial infectious individuals, the branching process either grows exponentially or hits zero. These two phenomena are captured in the branching process approximation of the CTMC model near the disease-free equilibrium. If the number of infectious individuals increases substantially to a large number of cases, then there is a major outbreak. However, if there are only a few additional cases, above the initial number of cases, then there is a minor outbreak. The branching process is a good approximation of the CTMC model, if the susceptible population size is sufficiently large. Then the two outcomes, either a major or minor outbreak, are clearly distinguishable.
The branching process is a birth and death process for I; the variables S and R are not considered in this approximation. The term is the infection rate (birth) and is the recovery rate (death). The process begins with just a few infectious individuals. The branching process approximation is a CTMC, but near the disease-free equilibrium, the rates are linear (Table 2).
Table 2.
Change | Probability |
---|---|
Three important assumptions underlie the branching process approximation:
-
(1)
Each infectious individual behavior is independent from other infectious individuals.
-
(2)
Each infectious individual has the same probability of recovery and the same probability of transmitting an infection.
-
(3)
The susceptible population is sufficiently large.
Assumption (1) is reasonable if a small number of infectious individuals is introduced into a large homogeneously-mixed population (assumption (3)). Assumption (2) is also reasonable in a homogeneously-mixed population with constant transmission and recovery rates, β and γ.
Two probability generating functions (pgfs) are used in the study of the probability of extinction. The first one applies to each infectious individual, known as the offspring pgf, and the second one applies to the entire infectious class at time t. For our purposes, the offspring pgf is the most important one. In general, an offspring pgf has the form:
where is the probability of one individual generating j new individuals of the same type, e.g., one infectious individual generates j infectious individuals. The pgf has some properties that are useful in the analysis. For example, and the “mean” number of offspring generated from one individual is defined as
The offspring pgf for one infectious individual, , is defined from Table 2:
(5) |
The first term in is the probability that an infectious individual recovers and the coefficient of the second term is the probability that an infectious individual infects another individual. The power to which u is raised is the number of infectious individuals generated from one infectious individual. If an individual recovers, then no new infections are generated and if the infection is transmitted to another individual, there are now two individuals infectious . This offspring pgf differs from a discrete-time branching process, where the “parent” dies and is replaced by the offspring in the next generation. The difference is due to the fact that in a small period of time, in a continuous-time process, the infectious individual that infects another person is counted as still being infectious (two infectious individuals). This infectious individual has the same probability of infecting another individual and the same probability of recovery, .
The offspring pgf (5) satisfies and the mean value is
This latter expression is not the same as the basic reproduction number, the average number of infectious individuals generated by one infectious individual during the period of infectivity. However, is a threshold parameter, similar to . In particular, if and only if .
It is well-known from the theory of branching processes that a fixed point of the offspring pgf yields the asymptotic probability of extinction (Athreya and Ney, 1972, Dorman et al., 2004, Harris, 1963, Jagers, 1975). It is shown in Appendix B that the fixed points of f are the stationary solutions (time-independent solutions) of the branching process approximation for the probability of extinction of the infectious class . Solving for the fixed points of f in (5), for , yields two solutions, namely, , and (if ). It is shown in Appendix B that if , then the fixed point is stable and if , the fixed point is stable. When , the assumption of independence of infectious individuals, implies the probability of no outbreak is either 1 or .
The preceding results were first applied to the stochastic SIR epidemic model by Whittle in 1955 (Whittle, 1955). More precisely, he used the terminology, “the probability of a minor outbreak” is and “the probability of a major outbreak” is . As these estimates for extinction (no outbreak) are asymptotic approximations from the branching process, they are more accurate for a large susceptible population size N and a few infectious individuals. The results are summarized below:
3.4. Numerical simulation
In general, for multivariate processes, it is difficult to find analytical solutions for the transition probabilities from the forward and backward Kolmogorov differential equations. For multivariate processes, it is often simpler to numerically simulate stochastic realizations (sample paths) of the process. A numerical method for simulation of CTMC models was developed by Gillespie (Gillespie, 1977). This method is known as the Gillespie algorithm or the Stochastic Simulation algorithm. To numerically simulate the change in state, two uniform random numbers, , are required for each change, one for the interevent time and a second one for the particular event.
The Markov property implies that the interevent time T has an exponential distribution , where the parameter λ is the sum of the rates for all possible events. For the SIR CTMC model,
where is the particular value for the state of at a given time t. To compute a value for the interevent time τ from the exponential distribution, the first uniform random number yields
(6) |
(See Appendix A for the derivation of the interevent time.) The second random number tells which particular event occurs. In general, given n events, the interval is subdivided according to the probability of each event, , . If lies in the kth subinterval, then the kth event occurs. For the SIR CTMC model, there are only two events with corresponding probabilities
If , an absorbing state has been reached and the process stops.
Four sample paths of the SIR CTMC model are plotted in Fig. 1 for (top two graphs) and (bottom two graphs). The close-up of the dynamics on the right side illustrates exponential growth at the initiation of an outbreak, the region where the branching process approximation is applicable.
4. SIR stochastic differential equations
4.1. Formulation
Stochastic differential equations for the SIR epidemic model follow from a diffusion process. The random variables are continuous,
Forward and backward Kolmogorov partial differential equations for the transition probability density functions can be derived and they in turn lead directly to the SDEs, e.g., (Allen, 2007, Allen, 2008, Allen et al., 2008, Kurtz, 1970, Kurtz, 1971, Kurtz, 1972). The SDEs are useful in simulating sample paths of the continuous-state process. In addition, the SDEs are much easier to solve numerically than the Kolmogorov differential equations and faster than simulating sample paths of the CTMC model. A heuristic derivation of the SDEs corresponding to the SIR epidemic model is described below.
Divide the time interval into small subintervals of length . Let . Subdivide further into smaller subintervals of length , with , , and .
For sufficiently small, it is reasonable to assume that the random variables on the interval are independent and identically distributed. For n sufficiently large, the Central Limit Theorem implies that has an approximate normal distribution with mean and covariance matrix , e.g., (Allen, 2007, Allen et al., 2008, Kurtz, 1970, Kurtz, 1971, Kurtz, 1972). Thus,
where 0 is the zero vector. The expectation of to order is the change that occurs ( or ) times the probability:
and the covariance matrix of to order is
To write the SDEs for the SIR stochastic process, either the square root of the covariance matrix is required, or alternately, a matrix G so that (Allen, 2007, Allen et al., 2008). The following matrix G has this latter property (G is not unique) (Allen, 2007, Allen et al., 2008). Matrix G is straightforward to compute as each column represents the square root of the rates as given in Table 1,
Then , where and . Letting , leads to the following system of SDEs:
where is a vector of two independent Wiener processes. That is, is a normally distributed random variable with mean zero and variance t or . The notation is also employed, where denotes Brownian motion, e.g., (Oksendal, 2000). This stochastic differential equation is known as an Itô SDE because the right side is evaluated at time t (Allen, 2007, Allen et al., 2008).
Rewriting the expression in terms of the random variables and leads to the following system of Itô SDEs:
(7) |
Note that if , then the epidemic stops. The Itô SDEs reduce to the original ODE model (1), if the two Wiener processes are neglected.
4.2. Numerical simulation
The Euler-Maruyama method is a simple numerical method that can be used to simulate sample paths of SDEs. The Euler-Maruyama method is of order and follows directly from the derivation of the SDE, where . In general, for a system of SDEs of the form,
the Euler-Maruyama method is a finite-difference approximation,
, where is chosen sufficiently small to ensure convergence. For k independent Wiener processes , the vector is k independent standard normal random numbers . Sample paths of the SIR SDE model are illustrated in Fig. 2. The sample paths are continuous, but not differentiable (a property of the Wiener process).
Numerical simulation of sample paths for SDE models is faster and simpler than computing sample paths for CTMC models when the population size is large. The time step in SDE models is chosen small but it has a fixed length, whereas in CTMC models the interevent time τ must be computed for each change and the interevent time decreases as the population size increases (see Equation (6)). Numerical methods with greater accuracy than the Euler-Maruyama method are discussed in the references, e.g., (Kloeden and Platen, 1992, Kloeden et al., 1997). In addition, methods have been developed to speed up the stochastic simulation (see, e.g., StochSS: Stochastic Simulation Service, www.StochSS.org, Petzold, UC Santa Barbara).
5. Malaria deterministic model
Malaria infection is caused by a Plasmodium parasite. An infectious mosquito transmits the parasite to a susceptible host through a mosquito bite. It is the female mosquito that bites the host to acquire blood for reproduction. According to the website of the World Health Organization (World Health Organization, 2015) there were approximately 214 million new cases of malaria and 438,000 deaths worldwide in 2015. Most cases were reported in the African region. Sir Ronald Ross was one of the first scientists to formulate mathematical models for the spread of malaria between insect vectors and human hosts (Ross, 1909, Ross, 1911, Smith et al., 2012) and for his work on malaria he was awarded the Nobel Prize in Physiology or Medicine in 1902.
In the Ross malaria model, the total number of vectors M and hosts H are constant. The variables S and I are the number of susceptible and infectious hosts, respectively, and U and V are the number of healthy and infectious vectors, respectively. A female mosquito requires a certain number of blood meals for reproduction and it is assumed that a single mosquito takes k bites per unit time to fulfill this blood requirement. Another important assumption is that the total number of bites by the mosquito population is dependent on the total number of mosquitoes but it is not dependent on the number of human hosts (only the proportion of human hosts). The probability per bite that an infectious mosquito transmits malaria is p and the probability per bite that a healthy mosquito acquires infection is q. Parameter is the transmission rate from an infectious mosquito to a human and is the acquisition rate from an infectious host to a healthy mosquito. The host recovery rate is γ and the mosquito death rate is μ. The birth rate and death rate of the mosquito population are equal. The natural birth and death rates of humans are negligible with respect to the modeling time frame and are assumed to be zero. With the preceding assumptions, the malaria model takes the form:
(8) |
The disease-free equilibrium for this model is , and . Linearization of the differential equations for I and V about the disease-free equilibrium yields the matrices from the next generation matrix approach (van den Driessche & Watmough, 2002):
(9) |
The spectral radius of is often defined as the basic reproduction number (van den Driessche & Watmough, 2002):
An equivalent form in terms of the threshold of one, is defined as the product of the transmission from vector to host and from host to vector:
(10) |
(e.g., (Heffernan et al., 2005, Lloyd et al., 2007)). This latter expression is used in the following discussion of the basic reproduction number. For model (8), the disease dies out if and a stable endemic equilibrium exists if .
6. Malaria continuous time Markov chain
6.1. Formulation
The malaria CTMC model is a time-homogeneous process with the Markov property. There are six events corresponding to transmission from host to vector and vector to host, recovery of humans, death of healthy and infectious mosquitoes, and birth of mosquitoes. Table 3 is a summary of these events and their probabilities. In the malaria CTMC model, given , the exponential distribution for the interevent time has parameter
Table 3.
Probability | Probability | ||
---|---|---|---|
A sample path of the CTMC model is graphed in Fig. 3. A stable endemic equilibrium exists in the ODE model if and the sample path of the CTMC model fluctuates around this endemic equilibrium.
6.2. Branching process approximation
Approximation of the CTMC near the disease-free equilibrium, leads to a multitype branching process in two variables and . Table 4 is a summary of the changes in I and V near the disease-free equilibrium when and .
Table 4.
Change | Probability |
---|---|
Next, two offspring pgfs are defined, one for a host and one for a mosquito. Each offspring pgf has the general form:
For one infectious host, there are only two events, either recovery of the host or infection of a mosquito. The offspring pgf for I, given and is
(11) |
That is, one infectious host recovers with probability or infects a mosquito with probability Note the term in (11) means one infectious host generates one infectious mosquito ( raised to the power one) and remains infectious ( raised to the power one). The offspring pgf for V, given and , is
(12) |
That is, one infectious mosquito dies with probability or infects a host with probability .
To find the probability of extinction (no outbreak), we compute the fixed points of the system on the unit square , and , e.g., (Allen and van den Driessche, 2013, Athreya and Ney, 1972, Dorman et al., 2004, Harris, 1963). The two solutions of this system are (1,1) and , where
(13) |
(14) |
(e.g., (Allen & Lahodny, 2012)). The fixed point exists in the unit square provided . Equivalent expressions were derived by Bartlett in 1964, but they were not written in terms of (Bartlett, 1964). Another equivalent set of expressions for and were derived by Lloyd et al. assuming geometric offspring pgfs (Lloyd et al., 2007). The expression for in (13) has a biological interpretation. Beginning from one infectious host, there is no outbreak if the infectious host recovers with probability or if there is no successful transmission to a susceptible mosquito with probability .
The stability of the fixed points is determined from the spectral radius of the Jacobian matrix of the offspring pgfs, evaluated at , e.g., (Allen and van den Driessche, 2013, Athreya and Ney, 1972, Dorman et al., 2004, Harris, 1963). If the spectral radius is less than one, then is stable and if it is greater than one, (1,1) is unstable and is stable. (See Appendix C.) The Jacobian matrix of the offspring pgfs evaluated at is
In general, it can be shown that
(15) |
where F and V are computed from the next generation matrix approach, Equation (9), is a diagonal matrix of interevent time parameters, , and I is the identity matrix (Allen & van den Driessche, 2013). In addition, there is an important relation between and matrix :
which follows from the identity (15) (Allen & van den Driessche, 2013).
Therefore, the probability of extinction (no outbreak) is one, if , but less than one, if . Given and , it follows from the independent assumption in the branching process approximation that the probabilities of a major or a minor outbreak are:
Alternate forms for the offspring pgf have been proposed that differ from the assumptions in the underlying SIR CTMC epidemic model (Table 1) or in the host-vector CTMC model (Table 3), e.g., (Antia et al., 2003, Blumberg and Lloyd-Smith, 2013, Lloyd et al., 2007, Lloyd-Smith et al., 2005). Offspring pgfs have been assumed to have geometric, Poisson, gamma, or negative binomial distributions (Antia et al., 2003, Blumberg and Lloyd-Smith, 2013, Lloyd et al., 2007, Lloyd-Smith et al., 2005). But they are often applied in the discrete-time case. The geometric distribution and the forms in (5), (11), (12) preserve the Markov property of an exponentially distributed lifetime (Dorman et al., 2004). Branching process approximations can be extended to more complex epidemic models with multiple groups (Allen and Lahodny, 2012, Allen and van den Driessche, 2013).
7. Malaria stochastic differential equations
A derivation of the SDEs for the host-vector model follows in a similar manner as for the SIR model. For simplicity, denote the change of the system variables as and the right side of the host-vector ODE model as , , i.e., . Then the Itô SDEs for the malaria model are computed from the expectation and covariance of based on the six events in Table 3. The system of SDEs has the form , where G is a matrix satisfying , where to order , is the approximate covariance matrix,
and is a vector of six independent Wiener processes, corresponding to the six events represented in Table 3 (Allen, 2007, Allen et al., 2008). More explicitly, the system of Itô SDEs for the host-vector model is
where the dependence on time t is omitted. For and , the epidemic stops.
The Euler-Maruyama method is used to compute sample paths of the host-vector SDE model (see Fig. 3) and to approximate the probability density for large time (see probability histogram in Fig. 4). The probability density appears to be approximately stationary as solutions fluctuate near the endemic equilibrium of the ODE model. How much fluctuation occurs in the disease dynamics depends on the transmission processes (demographic variability). In a more realistic host-vector model, the disease dynamics are also affected by environmental conditions. Environmental variability is especially important in a vector-borne disease, where the vector life cycle is intimately connected to temperature and moisture levels.
8. Environmental variability
For the SIR epidemic model or the malaria host-vector model, changes in the environment may impact the parameters for birth, death, recovery, or transmission. For example, if birth, death, or transmission rates fluctuate with changes in the environmental conditions, then a stochastic differential equation for the model parameter can be formulated as a mean-reverting process (fluctuation about some average value) (E. Allen, 2016).
For example, a mean-reverting Ornstein-Uhlenbeck (OU) process for the parameter α is modeled by the SDE:
where is the mean value and is proportional to the variance. In particular, for the OU process, has a normal distribution with mean
and variance
(16) |
(Allen, 2016, Oksendal, 2000). In the limit, as , the mean is and variance is . The mean agrees with the solution of the ODE: .
In another example, if the carrying capacity varies both seasonally and randomly, a biologically reasonable model for has the form (Allen, E. Allen, & Jonsson, 2006):
(17) |
The mean of K is
and variance has the same expression as in (16). (See Fig. 5.) The parameter K can also be modeled as a discrete random variable. Other mean-reverting stochastic processes that have the additional desirable property of being nonnegative are discussed in (E. Allen, 2016).
9. Summary
The intent of this primer is to introduce the topic of stochastic epidemic modeling from the perspective of an ODE epidemic framework. The emphasis is on continuous-time Markov chains and stochastic differential equations. Many topics of relevance to stochastic epidemic modeling have been omitted, e.g., epidemic duration, discrete-time Markov chains, stochastic spatial models, and parameter estimation for stochastic models. The references provide a wealth of information on these and other topics in stochastic epidemic modeling.
Footnotes
Peer review under responsibility of KeAi Communications Co., Ltd.
Appendix A. Interevent time in the CTMC model
Let be the cumulative distribution for the interevent time T. Let U denote the uniform distribution on . Properties of U and the fact that F is strictly increasing on , yields
But this implies . Calculation of the inverse function and using the fact that has the same distribution as U leads to the formula for τ in Equation (6):
Appendix B. PGF for the SIR branching approximation
For the SIR CTMC model, the pgf for the infectious class , given , is defined in terms of the transition probabilities, :
(B.1) |
Note that is the probability of disease extinction by time t: , given . From the independent assumptions, it follows that is the product of i generating functions for :
(B.2) |
The backward Kolmogorov differential equations for the branching process approximation of are as follows:
(B.3) |
A differential equation is derived for based on the identities (B.1) and (B.2), and the backward Equation (B.3). Differentiate (B.2) with respect to t, solve for , then replace with an equivalent expression from definition (B.1):
Substitution of from the backward Kolmogorov differential Equation (B.3) leads to
where the offspring pgf f is defined in (5). Since the right side of the differential equation is independent of u, the differential equation also applies to That is,
(B.4) |
From the theory of autonomous differential equations, the steady-states of (B.4) are the stationary solutions of the process: solutions u of . From the offspring pgf in (5), there are two possible steady-state solutions, either or It is easy to check if , then is the stable steady-state and if , then is the stable steady-state. If more than one individual is infectious, , then the asymptotic steady-states are either 1 or .
Appendix C. PGF for the host-vector branching approximation
In a manner similar to the derivation for the stochastic SIR model, differential equations for the probabilities of extinction can be derived for the branching process approximation in the host-vector model. Let be the transition probabilities for and in the branching process approximation. The generating function is defined as
In the branching process approximation, assume
(C.1) |
Below are the backward Kolmogorov differential equations for I and V, where the transition probabilities , , and :
Differentiate the identity in (C.1) with respect to t for two cases: and .
Substitute the backward Kolmogorov differential equations into the right side, apply the identity (C.1), and simplify. This leads to the following differential equations for and :
Since and , the preceding differential equations can be expressed in terms of the two probabilities of extinction. The steady-states of this system are the fixed points of the offspring pgfs (11), (12). The stability of these steady-states is determined by the eigenvalues of the Jacobian matrix . The steady-state (1,1) is stable if the eigenvalues of the Jacobian matrix have negative real part if and only if if and only if (Allen & van den Driessche, 2013).
References
- Allen E. Springer; Dordrecht, The Netherlands: 2007. Modeling with Itô stochastic differential equations. [Google Scholar]
- Allen L.J.S. An introduction to stochastic epidemic models. In: Brauer F., van den Driessche P., Wu J., editors. Mathematical epidemiology. Vol. 1945. Springer; Berlin: 2008. pp. 81–130. (Lecture notes in mathematics). Ch. 3. [Google Scholar]
- Allen L.J.S. 2nd Ed. CRC Press; Boca Raton, Fl: 2010. An introduction to stochastic processes with applications to biology. [Google Scholar]
- Allen L.J.S. Springer International Pub.; Cham, Switzerland: 2015. Stochastic population and epidemic models. Persistence and extinction. [Google Scholar]
- Allen E. Environmental variability and mean-reverting processes. Discrete and Continuous Dynamical Systems Series B. 2016;21(7):2073–2089. [Google Scholar]
- Allen E.J., Allen L.J.S., Arciniega A., Greenwood P. Construction of equivalent stochastic differential equation models. Stochastic Analysis and Applications. 2008;26:274–297. [Google Scholar]
- Allen L.J.S., Allen E.J., Jonsson C.B. The impact of environmental variation on hantavirus infection in rodents. In: Gumel A.B., Castillo-Chavez C., Mickens R.E., Clemence D.P., editors. Contemporary mathematics series 410. Vol. 410. AMS; Providence, R. I: 2006. pp. 1–15. (Modeling the dynamics of human diseases: Emerging paradigms and challenges). Ch. 1. [Google Scholar]
- Allen L.J.S., Lahodny G.E., Jr. Extinction thresholds in deterministic and stochastic epidemic models. Journal of Biological Dynamics. 2012;6(2):590–611. doi: 10.1080/17513758.2012.665502. [DOI] [PubMed] [Google Scholar]
- Allen L.J.S., van den Driessche P. Relations between deterministic and stochastic thresholds for disease extinction in continuous- and discrete-time infectious disease models. Mathematical Biosciences. 2013;243:99–108. doi: 10.1016/j.mbs.2013.02.006. [DOI] [PubMed] [Google Scholar]
- Altizer S., Ostfeld R., Johnston P.T.J., Kutz S., Harvell C.D. Climate change and infectious diseases: From evidence to a predictive framework. Science. 2013;341(6145):514–519. doi: 10.1126/science.1239401. [DOI] [PubMed] [Google Scholar]
- Andersson H., Britton T. Springer-Verlag; New York: 2000. Stochastic epidemic models and their Statistical analysis. (Lecture notes in statistics, No. 151). [Google Scholar]
- Antia R., Regoes R., Koella J.C., Bergstrom C.T. The role of evolution in the emergence of infectious diseases. Nature. 2003;426:658–661. doi: 10.1038/nature02104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Athreya K.B., Ney P.E. Springer-Verlag; NewYork: 1972. Branching processes. [Google Scholar]
- Bailey N.T.J. Griffin; London: 1975. The mathematical theory of infectious diseases and its applications. [Google Scholar]
- Barbour A.D. The duration of the closed stochastic epidemic. Biometrika. 1975;62(2):477–482. [Google Scholar]
- Bartlett M.S. The relevance of stochastic models for large-scale epidemiological phenomena. Journal of the Royal Statistical Society, Series C. 1964;13:2–8. [Google Scholar]
- Blumberg S., Lloyd-Smith J.O. Inference of R0 and transmission heterogeneity from the size distribution of stuttering chains. PLOS Computational Biology. 2013;9(5):e1002993. doi: 10.1371/journal.pcbi.1002993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Britton T. Stochastic epidemic models: A survey. Mathematical Biosciences. 2010;225(1):24–35. doi: 10.1016/j.mbs.2010.01.006. [DOI] [PubMed] [Google Scholar]
- Daley D.J., Gani J. Cambridge Univ. Press; New York: 1999. Epidemic Modelling: An introduction. [Google Scholar]
- van den Driessche P., Watmough J. Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Mathematical Biosciences. 2002;180:29–48. doi: 10.1016/s0025-5564(02)00108-6. [DOI] [PubMed] [Google Scholar]
- Dorman K.S., Sinsheimer J.S., Lange K. In the garden of branching processes. SIAM Review. 2004;46:202–229. [Google Scholar]
- Durrett R. Stochastic spatial models. SIAM Review. 1999;41(4):677–718. [Google Scholar]
- Gillespie D.T. Exact stochastic simulation of coupled chemical reactions. The Journal of Physical Chemistry A. 1977;81:2340–2361. [Google Scholar]
- Greenwood P.E., Gordillo L.F. Stochastic epidemic modeling. In: Chowell G., H. J. M, Bettencourt L.M.A., Castillo-Chavez C., editors. Mathematical and statistical estimation approaches in epidemiology. Springer; Dordrecht: 2009. pp. 31–52. Ch. 2. [Google Scholar]
- Harris T.E. Springer-Verlag; Berlin: 1963. The theory of branching processes. [Google Scholar]
- Heffernan J.M., Smith R.J., Wahl L.M. Perspectives on the basic reproductive ratio. Journal of the Royal Society Interface. 2005;2:281–293. doi: 10.1098/rsif.2005.0042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Isham V. Stochastic models for epidemics. In: Davison A.C., Yadolah D., Wermuth N., editors. Celebrating Statistics: Papers in Honour of Sir David Cox on the occasion of his 80th birthday. Oxford Univ. Press; Oxford: 2005. pp. 27–53. (Oxford statistical science series, No. 33). Ch. 1. [Google Scholar]
- Jagers P. John Wiley & Sons; London: 1975. Branching processes with biological applications. [Google Scholar]
- Jutla A., Whitcombe E., Hasan N., Haley B., Akanda A., Huq A. Environmental factors influencing epidemic cholera. American Journal of Tropical Medicine and Hygiene. 2013;89(3):597–607. doi: 10.4269/ajtmh.12-0721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karlin S., Taylor H. 2nd Ed. Academic Press; New York: 1975. A first course in stochastic processes. [Google Scholar]
- Karlin S., Taylor H. Academic Press; New York: 1981. A second course in stochastic processes. [Google Scholar]
- Kloeden P.E., Platen E. Springer-Verlag; New York: 1992. Numerical solution of stochastic differential equations. [Google Scholar]
- Kloeden P.E., Platen E., Schurz H. Springer-Verlag; Berlin: 1997. Numerical solution of SDE through computer experiments. [Google Scholar]
- Kurtz T.G. Solutions of ordinary differential equations as limits of pure jump Markov processes. Journal of Applied Probability. 1970;7:49–58. [Google Scholar]
- Kurtz T.G. Limit theorems for sequences of jump Markov processes approximating ordinary differential processes. Journal of Applied Probability. 1971;8:344–356. [Google Scholar]
- Kurtz T.G. The relationship between stochastic and deterministic models for chemical reactions. Journal of Chemical Physics. 1972;57:2976–2978. [Google Scholar]
- Lloyd-Smith J.O., Schreiber S.J., Kopp P.E., Getz W.M. Superspreading and the impact of individual variation on disease emergence. Nature. 2005;438:355–359. doi: 10.1038/nature04153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lloyd A.L., Zhang Z., Morgan Root A. Stochasticity and heterogeneity in host-vector models. Journal of the Royal Society Interface. 2007;4:851–863. doi: 10.1098/rsif.2007.1064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oksendal B. 5th Ed. Springer-Verlag; Berlin: 2000. Stochastic differential equations: An introduction with applications. [Google Scholar]
- Ross R. 1909. Report on the prevention of malaria in Mauritius, Churchill, London. [Google Scholar]
- Ross R. Murray; London: 1911. The prevention of malaria. [Google Scholar]
- Ross S.M. 11th Ed. Academic Press; Oxford, UK: 2014. An introduction to probability models. [Google Scholar]
- Smith D.L., Battle K.E., Hay S.I., Barker C.M., Scott T.W., McKenzie F. Ross, Macdonald, and a theory for the dynamics and control of mosquito-transmitted pathogens. PLoS Pathogens. 2012;8(4):e1002588. doi: 10.1371/journal.ppat.1002588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Whittle P. The outcome of a stochastic epidemic: A note on Bailey’s paper. Biometrika. 1955;42:116–122. [Google Scholar]
- World Health Organization . 2015. Fact Sheet: World malaria report.http://www.who.int/malaria/media/world-malaria-report-2015/en/ [Google Scholar]
- Wu X., Lu Y., Zhou S., Chen L., Xu B. Impact of climate change on human infectious diseases: Empirical evidence and human adaptation. Environment International. 2016;86:14–23. doi: 10.1016/j.envint.2015.09.007. [DOI] [PubMed] [Google Scholar]