Skip to main content
Springer logoLink to Springer
. 2017 Apr 13;75(6):1563–1590. doi: 10.1007/s00285-017-1123-8

The relationships between message passing, pairwise, Kermack–McKendrick and stochastic SIR epidemic models

Robert R Wilkinson 1,, Frank G Ball 2, Kieran J Sharkey 1
PMCID: PMC5641366  PMID: 28409223

Abstract

We consider a very general stochastic model for an SIR epidemic on a network which allows an individual’s infectious period, and the time it takes to contact each of its neighbours after becoming infected, to be correlated. We write down the message passing system of equations for this model and prove, for the first time, that it has a unique feasible solution. We also generalise an earlier result by proving that this solution provides a rigorous upper bound for the expected epidemic size (cumulative number of infection events) at any fixed time t>0. We specialise these results to a homogeneous special case where the graph (network) is symmetric. The message passing system here reduces to just four equations. We prove that cycles in the network inhibit the spread of infection, and derive important epidemiological results concerning the final epidemic size and threshold behaviour for a major outbreak. For Poisson contact processes, this message passing system is equivalent to a non-Markovian pair approximation model, which we show has well-known pairwise models as special cases. We show further that a sequence of message passing systems, starting with the homogeneous one just described, converges to the deterministic Kermack–McKendrick equations for this stochastic model. For Poisson contact and recovery, we show that this convergence is monotone, from which it follows that the message passing system (and hence also the pairwise model) here provides a better approximation to the expected epidemic size at time t>0 than the Kermack–McKendrick model.

Keywords: Stochastic SIR epidemic, Kermack–McKendrick model, Non-Markovian, Message passing, Pairwise, Network

Introduction

One of the earliest and most comprehensively analysed epidemic models is the susceptible-infected-recovered (SIR) model of Kermack and McKendrick (1927). In addition to providing insights into threshold behaviour and vaccination, it has also underpinned much subsequent work in applied mathematical epidemiology (Anderson and May 1992). A stochastic version, constructed from similar assumptions, was defined and analysed later (for example, Bailey 1975, Chapter 6) and it became of interest to understand the relationship between the two (Kurtz 1970, 1971; Barbour 1972, 1974).

More recently, various heterogeneities have been added to both deterministic and stochastic epidemic models. A particularly important one is the contact network which allows for specific relationships between pairs of individuals; see Danon et al. (2011) and Pastor-Satorras et al. (2015) for reviews. While it is straightforward to simulate stochastic epidemics on networks, deterministic approximations have also been developed to assist our understanding. Important examples of these include pair approximation (Keeling 1999; Sharkey 2008), message passing (Karrer and Newman 2010) and edge-based models (Miller et al. 2011).

The message passing approximation for stochastic epidemics was developed by Karrer and Newman (2010) and is central to the work that we present here. This approach allows one to exactly capture the marginal distributions for the health statuses of individuals (i.e whether they are susceptible, infected or recovered) when the contact network is a tree and provides useful rigorous bounds for these distributions otherwise. Notably, the message passing approach is also applicable to extremely general non-Markovian stochastic epidemics and the number of equations it requires scales linearly with the number of connected pairs of individuals; far fewer than the number of Kolmogorov forward equations for the Markovian case which scales exponentially with population size. Wilkinson and Sharkey (2014) showed that, when contact processes are assumed to be Poisson, a generalised version of the message passing equations is equivalent to a pairwise model that is defined at the level of individuals, thus unifying two major representations of epidemic dynamics. Their argument relies on the application of Leibniz’s integral rule, so here we take the opportunity to provide sufficient conditions for the applicability of that rule in this context (“Appendix 4”).

In Sect. 2 we define a more general stochastic model which allows for realistic correlations between contact times and infectious periods. Specifically, it allows all of an individual’s post-infection contact times (to each of its neighbours), and the negative of its infectious period, to be positively correlated. This could capture, for example, a scenario where infected individuals adopt some disease-combating behaviour such as taking antiviral medication, increasing the infectious contact times to all of their neighbours and decreasing their infectious period. We write down the message passing system for this stochastic model in Sect. 2.1 and then, for the first time, provide a non-restrictive sufficient condition for the message passing equations to have a unique feasible solution (Theorem 1). This is important because so far, the message passing construction of Karrer and Newman has not been shown to give rise to a unique epidemic. We then, in Sect. 2.2, extend the results found in Karrer and Newman (2010) and Wilkinson and Sharkey (2014) to this more general stochastic model; for example, the message passing system cannot underestimate the expected epidemic size at any time t>0, i.e. the expected number of susceptibles infected during (0, t] (Theorem 2; Corollary 1). This is what led Karrer and Newman to describe the message passing system as providing a ‘worst case scenario’.

For all of Sect. 3, we focus on a special case of the above stochastic model which assumes a contact structure with a large amount of symmetry and that all individuals behave in the same way. We refer to this special case as the ‘homogeneous stochastic model’. The corresponding message passing system is written down in Sect. 3.1 and, after exploiting symmetries, this reduces to a system comprising of only four equations which we refer to as the ‘homogeneous message passing system’. This system is identical in form to a special case of the equations formulated by Karrer and Newman (2010, equations 26 and 27), although here it is related to a different stochastic model. We then obtain several epidemiologically relevant results in Sect. 3.2: the stochastic epidemic is shown to be inhibited by cycles in the contact network (Theorem 3), a simple relation for an upper bound on the final epidemic size in the stochastic model is proved and sufficient conditions for no major outbreak in the stochastic model are found (Theorem 4). The latter gives an upper bound on the critical vaccination coverage to prevent a major outbreak, assuming a perfect vaccination.

As a special case of the general correspondence shown in Wilkinson and Sharkey (2014), the homogeneous message passing system has an equivalent non-Markovian pairwise model when the contact processes are Poisson. In Sect. 3.3 we write down these equations explicitly (Theorem 5). This pairwise model provides exactly the same epidemic time course as the homogeneous message passing system and hence exactly the same upper bound on the epidemic size at time t (Corollary 2), and gives the same final epidemic size (Corollary 3). Pairwise models are known to give good approximations of stochastic epidemic dynamics on networks in a broad range of cases [see, for example, Keeling (1999) and Sharkey (2008)]. Thus the proof of equivalence when contact processes are Poisson suggests that message passing provides a good approximation as well as useful bounds.

In Sect. 3.4, we derive the classic Kermack–McKendrick epidemic model as an asymptotic special case of the homogeneous message passing system (Theorem 6). Notably, our derivation of such ‘deterministic’ epidemic models from the homogeneous message passing system allows us to relate them explicitly to the stochastic model [see also, for example, Trapman (2007) and Barbour and Reinert (2013)]. Thus, we are able to show that in the case where contact and recovery processes are independent and Poisson, the Kermack–McKendrick model bounds the expected epidemic size at time t in the homogeneous stochastic model (Corollary 4). However, the bound is coarser than that provided by the homogeneous message passing system and the pairwise system, which therefore give a better approximation than the Kermack–McKendrick model. The paper ends with a brief discussion in Sect. 4.

The stochastic model (non-Markovin network-based SIR dynamics)

We define a very general class of network-based stochastic epidemics which allow heterogeneous and non-Poisson individual-level processes, and heterogeneity in the initial states of individuals (including the case where the initial states of all individuals are non-random).

Let G=(V,E) be an arbitrary (possibly countably infinite) simple, undirected graph, where V is the set of vertices (individuals) and E is the set of undirected edges between vertices (throughout the paper we will use the terms ‘graph’, ‘network’ and ‘contact network’ interchangeably). For iV, let Ni={jV:(i,j)E} be the set of neighbours of i and let |Ni|<. We assume that two individuals are neighbours if and only if at least one can make direct contacts to the other. A particular realisation of the stochastic model is specified as follows. Each individual/vertex iV is assigned a set of numbers Xi relevant to the behaviour of i and the spread of the epidemic:

Xi={Yi,μi,ωji(jNi)},

where Yi is equal to 1, 2, or 3, according to whether i is instantaneously infected at t=0, initially susceptible or initially recovered/vaccinated, these being mutually exclusive; μi[0,] is i’s infectious period if i is ever infected; ωji[0,] is the time elapsing between i first becoming infected and it making a contact to j, if i is ever infected. Therefore, for t0, i makes an infectious contact to j at time t if and only if (i) i becomes infected at some time st, (ii) ωji=t-s, and (iii) ωji<μi. Susceptible individuals become infected as soon as they receive an infectious contact, and infected individuals immediately become recovered when their infectious period terminates (initially recovered/vaccinated individuals never become infected). We let X=iVXi. Thus, the state of the population at time t[0,), which takes values in {S,I,R}V, is a function of X.

The situation which we wish to consider is where X is a set of random variables, so from now on we refer to Yi,μi,ωji, where iV,jNi, as random variables. We use ri and hij to denote the (marginal) probability density functions (PDFs) for μi and ωij respectively, and zi and yi to denote P(Yi=2) and P(Yi=3) respectively. Thus, P(Yi=1)=1-yi-zi. The probability that individual iV is in state Z{S,I,R} at time t0 is denoted by PZi(t).

Importantly we assume that for every iV,

Xi={-μi,ωji(jNi)}

is a set of associated random variables, as defined by Esary et al. (1967) and discussed in this context by Donnelly (1993) and Ball et al. (2015). Additionally, we assume that the set of multivariate random variables {Xi:iV} is mutually independent, and that Yi and Xi are independent for all iV. A finite set of random variables, T1,T2,,Tn say, is associated (or positively correlated) if

E[f(T1,T2,,Tn)g(T1,T2,,Tn)]E[f(T1,T2,,Tn)]E[g(T1,T2,,Tn)] 1

for all non-decreasing real-valued functions fg for which the expectations in (1) exist. Note that (1) implies that the correlation of any pair of these random variables is positive (i.e. 0). Further, if T1,T2,,Tn are mutually independent, then they are associated; see Esary et al. (1967, Theorem 2.1).

The above assumptions of association and independence are made so as to obtain the maximum amount of generality while the message passing and pairwise systems, which we shall define, give rigorous bounds on the expected dynamics in the stochastic model, and exact correspondence when the graph is a tree or forest.

Our stochastic model represents a generalisation of that considered by Karrer and Newman (2010), and also generalises the model considered by Wilkinson and Sharkey (2014), which assumed that all of the elements of X are mutually independent. Here, we do not make this last assumption and allow all of an individual’s post-infection contact times (to each of its neighbours), and the negative of its infectious period, to be positively correlated. This could capture, for example, the scenario where infected individuals tend to adopt some disease-combating behaviour, increasing the contact times to all of their neighbours and decreasing their infectious period.

The model considered by Wilkinson and Sharkey (2014), which incorporates a directed graph, is equivalent to a special case of the above model. Directedness is still captured by the above model since, for any given iV and jNi, ωij and ωji are assigned independently.

The message passing system and its unique solution

Following Wilkinson and Sharkey (2014), we apply the message passing approach of Karrer and Newman (2010) to the stochastic model defined in Sect. 2. Recall that message passing relies on the concept of the cavity state in order to simplify calculations. An individual is placed into the cavity state by cancelling its ability to make contacts. This does not affect its own fate but it does affect the fates of others because it cannot pass on the infection.

For arbitrary iV and neighbour jNi, let Hij(t) denote the probability that i, when in the cavity state, does not receive an infectious contact from j by time t. We can now write:

Hij(t)=1-0tfij(τ)(1-yj-zjΦij(t-τ))dτ, 2

where fij(τ)Δτ=hij(τ)P(μj>τωij=τ)Δτ is the probability (+o(Δτ)) that j makes an infectious contact to i during the time interval [τ,τ+Δτ) (for Δτ0), where time τ is measured from the moment j becomes infected, and Φij(t) is the probability that j does not receive any infectious contacts by time t when i and j are both in the cavity state. Note that although the stochastic model considered here is more general, Hij(t) may still be expressed, as in (2), similarly to equation 1 in Wilkinson and Sharkey (2014), because {Xi:iV} is mutually independent and Yi is independent from Xi for all iV.

To obtain a solvable system, the probability Hij(t) is approximated by Fij(t), where Fij(t)(iV,jNi) satisfies

Fij(t)=1-0tfij(τ)1-yj-zjkNj\iFjk(t-τ)dτ. 3

Any solution of (3) which gives Fij(t)[0,1] for all t0, and all iV,jNi, is called feasible. It was shown by Wilkinson and Sharkey (2014), following Karrer and Newman (2010), that a feasible solution exists as the limit of an iterative procedure.

The message passing system can now be defined (for iV):

Smes(i)(t)=zijNiFij(t), 4
Imes(i)(t)=1-Smes(i)(t)-Rmes(i)(t), 5
Rmes(i)(t)=yi+0tri(τ)[1-yi-Smes(i)(t-τ)]dτ, 6

where the variables on the left-hand side approximate PSi(t), PIi(t) and PRi(t) respectively (recall that PSi(t), PIi(t) and PRi(t) are respectively the probability that individual i is susceptible, infective and recovered-or-vaccinated at time t). Numerical evidence for the effectiveness of the message passing system, in capturing the expected dynamics of the stochastic model, can be seen in Figures 1 and 2 of Wilkinson and Sharkey (2014).

Note that the dimension of the message passing system (3)-(6) is appreciably smaller than that of the Kolomogorov forward equations for the case where the dynamics are Markovian. Suppose that |V|=N. Then the forward equations have dimension 3N and the message passing system has dimension at most N(N-1)+3N. In many cases, symmetries can be exploited to reduce the dimension of both the forward equations, see e.g. Simon et al. (2011), and the message passing system. However, the message passing system is still typically much smaller and can be very small, as in the model studied in Sect. 3.

Theorem 1

(Uniqueness of the feasible solution of the message passing system) Assume that

supiV|Ni|<andsup(i,j)Esupτ0fij(τ)<.

Then there is a unique feasible solution of Eqs. (3)–(6) and the feasible Fij(t) are continuous and non-increasing for all iV,jNi.

Proof

See “Appendix 1”.

It was shown by Wilkinson and Sharkey (2014) that when the graph is finite and fij(τ)=Tije-Tijττrj(τ)dτ(iV,jNi), where Tij(0,), i.e. contact processes are Poisson and independent of recovery processes, then the message passing system (3)–(6) is equivalent to an individual-level pairwise system of integro-differential equations. It now follows that this pairwise system of equations also has a unique feasible solution.

The message passing system (3)–(6), which coincides with that given in Wilkinson and Sharkey (2014) although the underlying model here is more general, differs from the message passing system in Karrer and Newman (2010) in that the probability an individual is initially infected need not be the same for all individuals, and individuals may be initially recovered or vaccinated. The system (3)–(6) also accounts for heterogeneity in the recovery and contact processes. A key use of message passing equations is that they yield a rigorous upper bound for the mean spread in the underlying stochastic epidemic. In the next subsection, we show that this property extends to our more general model.

Bounding the expected epidemic size at time t

For t0, let X(t) denote the number of susceptibles at time t. Thus, X(0)-X(t) is the total number of individuals infected by time t not counting those infected at t=0. We refer to this quantity as the epidemic size at time t.

Theorem 2

(Message passing bounds the marginal distribution for the health status of an individual) For all t0 and all iV,

PSi(t)Smes(i)(t), 7
PRi(t)Rmes(i)(t), 8

with equality if G is a tree or forest.

Proof

In the case where X is mutually independent and V is finite, this is proved by Wilkinson and Sharkey (2014) and Ball et al. (2015) by generalising Karrer and Newman (2010). The proof for our current more general model is in “Appendix 2”.

For t0, let Z(t) denote the number of recovered-or-vaccinated individuals at time t. The following corollary follows immediately from Theorem 2 on noting that, for t0,

E[X(t)]=iPSi(t)andE[Z(t)]=iPRi(t).

Corollary 1

For all t0, we have E[X(t)]iSmes(i)(t) and E[Z(t)]iRmes(i)(t), with equality occurring when the graph is a tree or forest. The expected epidemic size at time t is given by E[X(0)-X(t)]=iVzi-E[X(t)]. Thus, since we have a lower bound on E[X(t)] we also have an upper bound on the expected epidemic size at time t.

The homogeneous stochastic model

In this section we consider a special case of the stochastic model, and we refer to this special case as ‘the homogeneous stochastic model’. In the homogeneous stochastic model, the graph is symmetric and connected. Examples of symmetric connected graphs include complete graphs, ring lattices, infinite square lattices and Bethe lattices. In a symmetric graph, each individual has the same (finite) number n of neighbours, and we say that the graph is n-regular. To avoid triviality we assume n2.

Definition 1

A graph G=(V,E) is called symmetric if it is arc-transitive; i.e. for any two ordered pairs of neighbours ij, and i,j, there exists a graph-automorphism which maps i to i and j to j (Godsil and Royle 2001).

Additionally, in the homogeneous stochastic model, the joint distribution of (Yi,μi,ωji(jNi)) is symmetric in its last n arguments and is the same for all iV. Thus, it is impossible to distinguish between any two individuals by their behaviour or by their position in the graph. Note that we have not precluded the variables in Xi from being non-trivially associated (for all iV), i.e. i’s infectious period and the time it takes for it to contact each of its neighbours, after infection, may all be non-trivially correlated.

We use r and h to denote the (marginal) PDFs for μi and ωij respectively, and z and y to denote P(Yi=2) and P(Yi=3) respectively. Thus, P(Yi=1)=1-y-z. To avoid triviality, we assume that 0<z<1 and 0y<1-z.

Owing to symmetry (in this special case), the probability distribution for the health status of an individual is the same for all individuals, i.e. for all i,iV and all t0, we have PSi(t)=PSi(t),PIi(t)=PIi(t) and PRi(t)=PRi(t) (let PS(t),PI(t) and PR(t) denote these quantities). Similarly, for all iV,jNi and all iV,jNi, and all t0, we have Hij(t)=Hij(t) (let Hsym(t) denote this quantity).

The homogeneous message passing system

For the homogeneous stochastic model, (3) becomes

Fij(t)=1-0tf(τ)1-y-zkNj\iFjk(t-τ)dτ(iV,jNi), 9

where we have used fij(τ)=fij(τ) for all iV,jNi, and all iV,jNi, and all τ0, and we let f(τ) denote this quantity.

The arc-transitivity of symmetric graphs and the symmetry in (9) allow us to simplify (3)–(6), and to write down the full homogeneous message passing system as:

Smes(t)=zFsym(t)n, 10
Imes(t)=1-Smes(t)-Rmes(t), 11
Rmes(t)=y+0tr(τ)[1-y-Smes(t-τ)]dτ, 12

where

Fsym(t)=1-0tf(τ)1-y-zFsym(t-τ)n-1dτ. 13

In deriving these equations, we have used Fij(t)=Fij(t) for all iV,jNi and all iV,jNi, and all t0, and we let Fsym(t) denote this quantity. Note that we have also made use of the fact that every individual has n neighbours. This system is identical in form (when vaccination is disallowed) to the message passing system for the configuration network model provided by Karrer and Newman (2010, equations 26 and 27, making use of equations 1, 4 and 5), in the case where every individual has n neighbours with probability 1. From Theorem 1, we know that if supτ0f(τ)< then (13) has a unique feasible solution.

For clarity we write out these equations for the simplifying cases of Poisson transmission and recovery processes, and Poisson transmission and fixed (non-random) recovery.

Example 1

(Poisson transmission and recovery) For independent Poisson transmission and recovery processes (specifically, τi and ωji are independent and exponentially distributed with rates γ and β respectively), with f(τ)=βe-(β+γ)τ, the homogeneous message passing system can be solved via the following ordinary differential equations (ODEs):

F˙sym(t)=γ(1-Fsym(t))-β(Fsym(t)-y-zFsym(t)n-1), 14
R˙mes(t)=γImes(t), 15

with Smes(t) and Imes(t) given by (10) and (11).

Example 2

(Poisson transmission and fixed recovery) For Poisson transmission processes and a fixed recovery period (specifically, τi is non-random with value R[0,] and ωji is exponentially distributed with rate β), with f(τ)=βe-βτ(1-θ(t-R)) where θ is the Heaviside step function, the homogeneous message passing system can be solved using the following delay differential equation:

F˙sym(t)=-β(Fsym(t)-y-zFsym(t)n-1-θ(t-R)e-βR(1-y-zFsym(t-R)n-1)), 16

with

Rmes(t)=y+θ(t-R)(1-y-Smes(t-R)), 17

and Smes(t) and Imes(t) given by (10) and (11).

Other choices of f(τ) exist which allow the message passing system to be solved via (non-integro) differential equations, such as the top hat function (Karrer and Newman 2010, equation 33).

Epidemiological results

As well as bounding/approximating (or correctly computing in the case of an infinite regular tree) the expected fractional epidemic size at time t0, the homogeneous message passing system generates other epidemiologically relevant results for the stochastic model, as demonstrated here.

Theorem 3

(Cycles in the network inhibit the stochastic epidemic) Suppose that supτ0f(τ)<. The probability of an arbitrary individual being susceptible at a given time, for the n-regular Bethe lattice (infinite tree), is less than or equal to this quantity for all other n-regular symmetric graphs (where the homogeneous stochastic model is otherwise unchanged). The same holds for the probability of an arbitrary individual being recovered except with the inequality reversed.

Proof

From Theorem 2, we know that system (10)–(13) cannot overestimate the probability of an arbitrary individual being susceptible at time t and cannot underestimate the probability of an arbitrary individual being recovered at time t. However, also from Theorem 2, the system is exact if the graph is a tree.

Theorem 3 suggests that, all other things being equal, an infection will have the greatest impact by time t when the contact structure is most tree-like. Indeed, it is known that clustering and the presence of cycles in the graph may slow down and limit the spread of an infection (see Miller (2009) and references therein).

Theorem 4

(Final epidemic size relation and sufficient conditions for no major outbreak) For all t0,

Smes()PS(t),Rmes()PR(t), 18

where Smes()limtSmes(t) may be computed as the unique solution in [0, z] of

Smes()z1n=1-p+py+pzSmes()zn-1n, 19

with p0f(τ)dτ, and Rmes()=1-Smes().

Further, when the fraction initially infected is small, i.e. z1-y from below, then

PS()=PS(0)ify1-1R0orR01, 20

where R0(n-1)p. (This means that if each individual is independently vaccinated with probability greater than or equal to 1-1/R0, or if R01, then a major outbreak of the disease is impossible.)

Proof

Equation (18) follows from Theorem 2 and the observation that PS(t) and PR(t) are non-increasing and non-decreasing respectively.

The feasible Fsym(t) is non-increasing (see Theorem 1), so it converges to some Fsym()[0,1] as t. Note also that, by definition, 0tf(τ)dτ converges to p[0,1] as t. Now, using (13), we can write Fsym(t)=1-0ft(τ)dτ, where ft(τ)=f(τ)(1-y-zFsym(t-τ)n-1) for τ[0,t] and is equal to zero for τ>t. Note that ft(τ) converges pointwise to f(τ)(1-y-zFsym()n-1) as t. Thus, since 0ft(τ)f(τ) for all t,τ0, we can use the dominated convergence theorem to obtain, c.f. Karrer and Newman (2010, equations 23 and 24),

Fsym()=1-0limtft(τ)dτ=1-p1-y-zFsym()n-1. 21

Taking the limit as t in (10), and making use of (21), proves equation (19). It is straightforward to show by graphical means that (19) has a unique solution in [0, z]. In the case where z1-y from below, it is also straightforward to show by graphical means that, after setting z=1-y in (19), then Smes()=z(=Smes(0)=PS(0)) is the only solution in [0, z] if y1-1/R0 (R01 implies this condition). Equation (20) is then proved by noting that PS(t)Smes(t), and PS(t) is non-increasing from PS(0)=Smes(0).

Equation 19 is consistent with the final size relation given by Diekmann et al. (1998) (equations 5.3 and 5.4) for a regular random graph in the limit of large population size.

Remark 1

Consider an infinite sequence of finite homogeneous stochastic models, indexed by m, where ym=y[0,1) for all m, and where Nm, pm(nm-1)R0<, zm1-y, as m (here, Nm denotes the number of individuals in the mth model). This does not preclude the expected number of initial infectives from tending to some positive number, or even diverging, as m. It is straightforward that, in the limit of this sequence, the sufficient conditions for no major outbreak in Theorem 4 still hold. Note that if in addition we have nm as m, then the final size relation for the homogeneous message passing system (in this limit) becomes, using (19) with z=1-y,

Smes()1-y=e-R0(1-Smes()-y).

This is a well-known final size relation in the mean field literature, although usually vaccination is not included (see Miller (2012) for a discussion of derivations of this relation).

The homogeneous message passing system gives the same epidemic time course as a pairwise model

Here we show that a generalised pairwise SIR model, with well-known pairwise models as special cases, gives the same epidemic time course as the homogeneous message passing system. This allows us to prove epidemiological results for the generalised pairwise model. Since pairwise models are known to give good approximations of stochastic epidemic dynamics on networks [see, for example, Keeling (1999) and Sharkey (2008)], this also strengthens the case for the message passing system being a good approximation.

Theorem 5

(Equivalence of the message passing and pairwise models) For the homogeneous stochastic model, assume that the contact processes are Poisson with rate β and that they are independent from the recovery processes, such that f(τ)=βe-βττr(τ)dτ. Assume also that r(τ) is continuous. Then,

[S]˙(t)=-β[SI](t), 22
[I]˙(t)=β[SI](t)-0tr(τ)β[SI](t-τ)dτ-r(t)N(1-y-z), 23
[SS]˙(t)=-2βn-1n[SS](t)[SI](t)[S](t), 24
[SI]˙(t)=-βn-1n[SI](t)[SI](t)[S](t)-β[SI](t)+βn-1n[SS](t)[SI](t)[S](t)-0te-βτr(τ)βn-1n[SS](t-τ)[SI](t-τ)[S](t-τ)×exp(-t-τtβn-1n[SI](τ)[S](τ)dτ)dτ-nNze-βtr(t)(1-y-z)exp(-0tβn-1n[SI](τ)[S](τ)dτ), 25

where

S(t)NSmes(t), 26
I(t)NImes(t), 27
SS(t)nNSSmes(t)nNz2Fsym(t)2(n-1), 28
SI(t)nNSImes(t)nNzFsym(t)n-1(-F˙sym(t)β), 29

and N is a positive number.

Proof

See “Appendix 3”.

Corollary 2

(At all time points the pairwise model cannot underestimate the expected epidemic size) If, in the homogeneous stochastic model, contact processes are Poisson with rate β, i.e. the marginal distribution for ωji is exponential with parameter β for all iV,jNi, and these are independent from the infectious periods, then

[S](t)/NPS(t),[R](t)/NPR(t)(t0),

where [R](t)=N-[S](t)-[I](t).

Proof

This follows immediately from Theorems 2 and 5.

Corollary 3

(Final epidemic size equation for the pairwise model)

[S]()Nz1n=1-p+py+pz[S]()Nzn-1n,

where [S]()limt[S](t) and p0βe-βττr(τ)dτdτ.

Proof

This follows immediately from Theorems 4 and 5.

Note that (22)–(25) constitute a closed system for the variables [S](t), [I](t), [SS](t) and [SI](t) (if [S](t)=0 then the right-hand sides of (24) and (25) are undefined, but in this case the left-hand sides are equal to zero). With reference to (28) and (29), the quantities SSmes(t) and SImes(t) are constructed to capture/approximate, for any given pair of neighbours at time t, the probability that they are both susceptible and the probability that the first is susceptible while the second is infected respectively (see “Appendix 3”). The system (22)–(25) also follows directly from application of the individual-level pairwise equations in Wilkinson and Sharkey (2014, equations 8 and 9). In the case where the infectious period is exponentially distributed and letting N be the population size, (23) and (25) simplify to ODEs, and the pairwise (without clustering) model of Keeling (1999) is obtained. Similarly, after substituting r(τ)=δ(t-R), where δ is the Dirac delta function, into (23) and (25), the pairwise model of Kiss et al. (2015) for a non-random infectious period of duration R is obtained (except that the last term in (23) and the last term in (25), which relate to the behaviour of the initial infectives, need to be neglected). However, it may be more efficient to solve the simpler message passing systems [via (14)–(15) and (16)–(17) respectively] and then, if pairwise quantities are required, these can be computed using (28) and (29).

As part of the proof of equivalence between message passing and pairwise models that we present here, we also close a gap in the arguments of Wilkinson and Sharkey (2014) by demonstrating sufficient conditions for the valid application of Leibniz’s integral rule (“Appendix 4”) in the derivation of the pairwise equations from the message passing equations (“Appendix 3”).

The homogeneous message passing system gives the same epidemic time course as the Kermack–McKendrick model (asymptotically)

Here, we consider a sequence of homogeneous stochastic models where the regular degree n tends to infinity. As n, an individual is able to make contacts to a number of neighbours which tends to infinity, so to obtain a finite limit we assume that the infection function f(τ) depends on n (which we write fn(τ)) such that:

limnnfn(τ)=f(τ)<(τ0).

Note that, in the limit of large n, transmission is frequency dependent and the expected number of infectious contacts made by a given infected individual during the time interval (t1,t2) is t1t2f(τ)dτ, where time is measured from the moment the individual first became infected.

The deterministic model proposed by Kermack and McKendrick (1927) is as follows:

S˙(t)=S(t)0tf(τ)S˙(t-τ)dτ-I(0)f(t), 30
I(t)=1-S(t)-R(t), 31
R(t)=R(0)+0tr(τ)[1-R(0)-S(t-τ)]dτ. 32

Equations 12–15 of Kermack and McKendrick (1927) may be obtained from (30) to (32) after multiplying through by the total population size N in their paper.

The following theorem shows that, under this limiting regime and mild further conditions, the homogeneous message passing system gives the same epidemic time course as the model of Kermack and McKendrick (1927). For n=1,2,, let Smes(n)(t),Imes(n)(t) and Rmes(n)(t) denote the message passing system given by (10)-(13), where Fsym(t) is replaced by Fsym(n)(t), which satisfies (13) with f(τ) replaced by fn(τ).

Theorem 6

(Deriving the Kermack–McKendrick model from message passing) Suppose that for all T0,

  • (i)

    ϵn(T)=sup0tT|nfn(t)-f(t)|0 as n,

  • (ii)

    MT=sup0tTf(t)<,

and that, for all n=1,2,,

  • (iii)

    fn(t) is continuously differentiable,

  • (iv)

    Smes(n)(0),Imes(n)(0),Rmes(n)(0)=(S(0),I(0),R(0))=(z,1-z-y,y).

Then, for all T>0,

limnsup0tTSmes(n)(t)-S(t)=0, 33
limnsup0tTImes(n)(t)-I(t)=0, 34
limnsup0tTRmes(n)(t)-R(t)=0. 35

Proof

Fix T>0 and note first from (13) that, for feasible Fsym(n)(t) and all t[0,T],

1Fsym(n)(t)1-0tfn(τ)dτ(n=1,2,).

Now n0tfn(τ)dτT(MT+ϵn(T)), for all t[0,T], so conditions (i) and (ii) imply that there exists ϵn(1)(T)0 such that for all t[0,T],

1Fsym(n)(t)1-ϵn(1)(T)(n=1,2,), 36

where ϵn(1)(T)0 as n. Thus, for all sufficiently large n, Fsym(n)(t) is non-zero for all t[0,T].

Differentiating (10) yields

S˙mes(n)(t)=nzFsym(n)(t)n-1F˙sym(n)(t), 37

and differentiating (13), using Leibniz’s integral rule (see “Appendix 4”), gives

F˙sym(n)(t)=-fn(t)(1-y-z)+(n-1)z0tfn(τ)Fsym(n)(t-τ)n-2F˙sym(n)(t-τ)dτ. 38

Substituting (38) into (37), and using (10), gives

S˙mes(n)(t)=Smes(n)(t)Fsym(n)(t)[n-1n0tnfn(τ)S˙mes(n)(t-τ)Fsym(n)(t-τ)dτ-nfn(t)(1-y-z)]. 39

It can be shown, using (30) and (39) that, for all t[0,T],

S˙mes(n)(t)-S˙(t)A(n,T)0tS˙mes(n)(u)-S˙(u)du+B(n,T), 40

where B(n,T)0 as n and 0A(n,T)4MT for all sufficiently large n (see “Appendix 5”). Application of Gronwall’s inequality (see “Appendix 4”) then yields that, for all t[0,T],

S˙mes(n)(t)-S˙(t)B(n,T)eA(n,T)t. 41

Thus

limnsup0tTS˙mes(n)(t)-S˙(t)=0,

whence

limnsup0tTSmes(n)(t)-S(t)=limnsup0tT0tS˙mes(n)(u)-S˙(u)dulimnsup0tT0tS˙mes(n)(u)-S˙(u)dulimnTsup0tTS˙mes(n)(t)-S˙(t)=0,

proving (33). Equation (35) now follows using a similar argument and (34) is then immediate.

It is straightforward that if f(τ)=βke-γτ and r(τ)=γe-γτ then the Kermack–McKendrick model reduces to a system of ODEs:

S˙(t)=-βkS(t)I(t), 42
I˙(t)=βkS(t)I(t)-γI(t), 43
R˙(t)=γI(t). 44

For this special case, we state the following corollary to Theorem 6, see also Wilkinson et al. (2016), where it is proved that, for non-random initial conditions, the Kermack–McKendrick model bounds the so-called ‘general stochastic epidemic’.

Corollary 4

(In the Markovian case, message passing and pairwise models are better approximations than the Kermack–McKendrick model) Assume that, in the homogeneous stochastic model, contact and recovery processes are independent and Poisson with rates β and γ respectively. Specifically, h(τ)=βe-βτ, r(τ)=γe-γτ and f(τ)=βe-(β+γ)τ. Let k denote the regular degree of the symmetric graph (instead of n). For this special case,

S(t)<Smes(t)PS(t),R(t)>Rmes(t)PR(t)(t>0),

where S(t) and R(t) are given by (42)–(44), with S(0)=z,I(0)=1-y-z and R(0)=y, and Smes(t) and Rmes(t) are given by (9)–(12) with n replaced by k.

Proof

See “Appendix 6”.

Discussion

The message passing equations of Karrer and Newman (2010) approximate the expected time course for non-Markovian SIR epidemic dynamics on networks. In a later paper, Wilkinson and Sharkey (2014) slightly generalised their equations in order to make them applicable to stochastic models with more individual level heterogeneity. Here, for the first time, we have shown that Karrer and Newman’s system of message passing equations, and its generalisation, have unique feasible solutions (Theorem 1).

An important feature of the message passing equations is that they produce an upper bound to the expected epidemic size (cumulative number of infection events) at every point in time. Thus, they give a ‘worst case scenario’. In addition, they exactly capture the expected epidemic when the contact network is a tree. Here, we extended these results to a further generalised stochastic model which includes realistic correlations between post-infection contact times and the infectious period (Theorem 2). This situation can occur when individuals may adopt disease-combating behaviour, such as taking antiviral medication, which acts both on the ability of an individual to pass on the infection as well as the duration of their infectivity.

Much of this paper was devoted to a special case of the stochastic model which we referred to as the ‘homogeneous stochastic model’, in which individuals are homogeneous and the contact network is a symmetric graph (correlations between post-infection contact times and the infectious period are still allowed). Examples of a symmetric graph include a finite complete graph, an infinite square lattice and an infinite Bethe lattice. Due to symmetry, the message passing system here reduces to just four equations which we refer to as the ‘homogeneous message passing system’. This system is equivalent in form to a special case of the system found by Karrer and Newman (2010) to describe epidemic dynamics on random configuration networks, but here it is applied to a different stochastic model. These equations were analysed, making use of Theorem 2, to obtain a result which shows that cycles in the contact network serve to inhibit the stochastic epidemic (Theorem 3). Following arguments from Karrer and Newman (2010), we also obtained a single equation which provides an upper bound on the final epidemic size (Theorem 4); for the Bethe lattice, the final epidemic size is captured exactly. This naturally provides sufficient conditions, in terms of an R0-like quantity and the level of vaccination, for there to be no major outbreak (Theorem 4).

We found that the ‘limit’ of an appropriate sequence of homogeneous message passing systems gives the same epidemic time course as the Kermack and McKendrick (1927) epidemic model (Theorem 6) showing that it can be viewed as a special case of message passing. This also has the advantage of relating it to the underlying stochastic model (see also Barbour and Reinert (2013) who establish an exact correspondence). The final epidemic size result, and sufficient conditions for no major outbreak, described above for the homogeneous message passing system then translate directly.

From the homogeneous message passing system, we also constructed an equivalent population-level pairwise system which incorporates a general infectious period (Theorem 5). This can also be derived directly as a special case of the general individual-level pairwise system of Wilkinson and Sharkey (2014, equations 8 and 9) by applying the conditions of the homogeneous stochastic model. Here we filled a gap in the arguments of this paper by demonstrating sufficient conditions for the valid application of Leibniz’s integral rule (“Appendix 4”). This population-level pairwise system contains the Poisson pairwise model (without clustering) of Keeling (1999) as a special case. It also contains the delay differential equation model of Kiss et al. (2015) as a special case. We note that an entirely different derivation of (22)–(25) has been found independently and in parallel by Röst et al. (2016).

In general, we have emphasised the equivalence between several different types of SIR epidemic model. Specifically, we mention the derivation of the Kermack–McKendrick model as a special case of message passing and the equivalence (under Markovian transmission) of message passing and a class of pairwise models [see also Wilkinson and Sharkey (2014)]. We also note the recently submitted paper by Sherborne et al. (2016) which highlights the equivalence of message-passing and edge-based models (Miller et al. 2011), and that there is equivalence between edge-based models and the model of Volz (2008) [proved by Miller (2011)], and between the model of Volz and the binding site model of Leung and Diekmann (2017, Remark 1). While for SIR dynamics, message passing provides quite a general unifying framework, we note that for other dynamics such as SIS, it remains difficult to formulate a similar construction.

Unification of models is valuable in narrowing the lines of enquiry and simplifying ongoing research. In addition, owing to their different constructions, different types of results have been more forthcoming for some models than for others, and unification can allow results for one model to be automatically transferred to another. For example, here, by unification with message passing, we have been able to show that when contact and recovery processes are independent and Poisson, the Kermack–McKendrick model (which then reduces to a mass action ODE model) provides a rigorous upper bound on the expected epidemic size at time t>0 in the homogeneous stochastic model (Corollary 4). However, the bound is coarser than that provided by the message passing and pairwise systems, so we now know that these are better approximations. This extends the result that, for non-random initial conditions, the Kermack–McKendrick model bounds the so-called ‘general stochastic epidemic’ (Wilkinson et al. 2016). An interesting development would be to show that the Kermack–McKendrick model (30)–(32) bounds the homogeneous stochastic model more generally. We observe that this could be achieved by showing that the message passing system for the stochastic model is the first in a sequence of message passing systems indexed by n, which satisfies the conditions for Theorem 6, and where Smes(n)(t) is non-increasing with n; this is easy to do for Poisson transmission and recovery processes (“Appendix 6”). Another extension worthy of investigation is to multitype SIR epidemics.

Acknowledgements

R.R.W. acknowledges support from EPSRC (DTA studentship). R.R.W. and K.J.S. acknowledge support from the Leverhulme Trust (RPG-2014-341). We thank the reviewers and associate editor for their constructive comments which have improved the presentation of the paper.

Appendix 1: Proof of Theorem 1

Reproducing an argument from Karrer and Newman (2010), we construct here a feasible (bounded between 0 and 1) solution of (3). Let F(0)ij(t)=1 for all iV,jNi and all t0, and define the following iterative procedure. For m=1,2,, let

F(m)ij(t)=1-0tfij(τ)1-yj-zjkNj\iF(m-1)jk(t-τ)dτ. 45

It is easily shown that 1F(m)ij(t)F(m+1)ij(t)1-0tfij(τ)dτ, for all iV,jNi, t0 and m=0,1,, whence Fm(t)(Fmij(t):iV,jNi) converges to some F(t) as m, and F(t) is a feasible solution of (3). Moreover, letting F(t) be any feasible solution of (3), it can be shown, arguing as in Corduneanu (1991), section 1.3, that

supiV,jNi|Fij(t)-Fmij(t)|(Nmax-1)m(tfmax)m+1(m+1)!, 46

where Nmax=supiV|Ni| and fmax=supiV,jNisupt0fij(t). Assume that Nmax< and fmax<. Then, the right-hand side of (46) converges to zero as m, and F(t) must be the unique feasible solution of (3).

Note that (45) implies that if, for all iV,jNi, it is the case that F(m-1)ij(t) is non-increasing and belongs to [0, 1] for all t0, then these properties are also held by F(m)ij(t) for all iV,jNi. Since these properties are held by F(0)ij(t)(=1) for all iV,jNi, then, by induction, they hold for all m0, so F()ij(t) is non-increasing for all iV,jNi. Thus, the feasible solution of (13) (for Fsym(t)) is non-increasing, whence Smes(t) is non-increasing.

To show continuity of the feasible solution, first note that (45) implies that if, for all iV,jNi, it is the case that F(m-1)ij(t) is continuous, then F(m)ij(t) is also continuous for all iV,jNi. Since F(0)ij(t)(=1) is continuous for all iV,jNi, then, by induction, F(m)ij(t) is continuous for all m0,iV,jNi. Now, for any fixed T>0, the bound in (46) holds for all t[0,T] provided t in the right-hand side of (46) is replaced by T. Thus Fm(t) converges uniformly to F(t) over [0, T] as n and, since each Fm(t) is continuous on [0, T], it follows that F(t) is also continuous on [0, T]. This holds for any T>0, so F(t) is continuous on [0,).

Appendix 2: Proof of Theorem 2

We suppose first that the vertex set V is finite. Similarly to Wilkinson and Sharkey (2014, section III), and Ball et al. (2015), it is straightforward to show that the indicator variable 1iA(t) for the event that a cavity state-individual iV does not receive any infectious contacts from any of ANi by time t0 is a function of the random variables XiV{Xi,Yi} (see the beginning of Sect. 2), and that it is non-decreasing with respect to each element of X. Thus, since X is a set of associated variables [by assumption, and Esary et al. (1967, (P2) and (P3))] and Yi is independent of all other members of X, then using Esary et al. (1967, Theorem 4.1), we have

PSi(t)=ziE[1iNi(t)]zijNiE[1ij(t)]=zijNiHij(t)(iV), 47

with equality occurring when the graph is a tree or forest (where putting an individual into the cavity state prevents any dependencies between the states of its neighbours). Recall that ziP(Yi=2) is the probability that i is initially susceptible.

Similarly, the indicator variable 1(i)jA(t) for the event that a cavity state-individual jV does not receive any infectious contacts from any of ANj\i by time t0, where iNj is also in the cavity state, is a function of the random variables X, and it is non-decreasing with respect to each. Again, since X is a set of associated variables then we have [c.f. (2) and (3)],

Φij(t)=E[1(i)jNj\i(t)]kNj\iE[1(i)jk(t)]kNj\iE[1jk(t)]=kNj\iHjk(t), 48

where the second inequality follows from the fact that taking an individual out of the cavity state cannot increase the probability that a different individual receives no infectious contacts from a given neighbour by time t0. Again, equality occurs when the graph is a tree or forest.

The above derivations of (47) and (48) break down when the vertex set V is countably infinite, since the theory in Esary et al. (1967) requires that the set of random variables X is finite. Suppose now that V is countably infinite and label the vertices 1,2,. Fix iV and an integer ni. Let G(n)=(V(n),E(n)) be the graph obtained from G by deleting the vertices n+1,n+2, and all edges connected to those vertices. Now, since |V(n)|<, the inequality (47) yields

PSi(n)(t)zijNi(n)H(n),ij(t), 49

where the superfix n denotes that the quantity is defined for the epidemic on G(n). Further, for n=i,i+1,, the epidemic on G(n) can be defined using the same set XiV{Xi,Yi} of random variables. It then follows that, for any t0, the event that individual i is susceptible at time t in the epidemic on G(n) decreases with n and tends to the event that individual i is susceptible at time t in the epidemic on G as n, so PSi(n)(t)PSi(t) as n by the continuity of probability measures. A similar argument shows that H(n),ij(t)Hij(t) as n. Letting n in (49) then shows that (47) holds when V is countably infinite, as |Ni|<. The same method of proof shows that (48) also holds when V is countably infinite.

Using (48) in conjunction with (2) we have

Hij(t)1-0tfij(τ)1-yj-zjkNj\iHjk(t-τ)dτ, 50

where equality occurs when the graph is a tree or forest. Using (50), it is easy to show by the iterative procedure in “Appendix 1” (except with F(0)ij(t)=Hij(t)) that a unique feasible solution of (3) exists and, using this solution, that Fij(t)Hij(t) for all iV,jNi and all t0, with equality occurring when the graph is a tree or forest. This fact, in combination with (47), c.f. (4), proves (7), and consequently, c.f. (6), gives (8).

Appendix 3: Proof of Theorem 5

Here we consider the homogeneous stochastic model defined at the beginning of Sect. 3 with reference to the beginning of Sect. 2. We assume that transmission processes are Poisson with rate β and that they are independent of the recovery processes, specifically f(τ)=βe-βττr(τ)dτ. We assume that r(τ) is continuous so that we may apply Leibniz’s integral rule to compute derivatives (see “Appendix 4”). In this case, a pairwise system incorporating a general infectious period can be derived from the homogeneous message passing system (10)–(13) with the additional variables:

SSmes(t)z2Fsym(t)2(n-1), 51
SImes(t)zFsym(t)n-1(-F˙sym(t)β), 52

where SSmes(t) approximates the probability that a pair of neighbours are susceptible at time t, and SImes(t) approximates the probability that the first is susceptible and the second is infected at time t [see Wilkinson and Sharkey (2014, section II B) where these pairwise quantities were first considered in the context of message passing]. To understand the construction of the factor in brackets in (52), note that for any pair of neighbours ij,  the probability that i is susceptible and j is infected at time t remains the same when i is placed into the cavity state. Further, when transmission processes are Poisson with rate β, we must have that:

H˙ij(t)=-βP(jinfected at timetand no infectiouscontacts fromjtoibefore timetiin cavity). 53

Thus, the factor in brackets in (52) can be seen to approximate the probability on the right-hand side of (53) for any pair of neighbours ij (recall that Fsym(t) approximates Hij(t) for any pair of neighbours ij).

To obtain population-level quantities, we define [as in Sharkey (2008, appendix B)]:

[S](t)NSmes(t),[I](t)NImes(t),[SS](t)nNSSmes(t),[SI](t)nNSImes(t), 54

where N is a positive number. Note that (10) and (52) imply

F˙sym(t)=-βFsym(t)SImes(t)Smes(t)(Smes(t)0), 55

so, since Fsym(0)=1, we have:

Fsym(t)=exp(-0tβSImes(τ)Smes(τ)dτ)(Smes(t)0). 56

Substituting from (10)–(12) and (51), and using (55), it is straightforward to write down the time derivatives of [S](t), [I](t) and [SS](t) as in (22)–(24).

Finding the time derivative of [SI](t) is more involved. Setting u=t-τ in (13) and differentiating with respect to t using Leibniz’s integral rule yields, recalling f(τ)=βe-βττr(τ)dτ, that

F˙sym(t)=-β(Fsym(t)-y-zFsym(t)n-1)+0tβe-βτr(τ)(1-y-zFsym(t-τ)n-1)dτ. 57

Substituting from (52) and (57) into (54), we can write

[SI](t)=nNzFsym(t)n-1(-F˙sym(t)β)=nNzFsym(t)n-1[Fsym(t)-y-zFsym(t)n-1-0te-βτr(τ)(1-y-zFsym(t-τ)n-1)dτ]. 58

Differentiating the right-hand side of (58), we can now express the time derivative of [SI](t) as

[SI]˙(t)=n(n-1)NzFsym(t)n-2F˙sym(t)(-F˙sym(t)β)+nNzFsym(t)n-1F˙sym(t)-n(n-1)Nz2Fsym(t)2n-3F˙sym(t)+n(n-1)Nz2Fsym(t)n-10te-βτr(τ)Fsym(t-τ)n-2F˙sym(t-τ)dτ-nNzFsym(t)n-1e-βtr(t)(1-y-z). 59

Substituting from (10), (51), (52), (54), (55) and (56) into (59) yields the expression for [SI]˙(t) in (25); the terms on the right-hand side of (25) are ordered by equality with the terms on the right-hand side of (59).

Appendix 4: Continuity conditions for the application of Leibniz’s integral rule and Gronwall’s inequality

To derive (38), Leibniz’s integral rule is applied to (13), and this is valid if Fsym(t) is continuously differentiable. Similarly, the application of the rule in the derivation of (57) and (59) is valid if f(τ) and Fsym(t) are continuously differentiable. Here we show that Fsym(t) is continuously differentiable if f(τ) is continuously differentiable. Note that if f(τ)=βe-βττr(τ)dτ then f(τ) is continuously differentiable when r(τ) is continuous.

With reference to the message passing system, (10)–(13), assume that f(τ) is continuously differentiable. Thus we may apply Leibniz’s integral rule to (13), after setting τ=t-τ, in order to compute the derivative of Fsym(t) as follows

F˙sym(t)=-0tf˙(t-τ)(1-y-zFsym(τ)n-1)dτ-f(0)(1-y-zFsym(t)n-1). 60

It follows from “Appendix 1” that Fsym(t) is continuous. Thus, since f˙(τ) is also continuous, (60) implies that F˙sym(t) is continuous.

To derive (41), Gronwall’s inequality is applied to (40), and this is valid if S˙mes(n)(t) and S˙(t) are continuous. By condition (iii) of Theorem 6, we have that F˙sym(n)(t) is continuous (by the above argument), so S˙mes(n)(t) is continuous. Conditions (i) and (iii) imply that f(t) is continuous, which implies that S˙(t) is continuous.

We note that Leibniz’s integral rule was assumed to be applicable in Wilkinson and Sharkey (2014). It is straightforward, using a similar argument to above, to show that the application of the rule in that paper is valid if fij(τ) is continuously differentiable for all iV,jNi.

Appendix 5: Proof of (40)

It follows from (30) and (39) that, for all t[0,T],

S˙mes(n)(t)-S˙(t)An(t)+Bn(t), 61

where

An(t)=Smes(n)(t)Fsym(n)(t)[n-1n0tnfn(τ)S˙mes(n)(t-τ)Fsym(n)(t-τ)dτ]-S(t)0tf(τ)S˙(t-τ)dτ

and

Bn(t)=Smes(n)(t)Fsym(n)(t)nfn(t)(1-y-z)-S(t)I(0)f(t).

Now

An(t)An(1)(t)+An(2)(t), 62

where

An(1)(t)=Smes(n)(t)Fsym(n)(t)[n-1n0tnfn(τ)S˙mes(n)(t-τ)Fsym(n)(t-τ)dτ]-Smes(n)(t)0tf(τ)S˙(t-τ)dτ

and

An(2)(t)=Smes(n)(t)-S(t)×0tf(τ)S˙(t-τ)dτ.

Considering An(1)(t), note that, since 0Smes(n)(t)1,

An(1)(t)n-1n1Fsym(n)(t)An(11)(t)+An(12)(t), 63

where

An(11)(t)=0tnfn(τ)S˙mes(n)(t-τ)Fsym(n)(t-τ)dτ-0tf(τ)S˙(t-τ)dτ0tnfn(τ)Fsym(n)(t-τ)S˙mes(n)(t-τ)-S˙(t-τ)dτ+0tnfn(τ)Fsym(n)(t-τ)-f(τ)S˙(t-τ)dτ

and

An(12)(t)=0tf(τ)S˙(t-τ)dτ×n-1n1Fsym(n)(t)-1.

Now conditions (i), (ii) and (36) imply that, for all t[0,T],τ[0,t],

nfn(τ)Fsym(n)(t-τ)MT+ϵn(T)1-ϵn(1)(T)

and

nfn(τ)Fsym(n)(t-τ)-f(τ)1Fsym(n)(t-τ)(nfn(τ)-f(τ)+f(τ)1-Fsym(n)(t-τ))ϵn(T)+MTϵn(1)(T)1-ϵn(1)(T), 64

whence

An(11)(t)MT+ϵn(T)1-ϵn(1)(T)0tS˙mes(n)(t-τ)-S˙(t-τ)dτ+ϵn(T)+MTϵn(1)(T)1-ϵn(1)(T)0tS˙(t-τ)dτMT+ϵn(T)1-ϵn(1)(T)0tS˙mes(n)(u)-S˙(u)du+ϵn(T)+MTϵn(1)(T)1-ϵn(1)(T),

as 0tS˙(t-τ)dτ=S(0)-S(t)[0,1]. A similar argument, noting that

0tf(τ)S˙(t-τ)dτ0tf(τ)S˙(t-τ)dτMT[S(0)-S(t)]MT,

shows that

An(12)(t)MTϵn(1)(T)+1n1-ϵn(1)(T).

Hence, recalling (63),

An(1)(t)MT+ϵn(T)1-ϵn(1)(T)20tS˙mes(n)(u)-S˙(u)du+ϵn(T)+MTϵn(1)(T)1-ϵn(1)(T)2+MTϵn(1)(T)+1n1-ϵn(1)(T). 65

Turning to An(2)(t), note that since Smes(n)(0)=S(0),

Smes(n)(t)-S(t)=0tS˙mes(n)(u)-S˙(u)du0tS˙mes(n)(u)-S˙(u)du,

so

An(2)(t)MT0tS˙mes(n)(u)-S˙(u)du. 66

Further, since I(0)=1-y-z and 0I(0),Smes(n)(t)1,

Bn(t)=I(0)Smes(n)(t)Fsym(n)(t)nfn(t)-S(t)f(t)I(0)f(t)Smes(n)(t)-S(t)+Smes(n)(t)nfn(t)Fsym(n)(t)-f(t)MT0tS˙mes(n)(u)-S˙(u)du+ϵn(T)+MTϵn(1)(T)1-ϵn(1)(T), 67

using a similar result to (64).

Thus, using (61), (62), (65), (66) and (67), we may define

A(n,T)=2MT+MT+ϵn(T)1-ϵn(1)(T)2

and

B(n,T)=ϵn(T)+MTϵn(1)(T)2-ϵn(1)(T)1-ϵn(1)(T)2+MTϵn(1)(T)+1n1-ϵn(1)(T),

such that inequality (40) is satisfied for all t[0,T]. Further, since both ϵn(T) and ϵn(1)(T) converge to 0 as n, it follows that B(n,T)0 as n and 0A(n,T)4MT for all sufficiently large n.

Appendix 6: Proof of Corollary 4

Here, we consider the homogeneous stochastic model (defined at the beginning of Sect. 3, with reference to the beginning of Sect. 2) for the special case where transmission and recovery processes are independent and Poisson with rates β and γ respectively. Specifically, h(τ)=βe-βτ, r(τ)=γe-γτ and f(τ)=βe-(β+γ)τ. For convenience, we let k denote the regular degree of the symmetric graph (instead of n).

For this special case, we show here that for the same initial conditions and parameters,

PS(t)Smes(t)>S(t)forallt>0, 68

where PS(t) is the probability that an arbitrary individual is susceptible at time t (this being the same for all individuals) and S(t) is given by the special case of the Kermack–McKendrick model (42)–(44), with S(0)=z>0,I(0)=1-y-z>0 and R(0)=y; Smes(t) is given by (10) and (13) but with n replaced by k. Note that since

PR(t)=y+0tγe-γτ(1-y-PS(t-τ))dτ,Rmes(t)=y+0tγe-γτ(1-y-Smes(t-τ))dτ,

and

R(t)=y+0tγe-γτ(1-y-S(t-τ))dτ,

then (68) implies that R(t)>Rmes(t)PR(t) for all t>0.

We already have PS(t)Smes(t) by Theorem 2 and the fact that the message passing system, in this case, has a unique solution. Thus, we may prove (68) and Corollary 4 by showing that Smes(t)>S(t) for all t>0.

Setting fn(τ)=(βk/n)e-(βk/n+γ)τ and f(τ)=βke-γτ, the Kermack–McKendrick model reduces to the system of ODEs (42)–(44) and the conditions for Theorem 6 are satisfied. Thus, letting Fsym(n)(t) be defined by (13) but with f(τ) replaced by fn(τ), and letting Smes(n)(t) be defined by (10) but with Fsym(t) replaced by Fsym(n)(t) (as in Sect. 3.4),

limnSmes(n)(t)=S(t)

and

Smes(n)(t)=Smes(t)ifn=k.

Therefore, if Smes(n)(t)zFsym(n)(t)n is strictly decreasing with respect to n, for all t>0, then we have Smes(t)>S(t) for all t>0. We now show this to be the case.

Letting un(t)=Fsym(n)(t)n(=Smes(n)(t)/z), we can write [c.f. (14)]

u˙n(t)=nγ(un(t)n-1n-un(t))-βk(un(t)-yun(t)n-1n-zun(t)2(n-1)n).

For fixed u(0,1), we have that un-1n is strictly decreasing with n, and also that

n(un-1n-u)=nu(u-1n-1)=ne-λ(eλn-1)(whereu=e-λ,soλ>0)=e-λk=11k!λknk-1

is strictly decreasing with n. Therefore, since un(0)=1 and un(t)(0,1) for t>0, it follows that un(t) (and hence Smes(n)(t)) is strictly decreasing with n for all t>0.

Footnotes

Research sponsered in part by The Leverhulme Trust Grant RPG-2014-341 to KJS.

Contributor Information

Robert R. Wilkinson, Email: robertwi@liv.ac.uk

Frank G. Ball, Email: frank.ball@nottingham.ac.uk

Kieran J. Sharkey, Email: kjs@liv.ac.uk

References

  1. Anderson RM, May RM. Infectious diseases of humans. Oxford: Oxford University Press; 1992. [Google Scholar]
  2. Bailey NTJ. The mathematical theory of infectious diseases. London: Griffin; 1975. [Google Scholar]
  3. Ball FG, Wilkinson RR, Sharkey KJ. Erratum: Message passing and moment closure for susceptible–infected–recovered epidemics on finite networks. Phys Rev E. 2015;92:039904. doi: 10.1103/PhysRevE.92.039904. [DOI] [PubMed] [Google Scholar]
  4. Barbour AD. The principle of the diffusion of arbitrary constants. J Appl Probab. 1972;9(3):519–541. doi: 10.1017/S0021900200035841. [DOI] [Google Scholar]
  5. Barbour AD. On a functional central limit theorem for Markov population processes. Adv Appl Probab. 1974;6(1):21–39. doi: 10.1017/S0001867800039690. [DOI] [Google Scholar]
  6. Barbour AD, Reinert G. Approximating the epidemic curve. Electron J Probab. 2013;18(54):1–30. [Google Scholar]
  7. Corduneanu C. Integral equations and applications. Cambridge: Cambridge University Press; 1991. [Google Scholar]
  8. Danon L, Ford AP, House T, Jewell CP, Keeling MJ, Roberts GO, Ross JV, Vernon MC. Networks and the epidemiology of infectious disease. Interdiscip Perspect Infect Dis. 2011 doi: 10.1155/2011/284909. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Diekmann O, de Jong MCM, Metz JAJ. A deterministic epidemic model taking account of repeated contacts between the same individuals. J Appl Probab. 1998;35(2):448–462. doi: 10.1017/S0021900200015072. [DOI] [Google Scholar]
  10. Donnelly P. The correlation structure of epidemic models. Math Biosci. 1993;117(1–2):49–75. doi: 10.1016/0025-5564(93)90017-5. [DOI] [PubMed] [Google Scholar]
  11. Esary JD, Proschan F, Walkup DW. Association of random variables, with applications. Ann Math Stat. 1967;38(5):1466–1474. doi: 10.1214/aoms/1177698701. [DOI] [Google Scholar]
  12. Godsil C, Royle G. Algebraic graph theory. New York: Springer; 2001. [Google Scholar]
  13. Karrer B, Newman MEJ. A message passing approach for general epidemic models. Phys Rev E. 2010;82:016101. doi: 10.1103/PhysRevE.82.016101. [DOI] [PubMed] [Google Scholar]
  14. Keeling MJ. The effects of local spatial structure on epidemiological invasions. Proc Biol Sci. 1999;266:589–867. doi: 10.1098/rspb.1999.0716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proc R Soc Lond A. 1927;115:700–721. doi: 10.1098/rspa.1927.0118. [DOI] [Google Scholar]
  16. Kiss IZ, Röst G, Zsolt V. Generalization of pairwise models to non-Markovian epidemics on networks. Phys Rev Lett. 2015;115:078701. doi: 10.1103/PhysRevLett.115.078701. [DOI] [PubMed] [Google Scholar]
  17. Kurtz TG. Solutions of ordinary differential equations as limits of pure jump Markov processes. J Appl Probab. 1970;7(1):49–58. doi: 10.1017/S0021900200026929. [DOI] [Google Scholar]
  18. Kurtz TG. Limit theorems for sequences of jump Markov processes approximating ordinary differential processes. J Appl Probab. 1971;8(2):344–356. doi: 10.1017/S002190020003535X. [DOI] [Google Scholar]
  19. Leung KY, Diekmann O. Dangerous connections: on binding site models of infectious disease dynamics. J Math Biol. 2017;74:619–671. doi: 10.1007/s00285-016-1037-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Miller JC. The spread of infectious disease through clustered populations. J R Soc Interface. 2009 doi: 10.1098/rsif.2008.0524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Miller JC. A note on a paper by Erik Volz: SIR dynamics in random networks. J Math Biol. 2011;62(3):349–358. doi: 10.1007/s00285-010-0337-9. [DOI] [PubMed] [Google Scholar]
  22. Miller JC. A note on the derivation of epidemic final sizes. Bull Math Biol. 2012;74(9):2125–2141. doi: 10.1007/s11538-012-9749-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Miller JC, Slim AC, Volz EM. Edge-based compartmental modelling for infectious disease spread. J R Soc Interface. 2011;9:890–906. doi: 10.1098/rsif.2011.0403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Pastor-Satorras R, Castellano C, Van Mieghem P, Vespignani A. Epidemic processes in complex networks. Rev Mod Phys. 2015;87:925. doi: 10.1103/RevModPhys.87.925. [DOI] [Google Scholar]
  25. Röst G, Vizi Z, Kiss IZ (2016) Pairwise approximation for SIR type network epidemics with non-Markovian recovery. arXiv:1605.02933 [DOI] [PMC free article] [PubMed]
  26. Sharkey KJ. Deterministic epidemiological models at the individual level. J Math Biol. 2008;57:311–331. doi: 10.1007/s00285-008-0161-7. [DOI] [PubMed] [Google Scholar]
  27. Sherborne N, Miller JC, Blyuss KB, Kiss IZ (2016) Mean-field models for non-Markovian epidemics on networks: from edge-based compartmental to pairwise models. arXiv:1611.04030 [DOI] [PMC free article] [PubMed]
  28. Simon PL, Taylor M, Kiss IZ. Exact epidemic models on graphs using graph-automorphism driven lumping. J Math Biol. 2011;62(4):479–508. doi: 10.1007/s00285-010-0344-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Trapman P. Reproduction numbers for epidemics on networks using pair approximation. Math Biosci. 2007;210:464–489. doi: 10.1016/j.mbs.2007.05.011. [DOI] [PubMed] [Google Scholar]
  30. Volz E. SIR dynamics in random networks with heterogeneous connectivity. J Math Biol. 2008;56(3):293–310. doi: 10.1007/s00285-007-0116-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Wilkinson RR, Ball FG, Sharkey KJ (2016) The deterministic Kermack-McKendrick model bounds the general stochastic epidemic. Scheduled for publication. J Appl Probab 53(4):1031–1040
  32. Wilkinson RR, Sharkey KJ. Message passing and moment closure for susceptible–infected–recovered epidemics on finite networks. Phys Rev E. 2014;89:022808. doi: 10.1103/PhysRevE.89.022808. [DOI] [PubMed] [Google Scholar]

Articles from Journal of Mathematical Biology are provided here courtesy of Springer

RESOURCES