The relationships between message passing, pairwise, Kermack–McKendrick and stochastic SIR epidemic models

Robert R Wilkinson; Frank G Ball; Kieran J Sharkey

doi:10.1007/s00285-017-1123-8

. 2017 Apr 13;75(6):1563–1590. doi: 10.1007/s00285-017-1123-8

The relationships between message passing, pairwise, Kermack–McKendrick and stochastic SIR epidemic models

Robert R Wilkinson ^1,^✉, Frank G Ball ², Kieran J Sharkey ¹

PMCID: PMC5641366 PMID: 28409223

Abstract

We consider a very general stochastic model for an SIR epidemic on a network which allows an individual’s infectious period, and the time it takes to contact each of its neighbours after becoming infected, to be correlated. We write down the message passing system of equations for this model and prove, for the first time, that it has a unique feasible solution. We also generalise an earlier result by proving that this solution provides a rigorous upper bound for the expected epidemic size (cumulative number of infection events) at any fixed time $t > 0$ . We specialise these results to a homogeneous special case where the graph (network) is symmetric. The message passing system here reduces to just four equations. We prove that cycles in the network inhibit the spread of infection, and derive important epidemiological results concerning the final epidemic size and threshold behaviour for a major outbreak. For Poisson contact processes, this message passing system is equivalent to a non-Markovian pair approximation model, which we show has well-known pairwise models as special cases. We show further that a sequence of message passing systems, starting with the homogeneous one just described, converges to the deterministic Kermack–McKendrick equations for this stochastic model. For Poisson contact and recovery, we show that this convergence is monotone, from which it follows that the message passing system (and hence also the pairwise model) here provides a better approximation to the expected epidemic size at time $t > 0$ than the Kermack–McKendrick model.

Keywords: Stochastic SIR epidemic, Kermack–McKendrick model, Non-Markovian, Message passing, Pairwise, Network

Introduction

One of the earliest and most comprehensively analysed epidemic models is the susceptible-infected-recovered (SIR) model of Kermack and McKendrick (1927). In addition to providing insights into threshold behaviour and vaccination, it has also underpinned much subsequent work in applied mathematical epidemiology (Anderson and May 1992). A stochastic version, constructed from similar assumptions, was defined and analysed later (for example, Bailey 1975, Chapter 6) and it became of interest to understand the relationship between the two (Kurtz 1970, 1971; Barbour 1972, 1974).

More recently, various heterogeneities have been added to both deterministic and stochastic epidemic models. A particularly important one is the contact network which allows for specific relationships between pairs of individuals; see Danon et al. (2011) and Pastor-Satorras et al. (2015) for reviews. While it is straightforward to simulate stochastic epidemics on networks, deterministic approximations have also been developed to assist our understanding. Important examples of these include pair approximation (Keeling 1999; Sharkey 2008), message passing (Karrer and Newman 2010) and edge-based models (Miller et al. 2011).

The message passing approximation for stochastic epidemics was developed by Karrer and Newman (2010) and is central to the work that we present here. This approach allows one to exactly capture the marginal distributions for the health statuses of individuals (i.e whether they are susceptible, infected or recovered) when the contact network is a tree and provides useful rigorous bounds for these distributions otherwise. Notably, the message passing approach is also applicable to extremely general non-Markovian stochastic epidemics and the number of equations it requires scales linearly with the number of connected pairs of individuals; far fewer than the number of Kolmogorov forward equations for the Markovian case which scales exponentially with population size. Wilkinson and Sharkey (2014) showed that, when contact processes are assumed to be Poisson, a generalised version of the message passing equations is equivalent to a pairwise model that is defined at the level of individuals, thus unifying two major representations of epidemic dynamics. Their argument relies on the application of Leibniz’s integral rule, so here we take the opportunity to provide sufficient conditions for the applicability of that rule in this context (“Appendix 4”).

In Sect. 2 we define a more general stochastic model which allows for realistic correlations between contact times and infectious periods. Specifically, it allows all of an individual’s post-infection contact times (to each of its neighbours), and the negative of its infectious period, to be positively correlated. This could capture, for example, a scenario where infected individuals adopt some disease-combating behaviour such as taking antiviral medication, increasing the infectious contact times to all of their neighbours and decreasing their infectious period. We write down the message passing system for this stochastic model in Sect. 2.1 and then, for the first time, provide a non-restrictive sufficient condition for the message passing equations to have a unique feasible solution (Theorem 1). This is important because so far, the message passing construction of Karrer and Newman has not been shown to give rise to a unique epidemic. We then, in Sect. 2.2, extend the results found in Karrer and Newman (2010) and Wilkinson and Sharkey (2014) to this more general stochastic model; for example, the message passing system cannot underestimate the expected epidemic size at any time $t > 0$ , i.e. the expected number of susceptibles infected during (0, t] (Theorem 2; Corollary 1). This is what led Karrer and Newman to describe the message passing system as providing a ‘worst case scenario’.

For all of Sect. 3, we focus on a special case of the above stochastic model which assumes a contact structure with a large amount of symmetry and that all individuals behave in the same way. We refer to this special case as the ‘homogeneous stochastic model’. The corresponding message passing system is written down in Sect. 3.1 and, after exploiting symmetries, this reduces to a system comprising of only four equations which we refer to as the ‘homogeneous message passing system’. This system is identical in form to a special case of the equations formulated by Karrer and Newman (2010, equations 26 and 27), although here it is related to a different stochastic model. We then obtain several epidemiologically relevant results in Sect. 3.2: the stochastic epidemic is shown to be inhibited by cycles in the contact network (Theorem 3), a simple relation for an upper bound on the final epidemic size in the stochastic model is proved and sufficient conditions for no major outbreak in the stochastic model are found (Theorem 4). The latter gives an upper bound on the critical vaccination coverage to prevent a major outbreak, assuming a perfect vaccination.

As a special case of the general correspondence shown in Wilkinson and Sharkey (2014), the homogeneous message passing system has an equivalent non-Markovian pairwise model when the contact processes are Poisson. In Sect. 3.3 we write down these equations explicitly (Theorem 5). This pairwise model provides exactly the same epidemic time course as the homogeneous message passing system and hence exactly the same upper bound on the epidemic size at time t (Corollary 2), and gives the same final epidemic size (Corollary 3). Pairwise models are known to give good approximations of stochastic epidemic dynamics on networks in a broad range of cases [see, for example, Keeling (1999) and Sharkey (2008)]. Thus the proof of equivalence when contact processes are Poisson suggests that message passing provides a good approximation as well as useful bounds.

In Sect. 3.4, we derive the classic Kermack–McKendrick epidemic model as an asymptotic special case of the homogeneous message passing system (Theorem 6). Notably, our derivation of such ‘deterministic’ epidemic models from the homogeneous message passing system allows us to relate them explicitly to the stochastic model [see also, for example, Trapman (2007) and Barbour and Reinert (2013)]. Thus, we are able to show that in the case where contact and recovery processes are independent and Poisson, the Kermack–McKendrick model bounds the expected epidemic size at time t in the homogeneous stochastic model (Corollary 4). However, the bound is coarser than that provided by the homogeneous message passing system and the pairwise system, which therefore give a better approximation than the Kermack–McKendrick model. The paper ends with a brief discussion in Sect. 4.

The stochastic model (non-Markovin network-based SIR dynamics)

We define a very general class of network-based stochastic epidemics which allow heterogeneous and non-Poisson individual-level processes, and heterogeneity in the initial states of individuals (including the case where the initial states of all individuals are non-random).

Let $G = (V, E)$ be an arbitrary (possibly countably infinite) simple, undirected graph, where $V$ is the set of vertices (individuals) and $E$ is the set of undirected edges between vertices (throughout the paper we will use the terms ‘graph’, ‘network’ and ‘contact network’ interchangeably). For $i \in V$ , let $N_{i} = {j \in V : (i, j) \in E}$ be the set of neighbours of i and let $| N_{i} | < \infty$ . We assume that two individuals are neighbours if and only if at least one can make direct contacts to the other. A particular realisation of the stochastic model is specified as follows. Each individual/vertex $i \in V$ is assigned a set of numbers $X_{i}$ relevant to the behaviour of i and the spread of the epidemic:

\begin{matrix} X_{i} = {Y_{i}, μ_{i}, ω_{j i} (j \in N_{i})}, \end{matrix}

where $Y_{i}$ is equal to 1, 2, or 3, according to whether i is instantaneously infected at $t = 0$ , initially susceptible or initially recovered/vaccinated, these being mutually exclusive; $μ_{i} \in [0, \infty]$ is i’s infectious period if i is ever infected; $ω_{j i} \in [0, \infty]$ is the time elapsing between i first becoming infected and it making a contact to j, if i is ever infected. Therefore, for $t \geq 0$ , i makes an infectious contact to j at time t if and only if (i) i becomes infected at some time $s \leq t$ , (ii) $ω_{j i} = t - s$ , and (iii) $ω_{j i} < μ_{i}$ . Susceptible individuals become infected as soon as they receive an infectious contact, and infected individuals immediately become recovered when their infectious period terminates (initially recovered/vaccinated individuals never become infected). We let $X = \cup_{i \in V} X_{i}$ . Thus, the state of the population at time $t \in [0, \infty)$ , which takes values in ${S, I, R}^{V}$ , is a function of $X$ .

The situation which we wish to consider is where $X$ is a set of random variables, so from now on we refer to $Y_{i}, μ_{i}, ω_{j i}$ , where $i \in V, j \in N_{i}$ , as random variables. We use $r_{i}$ and $h_{i j}$ to denote the (marginal) probability density functions (PDFs) for $μ_{i}$ and $ω_{i j}$ respectively, and $z_{i}$ and $y_{i}$ to denote $P (Y_{i} = 2)$ and $P (Y_{i} = 3)$ respectively. Thus, $P (Y_{i} = 1) = 1 - y_{i} - z_{i}$ . The probability that individual $i \in V$ is in state $Z \in {S, I, R}$ at time $t \geq 0$ is denoted by $P_{Z_{i}} (t)$ .

Importantly we assume that for every $i \in V$ ,

\begin{matrix} X_{i}^{*} = {- μ_{i}, ω_{j i} (j \in N_{i})} \end{matrix}

is a set of associated random variables, as defined by Esary et al. (1967) and discussed in this context by Donnelly (1993) and Ball et al. (2015). Additionally, we assume that the set of multivariate random variables ${X_{i} : i \in V}$ is mutually independent, and that $Y_{i}$ and $X_{i}^{*}$ are independent for all $i \in V$ . A finite set of random variables, $T_{1}, T_{2}, \dots, T_{n}$ say, is associated (or positively correlated) if

\begin{matrix} E [f (T_{1}, T_{2}, \dots, T_{n}) g (T_{1}, T_{2}, \dots, T_{n})] \geq E [f (T_{1}, T_{2}, \dots, T_{n})] E [g (T_{1}, T_{2}, \dots, T_{n})] \end{matrix}

for all non-decreasing real-valued functions f, g for which the expectations in (1) exist. Note that (1) implies that the correlation of any pair of these random variables is positive (i.e. $\geq 0$ ). Further, if $T_{1}, T_{2}, \dots, T_{n}$ are mutually independent, then they are associated; see Esary et al. (1967, Theorem 2.1).

The above assumptions of association and independence are made so as to obtain the maximum amount of generality while the message passing and pairwise systems, which we shall define, give rigorous bounds on the expected dynamics in the stochastic model, and exact correspondence when the graph is a tree or forest.

Our stochastic model represents a generalisation of that considered by Karrer and Newman (2010), and also generalises the model considered by Wilkinson and Sharkey (2014), which assumed that all of the elements of $X$ are mutually independent. Here, we do not make this last assumption and allow all of an individual’s post-infection contact times (to each of its neighbours), and the negative of its infectious period, to be positively correlated. This could capture, for example, the scenario where infected individuals tend to adopt some disease-combating behaviour, increasing the contact times to all of their neighbours and decreasing their infectious period.

The model considered by Wilkinson and Sharkey (2014), which incorporates a directed graph, is equivalent to a special case of the above model. Directedness is still captured by the above model since, for any given $i \in V$ and $j \in N_{i}$ , $ω_{i j}$ and $ω_{j i}$ are assigned independently.

The message passing system and its unique solution

Following Wilkinson and Sharkey (2014), we apply the message passing approach of Karrer and Newman (2010) to the stochastic model defined in Sect. 2. Recall that message passing relies on the concept of the cavity state in order to simplify calculations. An individual is placed into the cavity state by cancelling its ability to make contacts. This does not affect its own fate but it does affect the fates of others because it cannot pass on the infection.

For arbitrary $i \in V$ and neighbour $j \in N_{i}$ , let $H^{i \leftarrow j} (t)$ denote the probability that i, when in the cavity state, does not receive an infectious contact from j by time t. We can now write:

\begin{matrix} H^{i \leftarrow j} (t) = 1 - \int_{0}^{t} f_{i j} (τ) (1 - y_{j} - z_{j} Φ_{i}^{j} (t - τ)) d τ, \end{matrix}

where $f_{i j} (τ) Δ τ = h_{i j} (τ) P (μ_{j} > τ ∣ ω_{i j} = τ) Δ τ$ is the probability ( $+ o (Δ τ)$ ) that j makes an infectious contact to i during the time interval $[τ, τ + Δ τ)$ (for $Δ τ \to 0$ ), where time $τ$ is measured from the moment j becomes infected, and $Φ_{i}^{j} (t)$ is the probability that j does not receive any infectious contacts by time t when i and j are both in the cavity state. Note that although the stochastic model considered here is more general, $H^{i \leftarrow j} (t)$ may still be expressed, as in (2), similarly to equation 1 in Wilkinson and Sharkey (2014), because ${X_{i} : i \in V}$ is mutually independent and $Y_{i}$ is independent from $X_{i}^{*}$ for all $i \in V$ .

To obtain a solvable system, the probability $H^{i \leftarrow j} (t)$ is approximated by $F^{i \leftarrow j} (t)$ , where $F^{i \leftarrow j} (t) (i \in V, j \in N_{i})$ satisfies

\begin{matrix} F^{i \leftarrow j} (t) = 1 - \int_{0}^{t} f_{i j} (τ) (1 - y_{j} - z_{j} \prod_{k \in N_{j} \ i} F^{j \leftarrow k} (t - τ)) d τ . \end{matrix}

Any solution of (3) which gives $F^{i \leftarrow j} (t) \in [0, 1]$ for all $t \geq 0$ , and all $i \in V, j \in N_{i}$ , is called feasible. It was shown by Wilkinson and Sharkey (2014), following Karrer and Newman (2010), that a feasible solution exists as the limit of an iterative procedure.

The message passing system can now be defined (for $i \in V$ ):

\begin{matrix} S_{mes}^{(i)} (t) = & z_{i} \prod_{j \in N_{i}} F^{i \leftarrow j} (t), \end{matrix}

\begin{matrix} I_{mes}^{(i)} (t) = & 1 - S_{mes}^{(i)} (t) - R_{mes}^{(i)} (t), \end{matrix}

\begin{matrix} R_{mes}^{(i)} (t) = & y_{i} + \int_{0}^{t} r_{i} (τ) [1 - y_{i} - S_{mes}^{(i)} (t - τ)] d τ, \end{matrix}

where the variables on the left-hand side approximate $P_{S_{i}} (t)$ , $P_{I_{i}} (t)$ and $P_{R_{i}} (t)$ respectively (recall that $P_{S_{i}} (t)$ , $P_{I_{i}} (t)$ and $P_{R_{i}} (t)$ are respectively the probability that individual i is susceptible, infective and recovered-or-vaccinated at time t). Numerical evidence for the effectiveness of the message passing system, in capturing the expected dynamics of the stochastic model, can be seen in Figures 1 and 2 of Wilkinson and Sharkey (2014).

Note that the dimension of the message passing system (3)-(6) is appreciably smaller than that of the Kolomogorov forward equations for the case where the dynamics are Markovian. Suppose that $| V | = N$ . Then the forward equations have dimension $3^{N}$ and the message passing system has dimension at most $N (N - 1) + 3 N$ . In many cases, symmetries can be exploited to reduce the dimension of both the forward equations, see e.g. Simon et al. (2011), and the message passing system. However, the message passing system is still typically much smaller and can be very small, as in the model studied in Sect. 3.

Theorem 1

(Uniqueness of the feasible solution of the message passing system) Assume that

\begin{matrix} sup_{i \in V} | N_{i} | < \infty and sup_{(i, j) \in E} (sup_{τ \geq 0} f_{i j} (τ)) < \infty . \end{matrix}

Then there is a unique feasible solution of Eqs. (3)–(6) and the feasible $F^{i \leftarrow j} (t)$ are continuous and non-increasing for all $i \in V, j \in N_{i}$ .

Proof

See “Appendix 1”. $□$

It was shown by Wilkinson and Sharkey (2014) that when the graph is finite and $f_{i j} (τ) = T_{i j} e^{- T_{i j} τ} \int_{τ}^{\infty} r_{j} (τ^{'}) d τ^{'} (i \in V, j \in N_{i})$ , where $T_{i j} \in (0, \infty)$ , i.e. contact processes are Poisson and independent of recovery processes, then the message passing system (3)–(6) is equivalent to an individual-level pairwise system of integro-differential equations. It now follows that this pairwise system of equations also has a unique feasible solution.

The message passing system (3)–(6), which coincides with that given in Wilkinson and Sharkey (2014) although the underlying model here is more general, differs from the message passing system in Karrer and Newman (2010) in that the probability an individual is initially infected need not be the same for all individuals, and individuals may be initially recovered or vaccinated. The system (3)–(6) also accounts for heterogeneity in the recovery and contact processes. A key use of message passing equations is that they yield a rigorous upper bound for the mean spread in the underlying stochastic epidemic. In the next subsection, we show that this property extends to our more general model.

Bounding the expected epidemic size at time $t$

For $t \geq 0$ , let X(t) denote the number of susceptibles at time t. Thus, $X (0) - X (t)$ is the total number of individuals infected by time t not counting those infected at $t = 0$ . We refer to this quantity as the epidemic size at time t.

Theorem 2

(Message passing bounds the marginal distribution for the health status of an individual) For all $t \geq 0$ and all $i \in V$ ,

\begin{matrix} P_{S_{i}} (t) \geq & S_{mes}^{(i)} (t), \end{matrix}

\begin{matrix} P_{R_{i}} (t) \leq & R_{mes}^{(i)} (t), \end{matrix}

with equality if G is a tree or forest.

Proof

In the case where $X$ is mutually independent and $V$ is finite, this is proved by Wilkinson and Sharkey (2014) and Ball et al. (2015) by generalising Karrer and Newman (2010). The proof for our current more general model is in “Appendix 2”. $□$

For $t \geq 0$ , let Z(t) denote the number of recovered-or-vaccinated individuals at time t. The following corollary follows immediately from Theorem 2 on noting that, for $t \geq 0$ ,

\begin{matrix} E [X (t)] = \sum_{i} P_{S_{i}} (t) and E [Z (t)] = \sum_{i} P_{R_{i}} (t) . \end{matrix}

Corollary 1

For all $t \geq 0$ , we have $E [X (t)] \geq \sum_{i} S_{mes}^{(i)} (t)$ and $E [Z (t)] \leq \sum_{i} R_{mes}^{(i)} (t)$ , with equality occurring when the graph is a tree or forest. The expected epidemic size at time t is given by $E [X (0) - X (t)] = \sum_{i \in V} z_{i} - E [X (t)]$ . Thus, since we have a lower bound on E[X(t)] we also have an upper bound on the expected epidemic size at time t.

The homogeneous stochastic model

In this section we consider a special case of the stochastic model, and we refer to this special case as ‘the homogeneous stochastic model’. In the homogeneous stochastic model, the graph is symmetric and connected. Examples of symmetric connected graphs include complete graphs, ring lattices, infinite square lattices and Bethe lattices. In a symmetric graph, each individual has the same (finite) number n of neighbours, and we say that the graph is n-regular. To avoid triviality we assume $n \geq 2$ .

Definition 1

A graph $G = (V, E)$ is called symmetric if it is arc-transitive; i.e. for any two ordered pairs of neighbours i, j, and $i^{'}, j^{'}$ , there exists a graph-automorphism which maps i to $i^{'}$ and j to $j^{'}$ (Godsil and Royle 2001).

Additionally, in the homogeneous stochastic model, the joint distribution of $(Y_{i}, μ_{i}, ω_{j i} (j \in N_{i}))$ is symmetric in its last n arguments and is the same for all $i \in V$ . Thus, it is impossible to distinguish between any two individuals by their behaviour or by their position in the graph. Note that we have not precluded the variables in $X_{i}^{*}$ from being non-trivially associated (for all $i \in V$ ), i.e. i’s infectious period and the time it takes for it to contact each of its neighbours, after infection, may all be non-trivially correlated.

We use r and h to denote the (marginal) PDFs for $μ_{i}$ and $ω_{i j}$ respectively, and z and y to denote $P (Y_{i} = 2)$ and $P (Y_{i} = 3)$ respectively. Thus, $P (Y_{i} = 1) = 1 - y - z$ . To avoid triviality, we assume that $0 < z < 1$ and $0 \leq y < 1 - z$ .

Owing to symmetry (in this special case), the probability distribution for the health status of an individual is the same for all individuals, i.e. for all $i, i^{'} \in V$ and all $t \geq 0$ , we have $P_{S_{i}} (t) = P_{S_{i^{'}}} (t), P_{I_{i}} (t) = P_{I_{i^{'}}} (t)$ and $P_{R_{i}} (t) = P_{R_{i^{'}}} (t)$ (let $P_{S} (t), P_{I} (t)$ and $P_{R} (t)$ denote these quantities). Similarly, for all $i \in V, j \in N_{i}$ and all $i^{'} \in V, j^{'} \in N_{i^{'}}$ , and all $t \geq 0$ , we have $H^{i \leftarrow j} (t) = H^{i^{'} \leftarrow j^{'}} (t)$ (let $H_{sym} (t)$ denote this quantity).

The homogeneous message passing system

For the homogeneous stochastic model, (3) becomes

\begin{matrix} F^{i \leftarrow j} (t) = 1 - \int_{0}^{t} f (τ) (1 - y - z \prod_{k \in N_{j} \ i} F^{j \leftarrow k} (t - τ)) d τ (i \in V, j \in N_{i}), \end{matrix}

where we have used $f_{i j} (τ) = f_{i^{'} j^{'}} (τ)$ for all $i \in V, j \in N_{i}$ , and all $i^{'} \in V, j \in N_{i^{'}}$ , and all $τ \geq 0$ , and we let $f (τ)$ denote this quantity.

The arc-transitivity of symmetric graphs and the symmetry in (9) allow us to simplify (3)–(6), and to write down the full homogeneous message passing system as:

\begin{matrix} S_{mes} (t) = & z F_{sym} {(t)}^{n}, \end{matrix}

\begin{matrix} I_{mes} (t) = & 1 - S_{mes} (t) - R_{mes} (t), \end{matrix}

\begin{matrix} R_{mes} (t) = & y + \int_{0}^{t} r (τ) [1 - y - S_{mes} (t - τ)] d τ, \end{matrix}

where

\begin{matrix} F_{sym} (t) = & 1 - \int_{0}^{t} f (τ) [1 - y - z F_{sym} {(t - τ)}^{n - 1}] d τ . \end{matrix}

In deriving these equations, we have used $F^{i \leftarrow j} (t) = F^{i^{'} \leftarrow j^{'}} (t)$ for all $i \in V, j \in N_{i}$ and all $i^{'} \in V, j^{'} \in N_{i^{'}}$ , and all $t \geq 0$ , and we let $F_{sym} (t)$ denote this quantity. Note that we have also made use of the fact that every individual has n neighbours. This system is identical in form (when vaccination is disallowed) to the message passing system for the configuration network model provided by Karrer and Newman (2010, equations 26 and 27, making use of equations 1, 4 and 5), in the case where every individual has n neighbours with probability 1. From Theorem 1, we know that if ${sup}_{τ \geq 0} f (τ) < \infty$ then (13) has a unique feasible solution.

For clarity we write out these equations for the simplifying cases of Poisson transmission and recovery processes, and Poisson transmission and fixed (non-random) recovery.

Example 1

(Poisson transmission and recovery) For independent Poisson transmission and recovery processes (specifically, $τ_{i}$ and $ω_{j i}$ are independent and exponentially distributed with rates $γ$ and $β$ respectively), with $f (τ) = β e^{- (β + γ) τ}$ , the homogeneous message passing system can be solved via the following ordinary differential equations (ODEs):

\begin{matrix} {\dot{F}}_{sym} (t) = & γ (1 - F_{sym} (t)) - β (F_{sym} (t) - y - z F_{sym} {(t)}^{n - 1}), \end{matrix}

\begin{matrix} {\dot{R}}_{mes} (t) = & γ I_{mes} (t), \end{matrix}

with $S_{mes} (t)$ and $I_{mes} (t)$ given by (10) and (11).

Example 2

(Poisson transmission and fixed recovery) For Poisson transmission processes and a fixed recovery period (specifically, $τ_{i}$ is non-random with value $R \in [0, \infty]$ and $ω_{j i}$ is exponentially distributed with rate $β$ ), with $f (τ) = β e^{- β τ} (1 - θ (t - R))$ where $θ$ is the Heaviside step function, the homogeneous message passing system can be solved using the following delay differential equation:

\begin{matrix} {\dot{F}}_{sym} (t) = & - β (F_{sym} (t) - y - z F_{sym} {(t)}^{n - 1} \\ - θ (t - R) e^{- β R} (1 - y - z F_{sym} {(t - R)}^{n - 1})), \end{matrix}

with

\begin{matrix} R_{mes} (t) = & y + θ (t - R) (1 - y - S_{mes} (t - R)), \end{matrix}

and $S_{mes} (t)$ and $I_{mes} (t)$ given by (10) and (11).

Other choices of $f (τ)$ exist which allow the message passing system to be solved via (non-integro) differential equations, such as the top hat function (Karrer and Newman 2010, equation 33).

Epidemiological results

As well as bounding/approximating (or correctly computing in the case of an infinite regular tree) the expected fractional epidemic size at time $t \geq 0$ , the homogeneous message passing system generates other epidemiologically relevant results for the stochastic model, as demonstrated here.

Theorem 3

(Cycles in the network inhibit the stochastic epidemic) Suppose that ${sup}_{τ \geq 0} f (τ) < \infty$ . The probability of an arbitrary individual being susceptible at a given time, for the n-regular Bethe lattice (infinite tree), is less than or equal to this quantity for all other n-regular symmetric graphs (where the homogeneous stochastic model is otherwise unchanged). The same holds for the probability of an arbitrary individual being recovered except with the inequality reversed.

Proof

From Theorem 2, we know that system (10)–(13) cannot overestimate the probability of an arbitrary individual being susceptible at time t and cannot underestimate the probability of an arbitrary individual being recovered at time t. However, also from Theorem 2, the system is exact if the graph is a tree. $□$

Theorem 3 suggests that, all other things being equal, an infection will have the greatest impact by time t when the contact structure is most tree-like. Indeed, it is known that clustering and the presence of cycles in the graph may slow down and limit the spread of an infection (see Miller (2009) and references therein).

Theorem 4

(Final epidemic size relation and sufficient conditions for no major outbreak) For all $t \geq 0$ ,

\begin{matrix} S_{mes} (\infty) \leq P_{S} (t), R_{mes} (\infty) \geq P_{R} (t), \end{matrix}

where $S_{mes} (\infty) \equiv {lim}_{t \to \infty} S_{mes} (t)$ may be computed as the unique solution in [0, z] of

\begin{matrix} {(\frac{S_{mes} (\infty)}{z})}^{\frac{1}{n}} = 1 - p + p y + p z {(\frac{S_{mes} (\infty)}{z})}^{\frac{n - 1}{n}}, \end{matrix}

with $p \equiv \int_{0}^{\infty} f (τ) d τ$ , and $R_{mes} (\infty) = 1 - S_{mes} (\infty)$ .

Further, when the fraction initially infected is small, i.e. $z \to 1 - y$ from below, then

\begin{matrix} P_{S} (\infty) = P_{S} (0) if y \geq 1 - \frac{1}{R_{0}} or R_{0} \leq 1, \end{matrix}

where $R_{0} \equiv (n - 1) p$ . (This means that if each individual is independently vaccinated with probability greater than or equal to $1 - 1 / R_{0}$ , or if $R_{0} \leq 1$ , then a major outbreak of the disease is impossible.)

Proof

Equation (18) follows from Theorem 2 and the observation that $P_{S} (t)$ and $P_{R} (t)$ are non-increasing and non-decreasing respectively.

The feasible $F_{sym} (t)$ is non-increasing (see Theorem 1), so it converges to some $F_{sym} (\infty) \in [0, 1]$ as $t \to \infty$ . Note also that, by definition, $\int_{0}^{t} f (τ) d τ$ converges to $p \in [0, 1]$ as $t \to \infty$ . Now, using (13), we can write $F_{sym} (t) = 1 - \int_{0}^{\infty} f_{t} (τ) d τ$ , where $f_{t} (τ) = f (τ) (1 - y - z F_{sym} {(t - τ)}^{n - 1})$ for $τ \in [0, t]$ and is equal to zero for $τ > t$ . Note that $f_{t} (τ)$ converges pointwise to $f (τ) (1 - y - z F_{sym} {(\infty)}^{n - 1})$ as $t \to \infty$ . Thus, since $0 \leq f_{t} (τ) \leq f (τ)$ for all $t, τ \geq 0$ , we can use the dominated convergence theorem to obtain, c.f. Karrer and Newman (2010, equations 23 and 24),

\begin{matrix} F_{sym} (\infty) = 1 - \int_{0}^{\infty} lim_{t \to \infty} f_{t} (τ) d τ = 1 - p (1 - y - z F_{sym} {(\infty)}^{n - 1}) . \end{matrix}

Taking the limit as $t \to \infty$ in (10), and making use of (21), proves equation (19). It is straightforward to show by graphical means that (19) has a unique solution in [0, z]. In the case where $z \to 1 - y$ from below, it is also straightforward to show by graphical means that, after setting $z = 1 - y$ in (19), then $S_{mes} (\infty) = z (= S_{mes} (0) = P_{S} (0))$ is the only solution in [0, z] if $y \geq 1 - 1 / R_{0}$ ( $R_{0} \leq 1$ implies this condition). Equation (20) is then proved by noting that $P_{S} (t) \geq S_{mes} (t)$ , and $P_{S} (t)$ is non-increasing from $P_{S} (0) = S_{mes} (0)$ . $□$

Equation 19 is consistent with the final size relation given by Diekmann et al. (1998) (equations 5.3 and 5.4) for a regular random graph in the limit of large population size.

Remark 1

Consider an infinite sequence of finite homogeneous stochastic models, indexed by m, where $y_{m} = y \in [0, 1)$ for all m, and where $N_{m} \to \infty$ , $p_{m} (n_{m} - 1) \to R_{0} < \infty$ , $z_{m} \to 1 - y$ , as $m \to \infty$ (here, $N_{m}$ denotes the number of individuals in the mth model). This does not preclude the expected number of initial infectives from tending to some positive number, or even diverging, as $m \to \infty$ . It is straightforward that, in the limit of this sequence, the sufficient conditions for no major outbreak in Theorem 4 still hold. Note that if in addition we have $n_{m} \to \infty$ as $m \to \infty$ , then the final size relation for the homogeneous message passing system (in this limit) becomes, using (19) with $z = 1 - y$ ,

\begin{matrix} \frac{S_{mes} (\infty)}{1 - y} = e^{- R_{0} (1 - S_{mes} (\infty) - y)} . \end{matrix}

This is a well-known final size relation in the mean field literature, although usually vaccination is not included (see Miller (2012) for a discussion of derivations of this relation).

The homogeneous message passing system gives the same epidemic time course as a pairwise model

Here we show that a generalised pairwise SIR model, with well-known pairwise models as special cases, gives the same epidemic time course as the homogeneous message passing system. This allows us to prove epidemiological results for the generalised pairwise model. Since pairwise models are known to give good approximations of stochastic epidemic dynamics on networks [see, for example, Keeling (1999) and Sharkey (2008)], this also strengthens the case for the message passing system being a good approximation.

Theorem 5

(Equivalence of the message passing and pairwise models) For the homogeneous stochastic model, assume that the contact processes are Poisson with rate $β$ and that they are independent from the recovery processes, such that $f (τ) = β e^{- β τ} \int_{τ}^{\infty} r (τ^{'}) d τ^{'}$ . Assume also that $r (τ)$ is continuous. Then,

\begin{matrix} \dot{[S]} (t) = & - β [S I] (t), \end{matrix}

\begin{matrix} \dot{[I]} (t) = & β [S I] (t) - \int_{0}^{t} r (τ) β [S I] (t - τ) d τ - r (t) N (1 - y - z), \end{matrix}

\begin{matrix} \dot{[S S]} (t) = & - 2 β \frac{n - 1}{n} \frac{[S S] (t) [S I] (t)}{[S] (t)}, \end{matrix}

\begin{matrix} \dot{[S I]} (t) = & - β (\frac{n - 1}{n}) \frac{[S I] (t) [S I] (t)}{[S] (t)} \\ - β [S I] (t) \\ + β (\frac{n - 1}{n}) \frac{[S S] (t) [S I] (t)}{[S] (t)} \\ - \int_{0}^{t} e^{- β τ} r (τ) β (\frac{n - 1}{n}) \frac{[S S] (t - τ) [S I] (t - τ)}{[S] (t - τ)} \\ \times \exp (- \int_{t - τ}^{t} β (\frac{n - 1}{n}) \frac{[S I] (τ^{'})}{[S] (τ^{'})} d τ^{'}) d τ \\ - n N z e^{- β t} r (t) (1 - y - z) \exp (- \int_{0}^{t} β (\frac{n - 1}{n}) \frac{[S I] (τ)}{[S] (τ)} d τ), \end{matrix}

where

\begin{matrix} [S] (t) \equiv & N S_{mes} (t), \end{matrix}

\begin{matrix} [I] (t) \equiv & N I_{mes} (t), \end{matrix}

\begin{matrix} [S S] (t) \equiv & n N S S_{mes} (t) \equiv n N z^{2} F_{sym} {(t)}^{2 (n - 1)}, \end{matrix}

\begin{matrix} [S I] (t) \equiv & n N S I_{mes} (t) \equiv n N z F_{sym} {(t)}^{n - 1} (\frac{- {\dot{F}}_{sym} (t)}{β}), \end{matrix}

and N is a positive number.

Proof

See “Appendix 3”. $□$

Corollary 2

(At all time points the pairwise model cannot underestimate the expected epidemic size) If, in the homogeneous stochastic model, contact processes are Poisson with rate $β$ , i.e. the marginal distribution for $ω_{j i}$ is exponential with parameter $β$ for all $i \in V, j \in N_{i}$ , and these are independent from the infectious periods, then

\begin{matrix} [S] (t) / N \leq P_{S} (t), [R] (t) / N \geq P_{R} (t) (t \geq 0), \end{matrix}

where $[R] (t) = N - [S] (t) - [I] (t)$ .

Proof

This follows immediately from Theorems 2 and 5. $□$

Corollary 3

(Final epidemic size equation for the pairwise model)

\begin{matrix} {(\frac{[S] (\infty)}{N z})}^{\frac{1}{n}} = 1 - p + p y + p z {(\frac{[S] (\infty)}{N z})}^{\frac{n - 1}{n}}, \end{matrix}

where $[S] (\infty) \equiv {lim}_{t \to \infty} [S] (t)$ and $p \equiv \int_{0}^{\infty} β e^{- β τ^{'}} \int_{τ^{'}}^{\infty} r (τ) d τ d τ^{'}$ .

Proof

This follows immediately from Theorems 4 and 5. $□$

Note that (22)–(25) constitute a closed system for the variables [S](t), [I](t), [SS](t) and [SI](t) (if $[S] (t) = 0$ then the right-hand sides of (24) and (25) are undefined, but in this case the left-hand sides are equal to zero). With reference to (28) and (29), the quantities $S S_{mes} (t)$ and $S I_{mes} (t)$ are constructed to capture/approximate, for any given pair of neighbours at time t, the probability that they are both susceptible and the probability that the first is susceptible while the second is infected respectively (see “Appendix 3”). The system (22)–(25) also follows directly from application of the individual-level pairwise equations in Wilkinson and Sharkey (2014, equations 8 and 9). In the case where the infectious period is exponentially distributed and letting N be the population size, (23) and (25) simplify to ODEs, and the pairwise (without clustering) model of Keeling (1999) is obtained. Similarly, after substituting $r (τ) = δ (t - R)$ , where $δ$ is the Dirac delta function, into (23) and (25), the pairwise model of Kiss et al. (2015) for a non-random infectious period of duration R is obtained (except that the last term in (23) and the last term in (25), which relate to the behaviour of the initial infectives, need to be neglected). However, it may be more efficient to solve the simpler message passing systems [via (14)–(15) and (16)–(17) respectively] and then, if pairwise quantities are required, these can be computed using (28) and (29).

As part of the proof of equivalence between message passing and pairwise models that we present here, we also close a gap in the arguments of Wilkinson and Sharkey (2014) by demonstrating sufficient conditions for the valid application of Leibniz’s integral rule (“Appendix 4”) in the derivation of the pairwise equations from the message passing equations (“Appendix 3”).

The homogeneous message passing system gives the same epidemic time course as the Kermack–McKendrick model (asymptotically)

Here, we consider a sequence of homogeneous stochastic models where the regular degree n tends to infinity. As $n \to \infty$ , an individual is able to make contacts to a number of neighbours which tends to infinity, so to obtain a finite limit we assume that the infection function $f (τ)$ depends on n (which we write $f_{n} (τ)$ ) such that:

\begin{matrix} lim_{n \to \infty} n f_{n} (τ) = f^{*} (τ) < \infty (τ \geq 0) . \end{matrix}

Note that, in the limit of large n, transmission is frequency dependent and the expected number of infectious contacts made by a given infected individual during the time interval $(t_{1}, t_{2})$ is $\int_{t_{1}}^{t_{2}} f^{*} (τ) d τ$ , where time is measured from the moment the individual first became infected.

The deterministic model proposed by Kermack and McKendrick (1927) is as follows:

\begin{matrix} \dot{S} (t) = & S (t) [\int_{0}^{t} f^{*} (τ) \dot{S} (t - τ) d τ - I (0) f^{*} (t)], \end{matrix}

\begin{matrix} I (t) = & 1 - S (t) - R (t), \end{matrix}

\begin{matrix} R (t) = & R (0) + \int_{0}^{t} r (τ) [1 - R (0) - S (t - τ)] d τ . \end{matrix}

Equations 12–15 of Kermack and McKendrick (1927) may be obtained from (30) to (32) after multiplying through by the total population size N in their paper.

The following theorem shows that, under this limiting regime and mild further conditions, the homogeneous message passing system gives the same epidemic time course as the model of Kermack and McKendrick (1927). For $n = 1, 2, \dots$ , let $S_{mes (n)} (t), I_{mes (n)} (t)$ and $R_{mes (n)} (t)$ denote the message passing system given by (10)-(13), where $F_{sym} (t)$ is replaced by $F_{sym (n)} (t)$ , which satisfies (13) with $f (τ)$ replaced by $f_{n} (τ)$ .

Theorem 6

(Deriving the Kermack–McKendrick model from message passing) Suppose that for all $T \geq 0$ ,

(i)
$ϵ_{n} (T) = {sup}_{0 \leq t \leq T} | n f_{n} (t) - f^{*} (t) | \to 0$ as $n \to \infty$ ,
(ii)
$M_{T} = {sup}_{0 \leq t \leq T} f^{*} (t) < \infty$ ,

and that, for all $n = 1, 2, \dots,$

(iii)
$f_{n} (t)$ is continuously differentiable,
(iv)
$(S_{mes (n)} (0), I_{mes (n)} (0), R_{mes (n)} (0)) = (S (0), I (0), R (0)) = (z, 1 - z - y, y)$ .

Then, for all $T > 0$ ,

\begin{matrix} lim_{n \to \infty} sup_{0 \leq t \leq T} |S_{mes (n)} (t) - S (t)| = & 0, \end{matrix}

\begin{matrix} lim_{n \to \infty} sup_{0 \leq t \leq T} |I_{mes (n)} (t) - I (t)| = & 0, \end{matrix}

\begin{matrix} lim_{n \to \infty} sup_{0 \leq t \leq T} |R_{mes (n)} (t) - R (t)| = & 0 . \end{matrix}

Proof

Fix $T > 0$ and note first from (13) that, for feasible $F_{sym (n)} (t)$ and all $t \in [0, T]$ ,

\begin{matrix} 1 \geq F_{sym (n)} (t) \geq 1 - \int_{0}^{t} f_{n} (τ) d τ (n = 1, 2, \dots) . \end{matrix}

Now $n \int_{0}^{t} f_{n} (τ) d τ \leq T (M_{T} + ϵ_{n} (T))$ , for all $t \in [0, T]$ , so conditions (i) and (ii) imply that there exists $ϵ_{n}^{(1)} (T) \geq 0$ such that for all $t \in [0, T]$ ,

\begin{matrix} 1 \geq F_{sym (n)} (t) \geq 1 - ϵ_{n}^{(1)} (T) (n = 1, 2, \dots), \end{matrix}

where $ϵ_{n}^{(1)} (T) \to 0$ as $n \to \infty$ . Thus, for all sufficiently large n, $F_{sym (n)} (t)$ is non-zero for all $t \in [0, T]$ .

Differentiating (10) yields

\begin{matrix} {\dot{S}}_{mes (n)} (t) = n z F_{sym (n)} {(t)}^{n - 1} {\dot{F}}_{sym (n)} (t), \end{matrix}

and differentiating (13), using Leibniz’s integral rule (see “Appendix 4”), gives

\begin{matrix} {\dot{F}}_{sym (n)} (t) = & - f_{n} (t) (1 - y - z) \\ + (n - 1) z \int_{0}^{t} f_{n} (τ) F_{sym (n)} {(t - τ)}^{n - 2} {\dot{F}}_{sym (n)} (t - τ) d τ . \end{matrix}

Substituting (38) into (37), and using (10), gives

\begin{matrix} {\dot{S}}_{mes (n)} (t) = & \frac{S_{mes (n)} (t)}{F_{sym (n)} (t)} [\frac{n - 1}{n} \int_{0}^{t} n f_{n} (τ) \frac{{\dot{S}}_{mes (n)} (t - τ)}{F_{sym (n)} (t - τ)} d τ \\ - n f_{n} (t) (1 - y - z)] . \end{matrix}

It can be shown, using (30) and (39) that, for all $t \in [0, T]$ ,

\begin{matrix} |{\dot{S}}_{mes (n)} (t) - \dot{S} (t)| \leq A (n, T) \int_{0}^{t} |{\dot{S}}_{mes (n)} (u) - \dot{S} (u)| d u + B (n, T), \end{matrix}

where $B (n, T) \to 0$ as $n \to \infty$ and $0 \leq A (n, T) \leq 4 M_{T}$ for all sufficiently large n (see “Appendix 5”). Application of Gronwall’s inequality (see “Appendix 4”) then yields that, for all $t \in [0, T]$ ,

\begin{matrix} |{\dot{S}}_{mes (n)} (t) - \dot{S} (t)| \leq B (n, T) e^{A (n, T) t} . \end{matrix}

Thus

\begin{matrix} lim_{n \to \infty} sup_{0 \leq t \leq T} |{\dot{S}}_{mes (n)} (t) - \dot{S} (t)| = 0, \end{matrix}

whence

\begin{matrix} lim_{n \to \infty} sup_{0 \leq t \leq T} |S_{mes (n)} (t) - S (t)| = & lim_{n \to \infty} sup_{0 \leq t \leq T} |\int_{0}^{t} {\dot{S}}_{mes (n)} (u) - \dot{S} (u) d u| \\ \leq & lim_{n \to \infty} sup_{0 \leq t \leq T} \int_{0}^{t} |{\dot{S}}_{mes (n)} (u) - \dot{S} (u)| d u \\ \leq & lim_{n \to \infty} T sup_{0 \leq t \leq T} |{\dot{S}}_{mes (n)} (t) - \dot{S} (t)| \\ = & 0, \end{matrix}

proving (33). Equation (35) now follows using a similar argument and (34) is then immediate. $□$

It is straightforward that if $f^{*} (τ) = β k e^{- γ τ}$ and $r (τ) = γ e^{- γ τ}$ then the Kermack–McKendrick model reduces to a system of ODEs:

\begin{matrix} \dot{S} (t) = & - β k S (t) I (t), \end{matrix}

\begin{matrix} \dot{I} (t) = & β k S (t) I (t) - γ I (t), \end{matrix}

\begin{matrix} \dot{R} (t) = & γ I (t) . \end{matrix}

For this special case, we state the following corollary to Theorem 6, see also Wilkinson et al. (2016), where it is proved that, for non-random initial conditions, the Kermack–McKendrick model bounds the so-called ‘general stochastic epidemic’.

Corollary 4

(In the Markovian case, message passing and pairwise models are better approximations than the Kermack–McKendrick model) Assume that, in the homogeneous stochastic model, contact and recovery processes are independent and Poisson with rates $β$ and $γ$ respectively. Specifically, $h (τ) = β e^{- β τ}$ , $r (τ) = γ e^{- γ τ}$ and $f (τ) = β e^{- (β + γ) τ}$ . Let k denote the regular degree of the symmetric graph (instead of n). For this special case,

\begin{matrix} S (t) < S_{mes} (t) \leq P_{S} (t), R (t) > R_{mes} (t) \geq P_{R} (t) (t > 0), \end{matrix}

where S(t) and R(t) are given by (42)–(44), with $S (0) = z, I (0) = 1 - y - z$ and $R (0) = y$ , and $S_{mes} (t)$ and $R_{mes} (t)$ are given by (9)–(12) with n replaced by k.

Proof

See “Appendix 6”. $□$

Discussion

The message passing equations of Karrer and Newman (2010) approximate the expected time course for non-Markovian SIR epidemic dynamics on networks. In a later paper, Wilkinson and Sharkey (2014) slightly generalised their equations in order to make them applicable to stochastic models with more individual level heterogeneity. Here, for the first time, we have shown that Karrer and Newman’s system of message passing equations, and its generalisation, have unique feasible solutions (Theorem 1).

An important feature of the message passing equations is that they produce an upper bound to the expected epidemic size (cumulative number of infection events) at every point in time. Thus, they give a ‘worst case scenario’. In addition, they exactly capture the expected epidemic when the contact network is a tree. Here, we extended these results to a further generalised stochastic model which includes realistic correlations between post-infection contact times and the infectious period (Theorem 2). This situation can occur when individuals may adopt disease-combating behaviour, such as taking antiviral medication, which acts both on the ability of an individual to pass on the infection as well as the duration of their infectivity.

Much of this paper was devoted to a special case of the stochastic model which we referred to as the ‘homogeneous stochastic model’, in which individuals are homogeneous and the contact network is a symmetric graph (correlations between post-infection contact times and the infectious period are still allowed). Examples of a symmetric graph include a finite complete graph, an infinite square lattice and an infinite Bethe lattice. Due to symmetry, the message passing system here reduces to just four equations which we refer to as the ‘homogeneous message passing system’. This system is equivalent in form to a special case of the system found by Karrer and Newman (2010) to describe epidemic dynamics on random configuration networks, but here it is applied to a different stochastic model. These equations were analysed, making use of Theorem 2, to obtain a result which shows that cycles in the contact network serve to inhibit the stochastic epidemic (Theorem 3). Following arguments from Karrer and Newman (2010), we also obtained a single equation which provides an upper bound on the final epidemic size (Theorem 4); for the Bethe lattice, the final epidemic size is captured exactly. This naturally provides sufficient conditions, in terms of an $R_{0}$ -like quantity and the level of vaccination, for there to be no major outbreak (Theorem 4).

We found that the ‘limit’ of an appropriate sequence of homogeneous message passing systems gives the same epidemic time course as the Kermack and McKendrick (1927) epidemic model (Theorem 6) showing that it can be viewed as a special case of message passing. This also has the advantage of relating it to the underlying stochastic model (see also Barbour and Reinert (2013) who establish an exact correspondence). The final epidemic size result, and sufficient conditions for no major outbreak, described above for the homogeneous message passing system then translate directly.

From the homogeneous message passing system, we also constructed an equivalent population-level pairwise system which incorporates a general infectious period (Theorem 5). This can also be derived directly as a special case of the general individual-level pairwise system of Wilkinson and Sharkey (2014, equations 8 and 9) by applying the conditions of the homogeneous stochastic model. Here we filled a gap in the arguments of this paper by demonstrating sufficient conditions for the valid application of Leibniz’s integral rule (“Appendix 4”). This population-level pairwise system contains the Poisson pairwise model (without clustering) of Keeling (1999) as a special case. It also contains the delay differential equation model of Kiss et al. (2015) as a special case. We note that an entirely different derivation of (22)–(25) has been found independently and in parallel by Röst et al. (2016).

In general, we have emphasised the equivalence between several different types of SIR epidemic model. Specifically, we mention the derivation of the Kermack–McKendrick model as a special case of message passing and the equivalence (under Markovian transmission) of message passing and a class of pairwise models [see also Wilkinson and Sharkey (2014)]. We also note the recently submitted paper by Sherborne et al. (2016) which highlights the equivalence of message-passing and edge-based models (Miller et al. 2011), and that there is equivalence between edge-based models and the model of Volz (2008) [proved by Miller (2011)], and between the model of Volz and the binding site model of Leung and Diekmann (2017, Remark 1). While for SIR dynamics, message passing provides quite a general unifying framework, we note that for other dynamics such as SIS, it remains difficult to formulate a similar construction.

Unification of models is valuable in narrowing the lines of enquiry and simplifying ongoing research. In addition, owing to their different constructions, different types of results have been more forthcoming for some models than for others, and unification can allow results for one model to be automatically transferred to another. For example, here, by unification with message passing, we have been able to show that when contact and recovery processes are independent and Poisson, the Kermack–McKendrick model (which then reduces to a mass action ODE model) provides a rigorous upper bound on the expected epidemic size at time $t > 0$ in the homogeneous stochastic model (Corollary 4). However, the bound is coarser than that provided by the message passing and pairwise systems, so we now know that these are better approximations. This extends the result that, for non-random initial conditions, the Kermack–McKendrick model bounds the so-called ‘general stochastic epidemic’ (Wilkinson et al. 2016). An interesting development would be to show that the Kermack–McKendrick model (30)–(32) bounds the homogeneous stochastic model more generally. We observe that this could be achieved by showing that the message passing system for the stochastic model is the first in a sequence of message passing systems indexed by n, which satisfies the conditions for Theorem 6, and where $S_{mes (n)} (t)$ is non-increasing with n; this is easy to do for Poisson transmission and recovery processes (“Appendix 6”). Another extension worthy of investigation is to multitype SIR epidemics.

Acknowledgements

R.R.W. acknowledges support from EPSRC (DTA studentship). R.R.W. and K.J.S. acknowledge support from the Leverhulme Trust (RPG-2014-341). We thank the reviewers and associate editor for their constructive comments which have improved the presentation of the paper.

Appendix 1: Proof of Theorem 1

Reproducing an argument from Karrer and Newman (2010), we construct here a feasible (bounded between 0 and 1) solution of (3). Let $F_{(0)}^{i \leftarrow j} (t) = 1$ for all $i \in V, j \in N_{i}$ and all $t \geq 0$ , and define the following iterative procedure. For $m = 1, 2, \dots$ , let

\begin{matrix} F_{(m)}^{i \leftarrow j} (t) = 1 - \int_{0}^{t} f_{i j} (τ) (1 - y_{j} - z_{j} \prod_{k \in N_{j} \ i} F_{(m - 1)}^{j \leftarrow k} (t - τ)) d τ . \end{matrix}

It is easily shown that $1 \geq F_{(m)}^{i \leftarrow j} (t) \geq F_{(m + 1)}^{i \leftarrow j} (t) \geq 1 - \int_{0}^{t} f_{i j} (τ) d τ$ , for all $i \in V, j \in N_{i}$ , $t \geq 0$ and $m = 0, 1, \dots$ , whence $F_{m} (t) \equiv (F_{m}^{i \leftarrow j} (t) : i \in V, j \in N_{i})$ converges to some $F_{\infty} (t)$ as $m \to \infty$ , and $F_{\infty} (t)$ is a feasible solution of (3). Moreover, letting $F_{*} (t)$ be any feasible solution of (3), it can be shown, arguing as in Corduneanu (1991), section 1.3, that

\begin{matrix} sup_{i \in V, j \in N_{i}} | F_{*}^{i \leftarrow j} (t) - F_{m}^{i \leftarrow j} (t) | \leq \frac{{(N_{max} - 1)}^{m} {(t f_{max})}^{m + 1}}{(m + 1)!}, \end{matrix}

where $N_{max} = {sup}_{i \in V} | N_{i} |$ and $f_{max} = {sup}_{i \in V, j \in N_{i}} {sup}_{t^{'} \geq 0} f_{i j} (t^{'})$ . Assume that $N_{max} < \infty$ and $f_{max} < \infty$ . Then, the right-hand side of (46) converges to zero as $m \to \infty$ , and $F_{\infty} (t)$ must be the unique feasible solution of (3).

Note that (45) implies that if, for all $i \in V, j \in N_{i}$ , it is the case that $F_{(m - 1)}^{i \leftarrow j} (t)$ is non-increasing and belongs to [0, 1] for all $t \geq 0$ , then these properties are also held by $F_{(m)}^{i \leftarrow j} (t)$ for all $i \in V, j \in N_{i}$ . Since these properties are held by $F_{(0)}^{i \leftarrow j} (t) (= 1)$ for all $i \in V, j \in N_{i}$ , then, by induction, they hold for all $m \geq 0$ , so $F_{(\infty)}^{i \leftarrow j} (t)$ is non-increasing for all $i \in V, j \in N_{i}$ . Thus, the feasible solution of (13) (for $F_{sym} (t)$ ) is non-increasing, whence $S_{mes} (t)$ is non-increasing.

To show continuity of the feasible solution, first note that (45) implies that if, for all $i \in V, j \in N_{i}$ , it is the case that $F_{(m - 1)}^{i \leftarrow j} (t)$ is continuous, then $F_{(m)}^{i \leftarrow j} (t)$ is also continuous for all $i \in V, j \in N_{i}$ . Since $F_{(0)}^{i \leftarrow j} (t) (= 1)$ is continuous for all $i \in V, j \in N_{i}$ , then, by induction, $F_{(m)}^{i \leftarrow j} (t)$ is continuous for all $m \geq 0, i \in V, j \in N_{i}$ . Now, for any fixed $T > 0$ , the bound in (46) holds for all $t \in [0, T]$ provided t in the right-hand side of (46) is replaced by T. Thus $F_{m} (t)$ converges uniformly to $F_{\infty} (t)$ over [0, T] as $n \to \infty$ and, since each $F_{m} (t)$ is continuous on [0, T], it follows that $F_{\infty} (t)$ is also continuous on [0, T]. This holds for any $T > 0$ , so $F_{\infty} (t)$ is continuous on $[0, \infty)$ .

Appendix 2: Proof of Theorem 2

We suppose first that the vertex set $V$ is finite. Similarly to Wilkinson and Sharkey (2014, section III), and Ball et al. (2015), it is straightforward to show that the indicator variable $1_{i \leftarrow A (t)}$ for the event that a cavity state-individual $i \in V$ does not receive any infectious contacts from any of $A \subset N_{i}$ by time $t \geq 0$ is a function of the random variables $X^{* *} \equiv \cup_{i \in V} {X_{i}^{*}, Y_{i}}$ (see the beginning of Sect. 2), and that it is non-decreasing with respect to each element of $X^{* *}$ . Thus, since $X^{* *}$ is a set of associated variables [by assumption, and Esary et al. (1967, (P2) and (P3))] and $Y_{i}$ is independent of all other members of $X^{* *}$ , then using Esary et al. (1967, Theorem 4.1), we have

\begin{matrix} P_{S_{i}} (t) = z_{i} E [1_{i \leftarrow N_{i} (t)}] \geq z_{i} \prod_{j \in N_{i}} E [1_{i \leftarrow j (t)}] = z_{i} \prod_{j \in N_{i}} H^{i \leftarrow j} (t) (i \in V), \end{matrix}

with equality occurring when the graph is a tree or forest (where putting an individual into the cavity state prevents any dependencies between the states of its neighbours). Recall that $z_{i} \equiv P (Y_{i} = 2)$ is the probability that i is initially susceptible.

Similarly, the indicator variable $1_{(i) j \leftarrow A (t)}$ for the event that a cavity state-individual $j \in V$ does not receive any infectious contacts from any of $A \subset N_{j} \ i$ by time $t \geq 0$ , where $i \in N_{j}$ is also in the cavity state, is a function of the random variables $X^{* *}$ , and it is non-decreasing with respect to each. Again, since $X^{* *}$ is a set of associated variables then we have [c.f. (2) and (3)],

\begin{matrix} Φ_{i}^{j} (t) = E [1_{(i) j \leftarrow N_{j} \ i (t)}] \geq & \prod_{k \in N_{j} \ i} E [1_{(i) j \leftarrow k (t)}] \\ \geq & \prod_{k \in N_{j} \ i} E [1_{j \leftarrow k (t)}] \\ = & \prod_{k \in N_{j} \ i} H^{j \leftarrow k} (t), \end{matrix}

where the second inequality follows from the fact that taking an individual out of the cavity state cannot increase the probability that a different individual receives no infectious contacts from a given neighbour by time $t \geq 0$ . Again, equality occurs when the graph is a tree or forest.

The above derivations of (47) and (48) break down when the vertex set $V$ is countably infinite, since the theory in Esary et al. (1967) requires that the set of random variables $X^{* *}$ is finite. Suppose now that $V$ is countably infinite and label the vertices $1, 2, \dots$ . Fix $i \in V$ and an integer $n \geq i$ . Let $G^{(n)} = (V^{(n)}, E^{(n)})$ be the graph obtained from G by deleting the vertices $n + 1, n + 2, \dots$ and all edges connected to those vertices. Now, since $| V^{(n)} | < \infty$ , the inequality (47) yields

\begin{matrix} P_{S_{i}}^{(n)} (t) \geq z_{i} \prod_{j \in N_{i}^{(n)}} H^{(n), i \leftarrow j} (t), \end{matrix}

where the superfix n denotes that the quantity is defined for the epidemic on $G^{(n)}$ . Further, for $n = i, i + 1, \dots$ , the epidemic on $G^{(n)}$ can be defined using the same set $X^{* *} \equiv \cup_{i \in V} {X_{i}^{*}, Y_{i}}$ of random variables. It then follows that, for any $t \geq 0$ , the event that individual i is susceptible at time t in the epidemic on $G^{(n)}$ decreases with n and tends to the event that individual i is susceptible at time t in the epidemic on G as $n \to \infty$ , so $P_{S_{i}}^{(n)} (t) \to P_{S_{i}} (t)$ as $n \to \infty$ by the continuity of probability measures. A similar argument shows that $H^{(n), i \leftarrow j} (t) \to H^{i \leftarrow j} (t)$ as $n \to \infty$ . Letting $n \to \infty$ in (49) then shows that (47) holds when $V$ is countably infinite, as $| N_{i} | < \infty$ . The same method of proof shows that (48) also holds when $V$ is countably infinite.

Using (48) in conjunction with (2) we have

\begin{matrix} H^{i \leftarrow j} (t) \geq 1 - \int_{0}^{t} f_{i j} (τ) (1 - y_{j} - z_{j} \prod_{k \in N_{j} \ i} H^{j \leftarrow k} (t - τ)) d τ, \end{matrix}

where equality occurs when the graph is a tree or forest. Using (50), it is easy to show by the iterative procedure in “Appendix 1” (except with $F_{(0)}^{i \leftarrow j} (t) = H^{i \leftarrow j} (t)$ ) that a unique feasible solution of (3) exists and, using this solution, that $F^{i \leftarrow j} (t) \leq H^{i \leftarrow j} (t)$ for all $i \in V, j \in N_{i}$ and all $t \geq 0$ , with equality occurring when the graph is a tree or forest. This fact, in combination with (47), c.f. (4), proves (7), and consequently, c.f. (6), gives (8).

Appendix 3: Proof of Theorem 5

Here we consider the homogeneous stochastic model defined at the beginning of Sect. 3 with reference to the beginning of Sect. 2. We assume that transmission processes are Poisson with rate $β$ and that they are independent of the recovery processes, specifically $f (τ) = β e^{- β τ} \int_{τ}^{\infty} r (τ^{'}) d τ^{'}$ . We assume that $r (τ)$ is continuous so that we may apply Leibniz’s integral rule to compute derivatives (see “Appendix 4”). In this case, a pairwise system incorporating a general infectious period can be derived from the homogeneous message passing system (10)–(13) with the additional variables:

\begin{matrix} S S_{mes} (t) \equiv & z^{2} F_{sym} {(t)}^{2 (n - 1)}, \end{matrix}

\begin{matrix} S I_{mes} (t) \equiv & z F_{sym} {(t)}^{n - 1} (\frac{- {\dot{F}}_{sym} (t)}{β}), \end{matrix}

where $S S_{mes} (t)$ approximates the probability that a pair of neighbours are susceptible at time t, and $S I_{mes} (t)$ approximates the probability that the first is susceptible and the second is infected at time t [see Wilkinson and Sharkey (2014, section II B) where these pairwise quantities were first considered in the context of message passing]. To understand the construction of the factor in brackets in (52), note that for any pair of neighbours i, j, the probability that i is susceptible and j is infected at time t remains the same when i is placed into the cavity state. Further, when transmission processes are Poisson with rate $β$ , we must have that:

\begin{matrix} {\dot{H}}^{i \leftarrow j} (t) = & - β P (j infected at time t and no infectious \\ contacts from j to i before time t ∣ i in cavity) . \end{matrix}

Thus, the factor in brackets in (52) can be seen to approximate the probability on the right-hand side of (53) for any pair of neighbours i, j (recall that $F_{sym} (t)$ approximates $H^{i \leftarrow j} (t)$ for any pair of neighbours i, j).

To obtain population-level quantities, we define [as in Sharkey (2008, appendix B)]:

\begin{matrix} [S] (t) \equiv N S_{mes} (t), [I] (t) \equiv N I_{mes} (t), [S S] (t) \equiv n N S S_{mes} (t), [S I] (t) \equiv n N S I_{mes} (t), \end{matrix}

where N is a positive number. Note that (10) and (52) imply

\begin{matrix} {\dot{F}}_{sym} (t) = - β F_{sym} (t) \frac{S I_{mes} (t)}{S_{mes} (t)} (S_{mes} (t) \neq 0), \end{matrix}

so, since $F_{sym} (0) = 1$ , we have:

\begin{matrix} F_{sym} (t) = exp (- \int_{0}^{t} β \frac{S I_{mes} (τ)}{S_{mes} (τ)} d τ) (S_{mes} (t) \neq 0) . \end{matrix}

Substituting from (10)–(12) and (51), and using (55), it is straightforward to write down the time derivatives of [S](t), [I](t) and [SS](t) as in (22)–(24).

Finding the time derivative of [SI](t) is more involved. Setting $u = t - τ$ in (13) and differentiating with respect to t using Leibniz’s integral rule yields, recalling $f (τ) = β e^{- β τ} \int_{τ}^{\infty} r (τ^{'}) d τ^{'}$ , that

\begin{matrix} {\dot{F}}_{sym} (t) = & - β (F_{sym} (t) - y - z F_{sym} {(t)}^{n - 1}) \\ + \int_{0}^{t} β e^{- β τ} r (τ) (1 - y - z F_{sym} {(t - τ)}^{n - 1}) d τ . \end{matrix}

Substituting from (52) and (57) into (54), we can write

\begin{matrix} [S I] (t) = & n N z F_{sym} {(t)}^{n - 1} (\frac{- {\dot{F}}_{sym} (t)}{β}) \\ = & n N z F_{sym} {(t)}^{n - 1} [F_{sym} (t) - y - z F_{sym} {(t)}^{n - 1} \\ - \int_{0}^{t} e^{- β τ} r (τ) (1 - y - z F_{sym} {(t - τ)}^{n - 1}) d τ] . \end{matrix}

Differentiating the right-hand side of (58), we can now express the time derivative of [SI](t) as

\begin{matrix} \dot{[S I]} (t) = & n (n - 1) N z F_{sym} {(t)}^{n - 2} {\dot{F}}_{sym} (t) (\frac{- {\dot{F}}_{sym} (t)}{β}) \\ + n N z F_{sym} {(t)}^{n - 1} {\dot{F}}_{sym} (t) \\ - n (n - 1) N z^{2} F_{sym} {(t)}^{2 n - 3} {\dot{F}}_{sym} (t) \\ + n (n - 1) N z^{2} F_{sym} {(t)}^{n - 1} \int_{0}^{t} e^{- β τ} r (τ) F_{sym} {(t - τ)}^{n - 2} {\dot{F}}_{sym} (t - τ) d τ \\ - n N z F_{sym} {(t)}^{n - 1} e^{- β t} r (t) (1 - y - z) . \end{matrix}

Substituting from (10), (51), (52), (54), (55) and (56) into (59) yields the expression for $\dot{[S I]} (t)$ in (25); the terms on the right-hand side of (25) are ordered by equality with the terms on the right-hand side of (59).

Appendix 4: Continuity conditions for the application of Leibniz’s integral rule and Gronwall’s inequality

To derive (38), Leibniz’s integral rule is applied to (13), and this is valid if $F_{sym} (t)$ is continuously differentiable. Similarly, the application of the rule in the derivation of (57) and (59) is valid if $f (τ)$ and $F_{sym} (t)$ are continuously differentiable. Here we show that $F_{sym} (t)$ is continuously differentiable if $f (τ)$ is continuously differentiable. Note that if $f (τ) = β e^{- β τ} \int_{τ}^{\infty} r (τ^{'}) d τ^{'}$ then $f (τ)$ is continuously differentiable when $r (τ)$ is continuous.

With reference to the message passing system, (10)–(13), assume that $f (τ)$ is continuously differentiable. Thus we may apply Leibniz’s integral rule to (13), after setting $τ^{'} = t - τ$ , in order to compute the derivative of $F_{sym} (t)$ as follows

\begin{matrix} {\dot{F}}_{sym} (t) = - \int_{0}^{t} \dot{f} (t - τ^{'}) (1 - y - z F_{sym} {(τ^{'})}^{n - 1}) d τ^{'} - f (0) (1 - y - z F_{sym} {(t)}^{n - 1}) . \end{matrix}

It follows from “Appendix 1” that $F_{sym} (t)$ is continuous. Thus, since $\dot{f} (τ)$ is also continuous, (60) implies that ${\dot{F}}_{sym} (t)$ is continuous.

To derive (41), Gronwall’s inequality is applied to (40), and this is valid if ${\dot{S}}_{mes (n)} (t)$ and $\dot{S} (t)$ are continuous. By condition (iii) of Theorem 6, we have that ${\dot{F}}_{sym (n)} (t)$ is continuous (by the above argument), so ${\dot{S}}_{mes (n)} (t)$ is continuous. Conditions (i) and (iii) imply that $f^{*} (t)$ is continuous, which implies that $\dot{S} (t)$ is continuous.

We note that Leibniz’s integral rule was assumed to be applicable in Wilkinson and Sharkey (2014). It is straightforward, using a similar argument to above, to show that the application of the rule in that paper is valid if $f_{i j} (τ)$ is continuously differentiable for all $i \in V, j \in N_{i}$ .

Appendix 5: Proof of (40)

It follows from (30) and (39) that, for all $t \in [0, T]$ ,

\begin{matrix} |{\dot{S}}_{mes (n)} (t) - \dot{S} (t)| \leq A_{n} (t) + B_{n} (t), \end{matrix}

where

\begin{matrix} A_{n} (t) = |\frac{S_{mes (n)} (t)}{F_{sym (n)} (t)} [\frac{n - 1}{n} \int_{0}^{t} n f_{n} (τ) \frac{{\dot{S}}_{mes (n)} (t - τ)}{F_{sym (n)} (t - τ)} d τ] - S (t) \int_{0}^{t} f^{*} (τ) \dot{S} (t - τ) d τ| \end{matrix}

and

\begin{matrix} B_{n} (t) = |\frac{S_{mes (n)} (t)}{F_{sym (n)} (t)} n f_{n} (t) (1 - y - z) - S (t) I (0) f^{*} (t)| . \end{matrix}

Now

\begin{matrix} A_{n} (t) \leq A_{n}^{(1)} (t) + A_{n}^{(2)} (t), \end{matrix}

where

\begin{matrix} A_{n}^{(1)} (t) = & |\frac{S_{mes (n)} (t)}{F_{sym (n)} (t)} [\frac{n - 1}{n} \int_{0}^{t} n f_{n} (τ) \frac{{\dot{S}}_{mes (n)} (t - τ)}{F_{sym (n)} (t - τ)} d τ] \\ - S_{mes (n)} (t) \int_{0}^{t} f^{*} (τ) \dot{S} (t - τ) d τ| \end{matrix}

and

\begin{matrix} A_{n}^{(2)} (t) = |S_{mes (n)} (t) - S (t)| \times |\int_{0}^{t} f^{*} (τ) \dot{S} (t - τ) d τ| . \end{matrix}

Considering $A_{n}^{(1)} (t)$ , note that, since $0 \leq S_{mes (n)} (t) \leq 1$ ,

\begin{matrix} A_{n}^{(1)} (t) \leq (\frac{n - 1}{n}) \frac{1}{F_{sym (n)} (t)} A_{n}^{(11)} (t) + A_{n}^{(12)} (t), \end{matrix}

where

\begin{matrix} A_{n}^{(11)} (t) = & |\int_{0}^{t} n f_{n} (τ) \frac{{\dot{S}}_{mes (n)} (t - τ)}{F_{sym (n)} (t - τ)} d τ - \int_{0}^{t} f^{*} (τ) \dot{S} (t - τ) d τ| \\ \leq & |\int_{0}^{t} \frac{n f_{n} (τ)}{F_{sym (n)} (t - τ)} ({\dot{S}}_{mes (n)} (t - τ) - \dot{S} (t - τ)) d τ| \\ + |\int_{0}^{t} (\frac{n f_{n} (τ)}{F_{sym (n)} (t - τ)} - f^{*} (τ)) \dot{S} (t - τ) d τ| \end{matrix}

and

\begin{matrix} A_{n}^{(12)} (t) = |\int_{0}^{t} f^{*} (τ) \dot{S} (t - τ) d τ| \times |(\frac{n - 1}{n}) \frac{1}{F_{sym (n)} (t)} - 1| . \end{matrix}

Now conditions (i), (ii) and (36) imply that, for all $t \in [0, T], τ \in [0, t]$ ,

\begin{matrix} \frac{n f_{n} (τ)}{F_{sym (n)} (t - τ)} \leq \frac{M_{T} + ϵ_{n} (T)}{1 - ϵ_{n}^{(1)} (T)} \end{matrix}

and

\begin{matrix} |\frac{n f_{n} (τ)}{F_{sym (n)} (t - τ)} - f^{*} (τ)| \leq & \frac{1}{F_{sym (n)} (t - τ)} (|n f_{n} (τ) - f^{*} (τ)| \\ + f^{*} (τ) (1 - F_{sym (n)} (t - τ))) \\ \leq & \frac{ϵ_{n} (T) + M_{T} ϵ_{n}^{(1)} (T)}{1 - ϵ_{n}^{(1)} (T)}, \end{matrix}

whence

\begin{matrix} A_{n}^{(11)} (t) & \leq \frac{M_{T} + ϵ_{n} (T)}{1 - ϵ_{n}^{(1)} (T)} \int_{0}^{t} |{\dot{S}}_{mes (n)} (t - τ) - \dot{S} (t - τ)| d τ \\ + \frac{ϵ_{n} (T) + M_{T} ϵ_{n}^{(1)} (T)}{1 - ϵ_{n}^{(1)} (T)} |\int_{0}^{t} \dot{S} (t - τ) d τ| \\ \leq \frac{M_{T} + ϵ_{n} (T)}{1 - ϵ_{n}^{(1)} (T)} \int_{0}^{t} |{\dot{S}}_{mes (n)} (u) - \dot{S} (u)| d u + \frac{ϵ_{n} (T) + M_{T} ϵ_{n}^{(1)} (T)}{1 - ϵ_{n}^{(1)} (T)}, \end{matrix}

as $\int_{0}^{t} \dot{S} (t - τ) d τ = S (0) - S (t) \in [0, 1]$ . A similar argument, noting that

\begin{matrix} |\int_{0}^{t} f^{*} (τ) \dot{S} (t - τ) d τ| \leq \int_{0}^{t} |f^{*} (τ) \dot{S} (t - τ)| d τ \leq M_{T} [S (0) - S (t)] \leq M_{T}, \end{matrix}

shows that

\begin{matrix} A_{n}^{(12)} (t) \leq \frac{M_{T} (ϵ_{n}^{(1)} (T) + \frac{1}{n})}{1 - ϵ_{n}^{(1)} (T)} . \end{matrix}

Hence, recalling (63),

\begin{matrix} A_{n}^{(1)} (t) \leq & \frac{M_{T} + ϵ_{n} (T)}{{(1 - ϵ_{n}^{(1)} (T))}^{2}} \int_{0}^{t} |{\dot{S}}_{mes (n)} (u) - \dot{S} (u)| d u \\ + \frac{ϵ_{n} (T) + M_{T} ϵ_{n}^{(1)} (T)}{{(1 - ϵ_{n}^{(1)} (T))}^{2}} + \frac{M_{T} (ϵ_{n}^{(1)} (T) + \frac{1}{n})}{1 - ϵ_{n}^{(1)} (T)} . \end{matrix}

Turning to $A_{n}^{(2)} (t)$ , note that since $S_{mes (n)} (0) = S (0)$ ,

\begin{matrix} |S_{mes (n)} (t) - S (t)| = & |\int_{0}^{t} {\dot{S}}_{mes (n)} (u) - \dot{S} (u) d u| \\ \leq & \int_{0}^{t} |{\dot{S}}_{mes (n)} (u) - \dot{S} (u)| d u, \end{matrix}

\begin{matrix} A_{n}^{(2)} (t) \leq M_{T} \int_{0}^{t} |{\dot{S}}_{mes (n)} (u) - \dot{S} (u)| d u . \end{matrix}

Further, since $I (0) = 1 - y - z$ and $0 \leq I (0), S_{mes (n)} (t) \leq 1$ ,

\begin{matrix} B_{n} (t) = & I (0) |\frac{S_{mes (n)} (t)}{F_{sym (n)} (t)} n f_{n} (t) - S (t) f^{*} (t)| \\ \leq & I (0) (f^{*} (t) |S_{mes (n)} (t) - S (t)| + S_{mes (n)} (t) |\frac{n f_{n} (t)}{F_{sym (n)} (t)} - f^{*} (t)|) \\ \leq & M_{T} \int_{0}^{t} |{\dot{S}}_{mes (n)} (u) - \dot{S} (u)| d u + \frac{ϵ_{n} (T) + M_{T} ϵ_{n}^{(1)} (T)}{1 - ϵ_{n}^{(1)} (T)}, \end{matrix}

using a similar result to (64).

Thus, using (61), (62), (65), (66) and (67), we may define

\begin{matrix} A (n, T) = 2 M_{T} + \frac{M_{T} + ϵ_{n} (T)}{{(1 - ϵ_{n}^{(1)} (T))}^{2}} \end{matrix}

and

\begin{matrix} B (n, T) = \frac{(ϵ_{n} (T) + M_{T} ϵ_{n}^{(1)} (T)) (2 - ϵ_{n}^{(1)} (T))}{{(1 - ϵ_{n}^{(1)} (T))}^{2}} + \frac{M_{T} (ϵ_{n}^{(1)} (T) + \frac{1}{n})}{1 - ϵ_{n}^{(1)} (T)}, \end{matrix}

such that inequality (40) is satisfied for all $t \in [0, T]$ . Further, since both $ϵ_{n} (T)$ and $ϵ_{n}^{(1)} (T)$ converge to 0 as $n \to \infty$ , it follows that $B (n, T) \to 0$ as $n \to \infty$ and $0 \leq A (n, T) \leq 4 M_{T}$ for all sufficiently large n.

Appendix 6: Proof of Corollary 4

Here, we consider the homogeneous stochastic model (defined at the beginning of Sect. 3, with reference to the beginning of Sect. 2) for the special case where transmission and recovery processes are independent and Poisson with rates $β$ and $γ$ respectively. Specifically, $h (τ) = β e^{- β τ}$ , $r (τ) = γ e^{- γ τ}$ and $f (τ) = β e^{- (β + γ) τ}$ . For convenience, we let k denote the regular degree of the symmetric graph (instead of n).

For this special case, we show here that for the same initial conditions and parameters,

\begin{matrix} P_{S} (t) \geq S_{mes} (t) > S (t) for all t > 0, \end{matrix}

where $P_{S} (t)$ is the probability that an arbitrary individual is susceptible at time t (this being the same for all individuals) and S(t) is given by the special case of the Kermack–McKendrick model (42)–(44), with $S (0) = z > 0, I (0) = 1 - y - z > 0$ and $R (0) = y$ ; $S_{mes} (t)$ is given by (10) and (13) but with n replaced by k. Note that since

\begin{matrix} P_{R} (t) = & y + \int_{0}^{t} γ e^{- γ τ} (1 - y - P_{S} (t - τ)) d τ, \\ R_{mes} (t) = & y + \int_{0}^{t} γ e^{- γ τ} (1 - y - S_{mes} (t - τ)) d τ, \end{matrix}

and

\begin{matrix} R (t) = y + \int_{0}^{t} γ e^{- γ τ} (1 - y - S (t - τ)) d τ, \end{matrix}

then (68) implies that $R (t) > R_{mes} (t) \geq P_{R} (t)$ for all $t > 0$ .

We already have $P_{S} (t) \geq S_{mes} (t)$ by Theorem 2 and the fact that the message passing system, in this case, has a unique solution. Thus, we may prove (68) and Corollary 4 by showing that $S_{mes} (t) > S (t)$ for all $t > 0$ .

Setting $f_{n} (τ) = (β k / n) e^{- (β k / n + γ) τ}$ and $f^{*} (τ) = β k e^{- γ τ}$ , the Kermack–McKendrick model reduces to the system of ODEs (42)–(44) and the conditions for Theorem 6 are satisfied. Thus, letting $F_{sym (n)} (t)$ be defined by (13) but with $f (τ)$ replaced by $f_{n} (τ)$ , and letting $S_{mes (n)} (t)$ be defined by (10) but with $F_{sym} (t)$ replaced by $F_{sym (n)} (t)$ (as in Sect. 3.4),

\begin{matrix} lim_{n \to \infty} S_{mes (n)} (t) = S (t) \end{matrix}

and

\begin{matrix} S_{mes (n)} (t) = S_{mes} (t) if n = k . \end{matrix}

Therefore, if $S_{mes (n)} (t) \equiv z F_{sym (n)} {(t)}^{n}$ is strictly decreasing with respect to n, for all $t > 0$ , then we have $S_{mes} (t) > S (t)$ for all $t > 0$ . We now show this to be the case.

Letting $u_{n} (t) = F_{sym (n)} {(t)}^{n} (= S_{mes (n)} (t) / z)$ , we can write [c.f. (14)]

\begin{matrix} {\dot{u}}_{n} (t) = n γ (u_{n} {(t)}^{\frac{n - 1}{n}} - u_{n} (t)) - β k (u_{n} (t) - y u_{n} {(t)}^{\frac{n - 1}{n}} - z u_{n} {(t)}^{\frac{2 (n - 1)}{n}}) . \end{matrix}

For fixed $u \in (0, 1)$ , we have that $u^{\frac{n - 1}{n}}$ is strictly decreasing with n, and also that

\begin{matrix} n (u^{\frac{n - 1}{n}} - u) = & n u (u^{\frac{- 1}{n}} - 1) \\ = & n e^{- λ} (e^{\frac{λ}{n}} - 1) (where u = e^{- λ}, so λ > 0) \\ = & e^{- λ} \sum_{k = 1}^{\infty} \frac{1}{k!} \frac{λ^{k}}{n^{k - 1}} \end{matrix}

is strictly decreasing with n. Therefore, since $u_{n} (0) = 1$ and $u_{n} (t) \in (0, 1)$ for $t > 0$ , it follows that $u_{n} (t)$ (and hence $S_{mes (n)} (t)$ ) is strictly decreasing with n for all $t > 0$ .

Footnotes

Research sponsered in part by The Leverhulme Trust Grant RPG-2014-341 to KJS.

Contributor Information

Robert R. Wilkinson, Email: robertwi@liv.ac.uk

Frank G. Ball, Email: frank.ball@nottingham.ac.uk

Kieran J. Sharkey, Email: kjs@liv.ac.uk

References

Anderson RM, May RM. Infectious diseases of humans. Oxford: Oxford University Press; 1992. [Google Scholar]
Bailey NTJ. The mathematical theory of infectious diseases. London: Griffin; 1975. [Google Scholar]
Ball FG, Wilkinson RR, Sharkey KJ. Erratum: Message passing and moment closure for susceptible–infected–recovered epidemics on finite networks. Phys Rev E. 2015;92:039904. doi: 10.1103/PhysRevE.92.039904. [DOI] [PubMed] [Google Scholar]
Barbour AD. The principle of the diffusion of arbitrary constants. J Appl Probab. 1972;9(3):519–541. doi: 10.1017/S0021900200035841. [DOI] [Google Scholar]
Barbour AD. On a functional central limit theorem for Markov population processes. Adv Appl Probab. 1974;6(1):21–39. doi: 10.1017/S0001867800039690. [DOI] [Google Scholar]
Barbour AD, Reinert G. Approximating the epidemic curve. Electron J Probab. 2013;18(54):1–30. [Google Scholar]
Corduneanu C. Integral equations and applications. Cambridge: Cambridge University Press; 1991. [Google Scholar]
Danon L, Ford AP, House T, Jewell CP, Keeling MJ, Roberts GO, Ross JV, Vernon MC. Networks and the epidemiology of infectious disease. Interdiscip Perspect Infect Dis. 2011 doi: 10.1155/2011/284909. [DOI] [PMC free article] [PubMed] [Google Scholar]
Diekmann O, de Jong MCM, Metz JAJ. A deterministic epidemic model taking account of repeated contacts between the same individuals. J Appl Probab. 1998;35(2):448–462. doi: 10.1017/S0021900200015072. [DOI] [Google Scholar]
Donnelly P. The correlation structure of epidemic models. Math Biosci. 1993;117(1–2):49–75. doi: 10.1016/0025-5564(93)90017-5. [DOI] [PubMed] [Google Scholar]
Esary JD, Proschan F, Walkup DW. Association of random variables, with applications. Ann Math Stat. 1967;38(5):1466–1474. doi: 10.1214/aoms/1177698701. [DOI] [Google Scholar]
Godsil C, Royle G. Algebraic graph theory. New York: Springer; 2001. [Google Scholar]
Karrer B, Newman MEJ. A message passing approach for general epidemic models. Phys Rev E. 2010;82:016101. doi: 10.1103/PhysRevE.82.016101. [DOI] [PubMed] [Google Scholar]
Keeling MJ. The effects of local spatial structure on epidemiological invasions. Proc Biol Sci. 1999;266:589–867. doi: 10.1098/rspb.1999.0716. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proc R Soc Lond A. 1927;115:700–721. doi: 10.1098/rspa.1927.0118. [DOI] [Google Scholar]
Kiss IZ, Röst G, Zsolt V. Generalization of pairwise models to non-Markovian epidemics on networks. Phys Rev Lett. 2015;115:078701. doi: 10.1103/PhysRevLett.115.078701. [DOI] [PubMed] [Google Scholar]
Kurtz TG. Solutions of ordinary differential equations as limits of pure jump Markov processes. J Appl Probab. 1970;7(1):49–58. doi: 10.1017/S0021900200026929. [DOI] [Google Scholar]
Kurtz TG. Limit theorems for sequences of jump Markov processes approximating ordinary differential processes. J Appl Probab. 1971;8(2):344–356. doi: 10.1017/S002190020003535X. [DOI] [Google Scholar]
Leung KY, Diekmann O. Dangerous connections: on binding site models of infectious disease dynamics. J Math Biol. 2017;74:619–671. doi: 10.1007/s00285-016-1037-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Miller JC. The spread of infectious disease through clustered populations. J R Soc Interface. 2009 doi: 10.1098/rsif.2008.0524. [DOI] [PMC free article] [PubMed] [Google Scholar]
Miller JC. A note on a paper by Erik Volz: SIR dynamics in random networks. J Math Biol. 2011;62(3):349–358. doi: 10.1007/s00285-010-0337-9. [DOI] [PubMed] [Google Scholar]
Miller JC. A note on the derivation of epidemic final sizes. Bull Math Biol. 2012;74(9):2125–2141. doi: 10.1007/s11538-012-9749-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Miller JC, Slim AC, Volz EM. Edge-based compartmental modelling for infectious disease spread. J R Soc Interface. 2011;9:890–906. doi: 10.1098/rsif.2011.0403. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pastor-Satorras R, Castellano C, Van Mieghem P, Vespignani A. Epidemic processes in complex networks. Rev Mod Phys. 2015;87:925. doi: 10.1103/RevModPhys.87.925. [DOI] [Google Scholar]
Röst G, Vizi Z, Kiss IZ (2016) Pairwise approximation for SIR type network epidemics with non-Markovian recovery. arXiv:1605.02933 [DOI] [PMC free article] [PubMed]
Sharkey KJ. Deterministic epidemiological models at the individual level. J Math Biol. 2008;57:311–331. doi: 10.1007/s00285-008-0161-7. [DOI] [PubMed] [Google Scholar]
Sherborne N, Miller JC, Blyuss KB, Kiss IZ (2016) Mean-field models for non-Markovian epidemics on networks: from edge-based compartmental to pairwise models. arXiv:1611.04030 [DOI] [PMC free article] [PubMed]
Simon PL, Taylor M, Kiss IZ. Exact epidemic models on graphs using graph-automorphism driven lumping. J Math Biol. 2011;62(4):479–508. doi: 10.1007/s00285-010-0344-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Trapman P. Reproduction numbers for epidemics on networks using pair approximation. Math Biosci. 2007;210:464–489. doi: 10.1016/j.mbs.2007.05.011. [DOI] [PubMed] [Google Scholar]
Volz E. SIR dynamics in random networks with heterogeneous connectivity. J Math Biol. 2008;56(3):293–310. doi: 10.1007/s00285-007-0116-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wilkinson RR, Ball FG, Sharkey KJ (2016) The deterministic Kermack-McKendrick model bounds the general stochastic epidemic. Scheduled for publication. J Appl Probab 53(4):1031–1040
Wilkinson RR, Sharkey KJ. Message passing and moment closure for susceptible–infected–recovered epidemics on finite networks. Phys Rev E. 2014;89:022808. doi: 10.1103/PhysRevE.89.022808. [DOI] [PubMed] [Google Scholar]

[CR1] Anderson RM, May RM. Infectious diseases of humans. Oxford: Oxford University Press; 1992. [Google Scholar]

[CR2] Bailey NTJ. The mathematical theory of infectious diseases. London: Griffin; 1975. [Google Scholar]

[CR3] Ball FG, Wilkinson RR, Sharkey KJ. Erratum: Message passing and moment closure for susceptible–infected–recovered epidemics on finite networks. Phys Rev E. 2015;92:039904. doi: 10.1103/PhysRevE.92.039904. [DOI] [PubMed] [Google Scholar]

[CR4] Barbour AD. The principle of the diffusion of arbitrary constants. J Appl Probab. 1972;9(3):519–541. doi: 10.1017/S0021900200035841. [DOI] [Google Scholar]

[CR5] Barbour AD. On a functional central limit theorem for Markov population processes. Adv Appl Probab. 1974;6(1):21–39. doi: 10.1017/S0001867800039690. [DOI] [Google Scholar]

[CR6] Barbour AD, Reinert G. Approximating the epidemic curve. Electron J Probab. 2013;18(54):1–30. [Google Scholar]

[CR7] Corduneanu C. Integral equations and applications. Cambridge: Cambridge University Press; 1991. [Google Scholar]

[CR8] Danon L, Ford AP, House T, Jewell CP, Keeling MJ, Roberts GO, Ross JV, Vernon MC. Networks and the epidemiology of infectious disease. Interdiscip Perspect Infect Dis. 2011 doi: 10.1155/2011/284909. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] Diekmann O, de Jong MCM, Metz JAJ. A deterministic epidemic model taking account of repeated contacts between the same individuals. J Appl Probab. 1998;35(2):448–462. doi: 10.1017/S0021900200015072. [DOI] [Google Scholar]

[CR10] Donnelly P. The correlation structure of epidemic models. Math Biosci. 1993;117(1–2):49–75. doi: 10.1016/0025-5564(93)90017-5. [DOI] [PubMed] [Google Scholar]

[CR11] Esary JD, Proschan F, Walkup DW. Association of random variables, with applications. Ann Math Stat. 1967;38(5):1466–1474. doi: 10.1214/aoms/1177698701. [DOI] [Google Scholar]

[CR12] Godsil C, Royle G. Algebraic graph theory. New York: Springer; 2001. [Google Scholar]

[CR13] Karrer B, Newman MEJ. A message passing approach for general epidemic models. Phys Rev E. 2010;82:016101. doi: 10.1103/PhysRevE.82.016101. [DOI] [PubMed] [Google Scholar]

[CR14] Keeling MJ. The effects of local spatial structure on epidemiological invasions. Proc Biol Sci. 1999;266:589–867. doi: 10.1098/rspb.1999.0716. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proc R Soc Lond A. 1927;115:700–721. doi: 10.1098/rspa.1927.0118. [DOI] [Google Scholar]

[CR16] Kiss IZ, Röst G, Zsolt V. Generalization of pairwise models to non-Markovian epidemics on networks. Phys Rev Lett. 2015;115:078701. doi: 10.1103/PhysRevLett.115.078701. [DOI] [PubMed] [Google Scholar]

[CR17] Kurtz TG. Solutions of ordinary differential equations as limits of pure jump Markov processes. J Appl Probab. 1970;7(1):49–58. doi: 10.1017/S0021900200026929. [DOI] [Google Scholar]

[CR18] Kurtz TG. Limit theorems for sequences of jump Markov processes approximating ordinary differential processes. J Appl Probab. 1971;8(2):344–356. doi: 10.1017/S002190020003535X. [DOI] [Google Scholar]

[CR19] Leung KY, Diekmann O. Dangerous connections: on binding site models of infectious disease dynamics. J Math Biol. 2017;74:619–671. doi: 10.1007/s00285-016-1037-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] Miller JC. The spread of infectious disease through clustered populations. J R Soc Interface. 2009 doi: 10.1098/rsif.2008.0524. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] Miller JC. A note on a paper by Erik Volz: SIR dynamics in random networks. J Math Biol. 2011;62(3):349–358. doi: 10.1007/s00285-010-0337-9. [DOI] [PubMed] [Google Scholar]

[CR22] Miller JC. A note on the derivation of epidemic final sizes. Bull Math Biol. 2012;74(9):2125–2141. doi: 10.1007/s11538-012-9749-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] Miller JC, Slim AC, Volz EM. Edge-based compartmental modelling for infectious disease spread. J R Soc Interface. 2011;9:890–906. doi: 10.1098/rsif.2011.0403. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] Pastor-Satorras R, Castellano C, Van Mieghem P, Vespignani A. Epidemic processes in complex networks. Rev Mod Phys. 2015;87:925. doi: 10.1103/RevModPhys.87.925. [DOI] [Google Scholar]

[CR25] Röst G, Vizi Z, Kiss IZ (2016) Pairwise approximation for SIR type network epidemics with non-Markovian recovery. arXiv:1605.02933 [DOI] [PMC free article] [PubMed]

[CR26] Sharkey KJ. Deterministic epidemiological models at the individual level. J Math Biol. 2008;57:311–331. doi: 10.1007/s00285-008-0161-7. [DOI] [PubMed] [Google Scholar]

[CR27] Sherborne N, Miller JC, Blyuss KB, Kiss IZ (2016) Mean-field models for non-Markovian epidemics on networks: from edge-based compartmental to pairwise models. arXiv:1611.04030 [DOI] [PMC free article] [PubMed]

[CR28] Simon PL, Taylor M, Kiss IZ. Exact epidemic models on graphs using graph-automorphism driven lumping. J Math Biol. 2011;62(4):479–508. doi: 10.1007/s00285-010-0344-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] Trapman P. Reproduction numbers for epidemics on networks using pair approximation. Math Biosci. 2007;210:464–489. doi: 10.1016/j.mbs.2007.05.011. [DOI] [PubMed] [Google Scholar]

[CR30] Volz E. SIR dynamics in random networks with heterogeneous connectivity. J Math Biol. 2008;56(3):293–310. doi: 10.1007/s00285-007-0116-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] Wilkinson RR, Ball FG, Sharkey KJ (2016) The deterministic Kermack-McKendrick model bounds the general stochastic epidemic. Scheduled for publication. J Appl Probab 53(4):1031–1040

[CR32] Wilkinson RR, Sharkey KJ. Message passing and moment closure for susceptible–infected–recovered epidemics on finite networks. Phys Rev E. 2014;89:022808. doi: 10.1103/PhysRevE.89.022808. [DOI] [PubMed] [Google Scholar]

PERMALINK

The relationships between message passing, pairwise, Kermack–McKendrick and stochastic SIR epidemic models

Robert R Wilkinson

Frank G Ball

Kieran J Sharkey

Abstract

Introduction

The stochastic model (non-Markovin network-based SIR dynamics)

The message passing system and its unique solution

Theorem 1

Proof

Bounding the expected epidemic size at time t

Theorem 2

Proof

Corollary 1

The homogeneous stochastic model

Definition 1

The homogeneous message passing system

Example 1

Example 2

Epidemiological results

Theorem 3

Proof

Theorem 4

Proof

Remark 1

The homogeneous message passing system gives the same epidemic time course as a pairwise model

Theorem 5

Proof

Corollary 2

Proof

Corollary 3

Proof

The homogeneous message passing system gives the same epidemic time course as the Kermack–McKendrick model (asymptotically)

Theorem 6

Proof

Corollary 4

Proof

Discussion

Acknowledgements

Appendix 1: Proof of Theorem 1

Appendix 2: Proof of Theorem 2

Appendix 3: Proof of Theorem 5

Appendix 4: Continuity conditions for the application of Leibniz’s integral rule and Gronwall’s inequality

Appendix 5: Proof of (40)

Appendix 6: Proof of Corollary 4

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Bounding the expected epidemic size at time $t$