Skip to main content
Springer logoLink to Springer
. 2023 Aug 2;87(2):36. doi: 10.1007/s00285-023-01967-9

Necessary and sufficient conditions for exact closures of epidemic equations on configuration model networks

István Z Kiss 1,2,, Eben Kenah 3, Grzegorz A Rempała 3
PMCID: PMC10397147  PMID: 37532967

Abstract

We prove that it is possible to obtain the exact closure of SIR pairwise epidemic equations on a configuration model network if and only if the degree distribution follows a Poisson, binomial, or negative binomial distribution. The proof relies on establishing the equivalence, for these specific degree distributions, between the closed pairwise model and a dynamical survival analysis (DSA) model that was previously shown to be exact. Specifically, we demonstrate that the DSA model is equivalent to the well-known edge-based Volz model. Using this result, we also provide reductions of the closed pairwise and Volz models to a single equation that involves only susceptibles. This equation has a useful statistical interpretation in terms of times to infection. We provide some numerical examples to illustrate our results.

Keywords: Epidemics, Networks, Inference, Pairwise models, Survival analysis

Introduction

Many models of the transmission dynamics of infectious diseases (e.g., Khuda Bukhsh et al. 2022; Kiss et al. 2017; Gross et al. 2006; Ball et al. 2019; Risau-Gusmán and Zanette 2009; Jacobsen et al. 2018) represent contacts as a random graph of N individuals (nodes) formed using the configuration model (Molloy and Reed 1995; Bollobás 2001). Node degrees are typically assumed to be independent and identically distributed, as in the Newman–Strogatz–Watts (NSW) random graph construction Newman et al. (2001). Unfortunately, the systems of equations required to fully describe the stochastic dynamics of epidemics on such networks are often too large to handle, even for moderate values of N. A common simplification is to average stochastic quantities, but this still often leads to infinite-sized systems of equations. To address this challenge, some authors have employed a “closure” technique to create a reduced and closed (finite) system of equations by expressing terms corresponding to larger structures in terms of smaller structures. In most cases, this representation involves an approximation, but it can be exact in the case of SIR dynamics on configuration model networks, as demonstrated in Kiss et al. (2017).

Approximating stochastic epidemics on networks is an important problem that has received significant attention, leading to the development of several mean-field models that are represented in terms of systems of ordinary differential equations. The pairwise model Rand (1999a); Keeling (1999) is based on a set of equations for the expected number of susceptible ([S]) and infected ([I]) nodes and the expected number of S–I ([SI]) and S–S ([SS]) pairs. It relies on a closure that approximates expected number of triples in terms of singles and pairs, which breaks the dependence on ever-higher moments. The Volz model Volz (2008) is based on a system of differential equations that relies on the probability generating function (PGF) of the degree distribution as well as edge-centric quantities (such as the number of edges with nodes in certain states) rather than node-centric quantities (such as the number of infected or susceptible nodes). This model gave excellent agreement with simulations, and it was formally proven to be the large-N limit of a stochastic SIR epidemic on a configuration model network by Decreusefond and colleagues Decreusefond et al. (2012) More recently, Jacobsen and colleagues Jacobsen et al. (2018) provided an alternative method to derive the mean-field limit of a stochastic SIR model on a multi-layer network that we refer to as dynamical survival analysis (DSA). This approach results in a mean-field model over variables different from those in Volz’s approach, but it also shows the exactness of the Volz model in the large-network limit. The DSA formulation allows us to reinterpret the epidemic from a statistical viewpoint (e.g., by approximating the probability that a typical node who was susceptible at time t=0 is still susceptible at time t>0).

In this paper, we show that the pairwise model closure is exact if and only if the contact network has a degree distribution that is Poisson, binomial, or negative binomial (which we call Poisson-type distributions). Once this condition is satisfied, the limiting pairwise model closure is equivalent to the edge-based model proposed by Volz (2008) and extended by Miller (2011) as well as to the network-based DSA model KhudaBukhsh et al. (2020). We also show that the equivalence between the Volz–Miller and DSA models holds for any degree distribution with finite variance and that it allows statistical inference for epidemic model, as applied recently to early COVID-19 pandemic modelling Di Lauro et al. (2022); KhudaBukhsh et al. (2023).

The rest of the paper is organised as follows: In Sect. 2, we briefly describe stochastic epidemic dynamics on a configuration model network along with limiting approximations based on the pairwise, Volz, and DSA approaches. In Sect. 3, we introduce and characterise the class of Poisson-type distributions and then present our main result on the necessary and sufficient condition for the exact closure of the pairwise network model. This result is more precise than that obtained in Jacobsen et al. (2018), but it is less general as it only covers single-layer networks. In Sects. 4 and 5, we provide additional details on DSA model’s connection with statistical inference and offer concluding remarks. Additional calculations on the DSA and Volz models are presented in the Appendix.

Network epidemic models

We describe the underlying dynamics of the stochastic SIR epidemic process on a network of size N as follows: At the start of an epidemic, we pick m initially infectious individuals at random from the population. An infectious individual remains so for an infectious period that is sampled from an exponential distribution with rate γ. During this period, s/he makes contact with his or her immediate neighbours according to a Poisson process with intensity β. If a contacted individual is still susceptible at the time of the contact, s/he will immediately become infectious. After the infectious period, the infectious individual recovers and is immune to further infections. All infectious periods and Poisson processes are assumed to be independent of each other.

The epidemic is assumed to evolve on a configuration model network that is constructed as follows: Each node is given a random number of half edges according to a specified degree distribution (pk), and all half edges are matched uniformly at random to form proper edges.1 Although the exact behaviour of this SIR epidemic process is quite complicated, there exist several approximations that rely on aggregated or averaged quantities. To describe them, generating functions are useful.

Probability generating function

If pk is the probability that a randomly chosen node has degree k, then the probability generating function (PGF) of the degree distribution is

ψ(u)=k=0pkuk.

The PGF ψ contains a tremendous amount of information about epidemic dynamics on configuration model networks. Let θ be the probability that an initially susceptible node of degree one remains uninfected at time t in an infinite network. Then, assuming no variation in infectiousness or susceptibility to infection among nodes except for their degree, the probability that a node with degree k remains uninfected equals the probability θk that infection has not crossed any of its edges (see Volz (2008)). Summing over all possible k shows that ψ(θ) is the probability that a randomly chosen node remains susceptible in an infinite network. The degree distribution of the remaining susceptible nodes has the PGF

uk=0(pkθk)ukk=0pkθk=ψ(θu)ψ(θ), 1

which equals ψ(u) when θ=1. Via (1), ψ tells us about the properties of a node chosen uniformly at random from the set of susceptible nodes.

The first derivative of ψ tells us about the mean degree of susceptible nodes and about the properties of a node reached by crossing an edge. At a given value of θ, the mean degree of the remaining susceptible nodes is

dduψ(θu)ψ(θ)|u=1=θψ(θ)ψ(θ), 2

which equals ψ(1) when θ=1. If we cross an edge, the probability that we end up at a node with degree k is proportional to k. If we start at a node chosen uniformly at random and cross an edge to reach a neighbour, the number of edges we can use to reach a third node has the PGF

uk=1(kpk)uk-1k=1kpk=ψ(u)ψ(1).

If you are a susceptible node with a neighbour of degree k, this neighbour remains susceptible as long as infection has not crossed any of the k-1 edges that lead to a third node. Thus, the probability that a neighbour of a susceptible node remains susceptible is ψ(θ)/ψ(1). If we cross an edge to reach a susceptible neighbour, the number of edges we can cross to reach a third node has the PGF

uk=1(kpkθk)uk-1k=1kpkθk=ψ(θu)ψ(θ).

This distribution is often called the excess degree distribution (at the time t corresponding to θ), and it plays an important role in the dynamics of epidemics on networks. At a given value of θ, the mean excess degree of susceptible nodes is

dduψ(θu)ψ(θ)|u=1=θψ(θ)ψ(θ), 3

where ψ represents the second derivative of ψ.

Pairwise model

The pairwise model provides an intuitive way of describing the dynamics of an SIR epidemic on a configuration model graph. The pairwise model equations, as proposed for instance in Rand (1999a), are:

[S˙]=-β[SI],[I˙]=β[SI]-γ[I],[R˙]=γ[I],[SI˙]=-γ[SI]+β([SSI]-[ISI])-β[SI],[SS˙]=-2β[SSI], 4

where [A], [AB], [ABC] with A,B,C{S,I,R} stand for the number of singles, doubles and triples in the entire network with the given sequence of states when each group is counted in all possible ways. More formally,

[ABC]=i=1Nj=1Nk=1NaijajkIi(A)Ij(B)Ik(C), 5

where (aij)i,j=1,2,,N is the adjacency matrix of the network with entries either zero or one and Ii(A), Ii(B), and Ii(C) are binary variables that equal one when the status of i-th individual is A, B, and C, respectively, and equal zero otherwise. The singles [A] and doubles [AB] are similarly defined. We consider undirected networks with no self-loops, so aii=0 and aij=aji.

To completely describe the model, additional equations for triples are needed. These will depend on quadruples, which will depend on quintuples, and so on. To make the model tractable in the face of an ever-increasing number of variables and equations, one often introduces the notion of a “closure” in which larger structures (e.g., triples) are represented by smaller ones (e.g. pairs). The model (4) can be closed using the methods described in Sect. 3.

The two models that we describe next do not require closure and are known to be exact in the large network limit (i.e., as N) Decreusefond et al. (2012); Bohman and Picollelli (2012); Barbour and Reinert (2013); Janson et al. (2014). However, they are less straightforward to interpret.

Volz’s model

In addition to the limiting (N) probability θ defined in Sect. 2.1, let us also introduce the limiting probabilities pI and pS that a randomly selected edge with one susceptible vertex is of type SI and SS, respectively. In this notation, Volz’s mean-field equations Volz (2008) are:

θ˙=-βpIθ,p˙I=βpSpIθψ(θ)ψ(θ)-βpI1-pI-γpI,p˙S=βpSpI1-θψ(θ)ψ(θ),xS=ψ(θ),x˙I=βpIθψ(θ)-γxI, 6

where the derivative with respect to time is marked with a dot and the derivative with respect to θ is marked with a prime. Here xS and xI denote the limiting (N) proportions of susceptibles and infected, respectively. Note that the first three equations are decoupled from the remaining two and that the proportion of recovered may be obtained from the conservation relationship. The initial conditions are

xS(0)=θ(0)=pS(0)=1,xI(0)=pI(0)=ρ, 7

where 0<ρ1.

DSA model

An alternative description of the limiting dynamics of a large configuration model network under an SIR epidemic was given in Jacobsen et al. (2018). Although originally considered in the context of multi-layer networks, its single layer version has been applied recently to statistical inference problems under the name “dynamical survival analysis” (DSA) KhudaBukhsh et al. (2023). In this approach, the limiting equations are derived in terms of the limiting (N) proportions xSI of SI-type and xSS of SS-type edges and the additional quantity xθ. In Appendix B, we show that the latter coincides with the probability θ defined in Sect. 2.1. The equations are:

x˙θ=-βxSIψxθ,x˙SS=-2βxSIxSSψxθψxθ2,x˙SI=xSI[β(xSS-xSI)ψxθψxθ2-(β+γ)],x˙S=-βxSI,x˙I=βxSI-γxI. 8

As in Volz’s system, the first three equations do not depend explicitly on the dynamics of xS and xI and therefore may be decoupled from the remaining two equations. The initial conditions are:

xS(0)=xθ(0)=1,xI(0)=ρ,xSS(0)=μ,xSI(0)=μρ, 9

where 0<ρ1 and μ>0.

Closing the pairwise model

In practice, one needs to define the time dynamics of the triples [SSI] and [ISI] to use the system (4). Typically, these equations are closed by approximating the dynamics of triples using pairs. This method is referred to as the “triples closure” in House and Keeling (2011) and the “pair approximation” or “pairwise closure” in Jacobsen et al. (2018).

Exact closure condition

While various justifications of closures have been proposed before Kiss et al. (2009), we present a slightly different justification that is focused on the PGF of the degree distribution. The form of the degree distribution plays a key role in obtaining necessary and sufficient conditions on the network to ensure that pairwise closures are exact.

Let [AjBkC] indicate the number of connected triplets ABC as defined in Eq. (5) such that the node in state A has degree j, the node in state B has degree k, and the node in state C has degree . Then

[ABC]=j,k,l[AjBkC] 10

and similarly for [A] and [AB]. Let us also define [S]:=[SS]+[SI]. We derive a closure condition starting from the finest resolution, where we account for the degree of each node. Let state A below denote either S or I. We are interested in approximating [ASI]. Note that we may approximate, in two stages, as follows:

[AjSkI](k-1)[AjSk][SkI]k[Sk](k-1)[AjSk][SI][S], 11

where we assume that a degree k susceptible node’s neighbour is as likely to be infected as any other susceptible node’s neighbour. Intuitively, the first approximation is valid because we start with an AjSk pair and each of the k-1 additional edges connected to the Sk node leads to an I node with probability [SkI]/(k[Sk]). The second approximation follows from the configuration model. These will be used repeatedly in what follows. Summing over the index alone we get:

[AjSkI]=[AjSkI](k-1)[AjSk][SI][S](k-1)[AjSk][SI][S]. 12

A similar approximation to [AjSk] and summation over j leads to [ASk][AS]×(k[Sk]/[S]). This, in turn, leads to:

[ASkI]=j[AjSkI]=j(k-1)[AjSk][SI][S](k-1)[ASk][SI][S](k-1)k[Sk][AS][SI][S]2. 13

Finally, summing over k leads to:

[ASI]=k[ASkI]=k(k-1)k[Sk][AS][SI][S]2[AS][SI][S]2k(k-1)k[Sk]. 14

It remains to handle kk(k-1)[Sk].

Recall that the variable θ (xθ in the DSA model) is the probability that infection has not crossed a randomly chosen edge, and it decreases over time. At a given value of θ, a node of degree k remains susceptible with probability θk, so [Sk]Npkθk where pk is the probability mass on k in the degree distribution. In terms of the degree distribution PGF ψ, for large N we have approximately (see, for instance, House and Keeling (2011); Jacobsen et al. (2018)):

[S]Nψ(θ),[S]kkNpkθk=Nθψ(θ),k(k-1)k[Sk]Nθ2ψ(θ). 15

Using these and (14) leads to:

[ASI]ψ(θ)ψ(θ)ψ(θ)2[AS][SI][S]. 16

Because θ is a new dynamic variable, an equation for it is needed. However, we need no more equations as long as

ψ(θ)ψ(θ)ψ(θ)2=κ=constant, 17

in which case the dependency on θ is curtailed. If θ is the probability that disease has not crossed a randomly chosen edge, then it follows from Eqs. (2) and (3) that the left-hand side of Eq. (17) is the mean excess degree of susceptible nodes divided by their mean degree. Therefore, the condition above simply implies that this ratio remains constant as the susceptible nodes are depleted over time. Below, we show that the networks for which this property holds can be explicitly characterized. Remarkably, for such networks, (17) is equivalent to the exact closure—that is to the asymptotic (N) equality in (16).

Poisson-type distributions

Assuming that ψ(0),ψ(0)>0, it follows that ψ,ψ>0 on the domain [0, 1] and the condition (17) can be rewritten as

ψ(u)ψ(u)=κψ(u)ψ(u). 18

for any u[0,1]. Upon integrating, we get the first-order differential equation

ψ(u)=αψ(u)κ 19

for arbitrary constants α>0 and κ>0. Because ψ is analytic, the equation above is defined beyond the original domain—in particular in the small right-side neighbourhood of the natural initial condition ψ(1)=1.

Table 1 below presents a family consisting of three distributions whose PGFs satisfy the ODE (19). We refer to these distributions as Poisson-type (PT) distributions. It turns out that being the PGF of a PT distribution is necessary and sufficient for (19) to hold:

Table 1.

The PT random variables whose PGFs satisfy (19)

Condition Family Parameters
κ(0,1) Binomial: BINOM(n,p) n=11-κ, p=α(1-κ)
κ=1 Poisson: POI(λ) λ=α
κ>1 Negative binomial: NB(r,p) r=1κ-1, p=α(κ-1)α(κ-1)+1

Theorem 1

(Characterization of the Poisson-type distributions). The PGF of a random variable satisfies (19) if and only if the random variable belongs to the PT family listed in Table 1.

Proof

If κ=1, the ODE given by (19) and the PGF condition ψ(1)=1 imply that ψ(u)=eu(α-1), which is the PGF of the Poisson random variable POI(α) in Table 1. If κ1, separating variables and integrating gives us

ψ(u)1-κ1-κ=αu+c, 20

for some constant c. Taking into account the condition ψ(1)=1, we get

ψ(u)=[α(1-κ)(u-1)+1]11-κ. 21

Now, consider separately the cases κ<1 and κ>1.

  • Case κ(0,1) Because ψ(s)(0)0 for each integer s0, we must have n=(1-κ)-1 is a positive integer and α(1-κ)1. Writing
    ψ(u)=[1-α(1-κ)+α(1-κ)u]n, 22
    we recognise ψ(u) as the PGF of the binomial random variable BINOM(n,p) with p=α(1-κ). Note that we allow here for a degenerate distribution with p=1, which corresponds to ψ(u)=un, that is, an n-regular degree distribution.
  • Case κ>1 Writing
    ψ(u)=1α(κ-1)+11-α(k-1)α(κ-1)+1u1κ-1, 23
    we recognise ψ(u) as the PGF of the negative binomial distribution NB(r,p) with r=1κ-1 and p=α(κ-1)α(κ-1)+1. Note that here necessarily p<1.

Thus by considering all possible values of κ, it follows that there are only three PGF solutions to Eq. (19) corresponding to the families of random variables listed in Table 1.

Note that although the distributions in Table 1 describe all possible solutions to (19), the equivalence of (17), (18), and (19) holds only under the conditions ψ(0)>0,ψ(0)>0. This excludes the special case p=1 in the family BINOM(n,p) corresponding to an n-regular degree distribution. However, for that particular case, we have ψ(u)=un, which also satisfies (17) (but not 18).

Closure theorem and models equivalence

We are now in a position to state the main result on the exactness of the pairwise closure. Here, we use “exactness” in the sense defined in Jacobsen et al. (2018), or, equivalently, in Janson et al. (2014). In both cases, the notion implies that the appropriately scaled stochastic vector of susceptibles, infecteds, and recovereds tends in an appropriate sense to a deterministic vector whose components are described by the system of ordinary differential equations given by (6) or (8). Yet another equivalent definition of the exact closure is that the equality in the triple approximation condition (24) holds upon dividing both its sides by N and taking the limit N.

Theorem 2

(Exact pairwise closure). Assume that either ψ(0)>0,ψ(0)>0, and ψ(1)<, or that ψ(u)=un. The closure condition

[ASI]κ[AS][SI][S] 24

for A{S,I} in the pairwise model given by system (4) is exact (that is, equality in (24) with both sides multiplied by N-1 holds asymptotically as N) if and only if the underlying configuration model network has a Poisson-type (PT) degree distribution. Furthermore, κ=(n-1)/n<1, κ=1, or κ=(r+1)/r>1 if the degree distribution is BINOM(n,p), POI(λ), or NB(r,p), respectively.

Proof

Consider first evaluation of the term ψxθ/ψxθ2. In Table 2, we show that this term is equivalent to κ/xS for all PT distributions. With this in mind, we are ready to show the equivalence between the limiting pairwise model and the DSA models under (24). In view of Theorem 1, this suffices to establish the current result.

Table 2.

Resolving the ψxθ/ψxθ2 term in the DSA equations for binomial, Poisson, and negative binomial distributions

Binomial Poisson Negative binomial
Parameter(s) (np) λ (rp)
ψ(x) (1-p+px)n eλ(x-1) (1-p1-px)r
ψ(x) np(1-p+px)n-1 λeλ(x-1) rp(1-p)r(1-px)r+1
ψ(x) n(n-1)p2(1-p+px)n-2 λ2eλ(x-1) r(r+1)p2(1-p)r(1-px)r+2
ψ(xθ)ψ(xθ)2 n-1n×1xS 1×1xS r+1r×1xS
κ n-1n 1 r+1r

Note that ψ(xθ)=xS

Let us show equivalence between the evolution equations for [SI] and xSI. Recall that xA=limN[A]/N and xAB=limN[AB]/N and that these limits exist uniformly in probability over any finite time interval Jacobsen et al. (2018). From the equation for SI in the pairwise model and (24)

[SI]˙=β([SSI]-[ISI])-[SI](β+γ)=[SI]β([SS]-[SI])κ[S]-(β+γ). 25

Dividing both sides of the last equation by N, taking the limit N [which can be done in view of the appropriate LLN, see Jacobsen et al. (2018)], and using the fact that ψxθ/ψxθ2=κ/xS, we arrive at:

x˙SI=xSI[βxSS-xSIκxS-(β+γ)]=xSIβxSS-xSIψxθψxθ2-(β+γ) 26

which is identical to the equation for xSI in the DSA model. We note that, when the degree distribution is PT, the equation for xθ is no longer needed and the equivalences of the remaining equations follow similarly as above. For instance, from (24) and [SS]˙=-2β[SSI] it follows that x˙SS=-2βκxSSxSI/xS=-2βxSSxSIψxθ/ψxθ2. The exactness of the DSA model Jacobsen et al. (2018) as the limit of the stochastic SIR model on a configuration network, implies thus the exactness of the scaled pairwise model (and (24)) as N.

Figure 1 summarises the equivalences between models. The equivalence of the Volz model and the DSA model for an arbitrary degree distribution is shown in the Appendix.

Fig. 1.

Fig. 1

Summary of model equivalence results. The top is the pairwise model, the bottom left is the DSA model, and the bottom right is the Volz model

The top row of Fig. 2 shows numerical evidence of the exactness of the closure in the pairwise model for PT networks. For PT networks, the agreement between the pairwise model and the expected value of explicit stochastic simulations is excellent. The DSA model continues to work well for non-PT networks (see bottom row of Fig. 2), and it is clear that κ is not constant in time in this case. As expected, this means that none of the three possible closures work. In the left panel of the bottom row we plot the output from the pairwise model for κ=(n-1)/n (dashed line) and κ=1 (dotted line). Both underestimate prevalence which in this case is driven by the 20% of highly connected nodes. This is captured poorly by both closures.

Fig. 2.

Fig. 2

Top row: From left to right, epidemics on networks with regular (each node has n=6 links), Poisson (λ=10) and negative binomial (r=10 and p=1/2) degree distributions are plotted, respectively. Individual stochastic realisations are plotted with thin grey lines, their mean with the thick red line, and the solution of the corresponding pairwise model with a solid black line. Bottom left: Epidemics on a network where 80% of nodes have degree 4 and the rest have degree 34. The DSA model (solid black line) is used to match the average epidemics. The pairwise model closures with κ=(n-1)/n (dashed line) and κ=1 (dotted line) are also plotted. Bottom right: Plot showing that, for a non-PT network (like the network with two distinct value for the degree of nodes), κ is not constant in time. The value of κ at time t=0 (solid constant line) is short lived as shown κ(t) (dashed line) as given in Eq. (17). Other parameters are: N=10,000 nodes, recovery rate γ=1, and per-contact transmission rate β=0.4 for the regular network and β=0.2 for the networks with Poisson, negative binomial, and mixed degree distributions. Epidemics start with 250 infected nodes chosen at random and only epidemics reaching 500 infected individuals are retained. We average over 15 network realisations and 15 epidemics on each network (colour figure online)

Survival analysis perspective

The exact closure condition implies that, under the assumption of a PT degree distribution, the pairwise and DSA models are equivalent. One of the benefits of this equivalence is that the pairwise model shares the statistical interpretation of the DSA model. Indeed, as shown in KhudaBukhsh et al. (2020) [see also discussion with examples in Choi et al. (2019); Bastian and Rempala (2020); Di Lauro et al. (2022); Vossler et al. (2022)), we can interpret the system of Eq. (8] in terms of a statistical model for times to infection. To this end, as in KhudaBukhsh et al. (2020), we may consider St:=xS(t) as the survival probability of a typical node (i.e., the probability that a typical node who was susceptible at time t=0 remains susceptible at time t>0). Note that S0=1 follows from the assumed initial conditions (9) and that S>0, so St is an improper survival function. In this section, we show how to derive a single autonomous differential equation for St (or xS) that allows numerical calculation of the survival probability for any t[0,) solely in terms of the network model parameters. We achieve this in several steps: First, we derive an integral that relates xSS and xS. Because ψxθ/ψxθ2=κ/xS under the pairwise closure condition, we obtain:

x˙SSx˙S=2κxSSxS. 27

Integrating this and using the initial conditions xS(0)=1 and xSS(0)=μ leads to:

xSS(t)=μxS(t)2κ. 28

Second, the equations for xS can be rewritten as:

x˙S=-βxSIxSxS=-βxDxS, 29

where xD=xSI/xS is considered a new variable for which an evolution equation is needed. Considering the derivative of xD, and plugging in the expressions for x˙SI and x˙S, we obtain:

x˙D=x˙SIxs-xSIx˙SxS2=βκxSSxSI-βκxSIxSI-(β+γ)xSIxS+βxSI2xS2=βκμxs2κ-1xSIxS-βκxSIxS2-(β+γ)xSIxS+βxSIxS2=β(1-κ)xD2+βκμxS2κ-1-(β+γ)xD. 30

Given that the equations for xSI and xSS no longer depend on xθ, the system now simplifies to three key equations:

x˙S=-βxDxS,x˙I=βxDxS-γxI,x˙D=β(1-κ)xD2+βκμxS2κ-1-(β+γ)xD. 31

Finally, we can manipulate these equations further. In particular, looking at

x˙Dx˙S+(1-κ)xDxS=-κμxS2κ-2+β+γβ1xS, 32

and considering xD as a function of xS, we can use an integrating factor. This leads to:

-S˙t=β~(1-Stκ)Stκ+γ~1-κSt(1-Stκ-1)+ρ~Stκifκ1,β~(St-St2)+γ~StlogSt+ρ~Stifκ=1, 33

where we now replaced xS(t) by St and set ρ~=βμρ, γ~=β+γ, and β~=μβ. As already noted, the initial condition inherited from (9) is S0=1. Because necessarily S˙=0, Eq. (33) implies that the limiting value S>0 has to satisfy

β~(1-Sκ)+ρ~=γ~1-κ(1-S1-κ)ifκ1, 34
β~(1-S)+ρ~=-γ~logSifκ=1. 35

It is of interest to note that when the degree distribution is Poisson (κ=1), then Eqs. (33) and (35) are identical to the ones known from the mass-action SIR dynamics.

This analysis shows that the dynamics of an SIR epidemic on a configuration model network with a PT degree distribution can be summarised with a single self-contained survival equation describing the evolution of survival probability St. This leads to the following interesting statistical consideration that was already noted for mass-action SIR models in KhudaBukhsh et al. (2020); Di Lauro et al. (2022); Vossler et al. (2022). Assuming that, over a time interval [0, T] where T, we observe the times of infection (t1,,tk) of a randomly selected set of k initially susceptible nodes of our network, we may write the approximate log-likelihood function as

(β~,γ~,ρ~|t1,,tk)=i=1klogSti-klog(1-ST). 36

To obtain quantities other than St, evaluation of additional ODEs is needed as discussed, for instance, in KhudaBukhsh et al. (2020) or Khuda Bukhsh et al. (2022). Let us also note that, since the DSA and Volz models are equivalent (see Lemma 1 in the Appendix), the representation St in  (33) can be similarly derived directly from Volz’s model (6).

Discussion

Over the last two decades, two types of disease network models have emerged as particularly relevant in many practical applications (including the recent COVID-19 pandemic, see KhudaBukhsh et al. (2023)): the so-called pairwise Rand (1999b); Keeling (1999) and edge-based Volz (2008); Miller et al. (2012) approaches. More recently, a version of an edge-based approach, dubbed DSA, was proposed in Jacobsen et al. (2018); KhudaBukhsh et al. (2020) to facilitate statistical inference.

In this paper, we have shown that the three approaches are equivalent and asymptotically exact under the assumption that the contact network underlying the spread of disease is a configuration model random graph with one of the three Poisson-type (PT) degree distributions: Poisson, binomial, or negative binomial. Perhaps more interestingly, we have shown that the pairwise closure for an epidemic on a configuration model network is exact if and only if the ratio of mean excess degree to mean degree for susceptible nodes remains constant over time (as the susceptible nodes are depleted). This condition holds if and only if the degree distribution is PT. As an interesting corollary of our results, we obtained a single equation representation of the pairwise model that allows parameter estimation from time series data marginalised over the network degree distribution. This finding is practically useful as it allows, for instance, statistical inference based solely on the disease incidence data as in the classical, homogeneous SIR models. Because these statistical methods are based on survival times in susceptible individuals, statistical inference can be based on observation of a random sample of the population, which could allow more accurate tracking of future epidemics.

Acknowledgements

IZK acknowledges support from the Leverhulme Trust for the Research Project Grant RPG-2017-370. EK and GAR were partially supported by the US National Science Foundation (NSF) Grant DMS-1853587. The contents are solely the responsibility of the authors and do not represent the official views or policy of the NSF.

A Summary of notation

The following notation is used throughout the paper and in particular in the next section.

  • β:= the force of infection per infectious neighbour (i.e., the constant rate at which infectious nodes infect a neighbour).

  • γ:= the recovery rate (i.e., the constant rate at which infected nodes become recovered).

  • pk:= the probability that a node will have degree k.

  • ψ(x):= the probability generating function for the degree distribution pk.

  • xS:= the proportion of nodes susceptible at time t.

  • xI:= the proportion of nodes infectious at time t

  • xAB:= the proportion of AB-type edges at time t

  • AX:= the set of arcs (ij) such that node i is in set X.

  • MX:= the proportion of arcs in set AX.

  • AXY:= the set of arcs (ij) such that iX and jY.

  • MXY:= the proportion of arcs in set AXY.

  • θ:= the probability that infection has not crossed a randomly chosen edge at time t, also denoted by xθ in some of the models.

  • pI:=MSI/MS, the probability that an arc (ij) with a susceptible i has an infectious j.

  • pS:=MSS/MS, the probability that an arc (ij) with a susceptible i has a susceptible j.

B Equivalence of Volz’s and DSA models

Lemma 1

The Volz model (6)–(7) and the DSA model (8)–(9) are equivalent.

Proof

While our starting point is the system of Volz’s original set of Eq. (6), it is useful to recast these over a different state space in order to show equivalence with the DSA system in Eq. (8). Here, we move from the state space in terms of (θ,pI,pS) to a state space in terms of (θ=xθ,xSI,xSS), where xSI and xSS are simply the limiting counts of all SI and SS -type edges (counted in both directions) scaled by N (the number of nodes in the network) with the limit taken as N.

As alluded to above, θ and xθ have exactly the same interpretation, but we denote them differently to differentiate consistently between models over different state spaces. We now proceed to show how one moves from the original Volz model to the DSA equations. We start by showing that the first equation in (6) is equivalent to the first equation in (8). Starting from (6), and taking into account that MSI=xSI/ψ(1) and that MS=xθψ(xθ)/ψ(1) as shown in Volz (2008), we obtain

θ˙=-βpIθ=-βMSIMSθ=-βxSIψ(1)xθψ(xθ)ψ(1)xθ=-βxSIψ(xθ)=x˙θ. 37

Showing the equivalence between the other equations requires an extra step. That is, the evolution equations for pI and pS need to be rewritten explicitly in terms of SI- and SS-type edges. In Volz (2008), it was shown that the equations for MSI can be written as

M˙SI=βpIpS-pIθ2ψ(θ)ψ(1)-(β+γ)MSI. 38

Using the expressions for MSI and MS in terms of the DSA model parameters above and the fact that MSS=xss/ψ(1), we get pI=xSI/(xθψ(θ)) and pS=xSS/(xθψ(xθ)). Substituting these into (38), we get

M˙SI=βpIpS-pIθ2ψ(θ)ψ(1)-(β+γ)MSI=βxSIxθψ(xθ)xSSxθψ(xθ)-xSIxθψ(xθ)xθ2ψ(xθ)ψ(1)-(β+γ)xSIψ(1)=xSIψ(1)β(xSS-xSI)ψ(xθ)ψ(xθ)2-(β+γ)=x˙SIψ(1). 39

Thus, Eq. (6) is equivalent to the equation for xSI in (8). Following Volz (2008), the evolution equation for SS-type edges can be rewritten to

M˙SS=-2βpIpSθ2ψ(θ)ψ(1). 40

Making the same substitutions as before, we get

M˙SS=-2βxSIxθψ(xθ)xSSxθψ(xθ)xθ2ψ(xθ)ψ(1)=-2βxSIxSSψ(xθ)ψ(xθ)2ψ(1)=x˙SSψ(1), 41

which shows that (40) and the equation for xSS in (8) are equivalent. Since the remaining equations in both systems rely on the first three equations that we have just shown to be equivalent, the Volz and DSA models are equivalent under their respective initial conditions.

Data availability

The datasets generated and/or analysed during the current study are available upon reasonable request from the corresponding author.

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

1

The construction technically permits the inclusion of self-loops and unmatched edges, but in practice, these cases are infrequent and can be disregarded as N, provided that the degree distribution has a finite variance. This has been discussed in detail in Section 7 of Van Der Hofstad (2016).

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. Ball F, Britton T, Leung KY, Sirl D. A stochastic sir network epidemic model with preventive dropping of edges. J Math Biol. 2019;78(6):1875–1951. doi: 10.1007/s00285-019-01329-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Barbour A, Reinert G. Approximating the epidemic curve. Electron J Probab. 2013;18:1–30. doi: 10.1214/EJP.v18-2557. [DOI] [Google Scholar]
  3. Bastian CD, Rempala GA. Throwing stones and collecting bones: looking for Poisson-like random measures. Math Methods Appl Sci. 2020;43(7):4658–4668. [Google Scholar]
  4. Bohman T, Picollelli M. Sir epidemics on random graphs with a fixed degree sequence. Random Struct Algorithms. 2012;41(2):179–214. doi: 10.1002/rsa.20401. [DOI] [Google Scholar]
  5. Bollobás B (2001) Random graphs. Cambridge series in advanced mathematics, vol 73. Cambridge University Press
  6. Choi B, Busch S, Kazadi D, Ilunga B, Okitolonda E, Dai Y, Lumpkin R, Saucedo O, KhudaBukhsh WR, Tien J, et al. Modeling outbreak data: analysis of a Ebola virus disease epidemic in DRC. Biomath (Sofia, Bulgaria) 2019;8(2):1910037. doi: 10.11145/j.biomath.2019.10.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Decreusefond L, Dhersin J-S, Moyal P, Tran VC. Large graph limit for an SIR process in random network with heterogeneous connectivity. Ann Appl Probab. 2012;22(2):541–575. doi: 10.1214/11-AAP773. [DOI] [Google Scholar]
  8. Di Lauro F, KhudaBukhsh WR, Kiss IZ, Kenah E, Jensen M, Rempała GA. Dynamic survival analysis for non-Markovian epidemic models. J R Soc Interface. 2022;19(191):20220124. doi: 10.1098/rsif.2022.0124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Gross T, D’Lima CJD, Blasius B. Epidemic dynamics on an adaptive network. Phys Rev Lett. 2006;96(20):208701. doi: 10.1103/PhysRevLett.96.208701. [DOI] [PubMed] [Google Scholar]
  10. House T, Keeling MJ. Insights from unifying modern approximations to infections on networks. J R Soc Interface. 2011;8(54):67–73. doi: 10.1098/rsif.2010.0179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Jacobsen KA, Burch MG, Tien JH, Rempała GA. The large graph limit of a stochastic epidemic model on a dynamic multilayer network. J Biol Dyn. 2018;12(1):746–788. doi: 10.1080/17513758.2018.1515993. [DOI] [PubMed] [Google Scholar]
  12. Janson S, Luczak M, Windridge P (2014) Law of large numbers for the SIR epidemic on a random graph with given degrees. Random Struct Algorithms [DOI] [PMC free article] [PubMed]
  13. Keeling MJ. The effects of local spatial structure on epidemiological invasions. Proc R Soc Lond Ser B Biol Sci. 1999;266(1421):859–867. doi: 10.1098/rspb.1999.0716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Khuda Bukhsh WR, Bastian CD, Wascher M, Klaus C, Sahai SY, Weir MH, Kenah E, Root E, Tien JH, Rempala GA (2022) Projecting COVID-19 cases and subsequent hospital burden in Ohio. medRxiv. 10.1101/2022.07.27.22278117 [DOI] [PMC free article] [PubMed]
  15. KhudaBukhsh WR, Bastian CD, Wascher M, Klaus C, Sahai SY, Weir MH, Kenah E, Root E, Tien JH, Rempała GA. Projecting COVID-19 cases and hospital burden in Ohio. J Theor Biol. 2023;561:111404. doi: 10.1016/j.jtbi.2022.111404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. KhudaBukhsh WR, Choi B, Kenah E, Rempała GA. Survival dynamical systems: individual-level survival analysis from population-level epidemic models. Interface Focus. 2020;10(1):20190048. doi: 10.1098/rsfs.2019.0048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Kiss IZ, Simon PL, Kao RR. A contact-network-based formulation of a preferential mixing model. Bull Math Biol. 2009;71(4):888–905. doi: 10.1007/s11538-008-9386-2. [DOI] [PubMed] [Google Scholar]
  18. Kiss IZ, Miller JC, Simon PL. Mathematics of epidemics on networks. Cham: Springer; 2017. p. 31. [Google Scholar]
  19. Miller JC. A note on a paper by Erik Volz: SIR dynamics in random networks. J Math Biol. 2011;62(3):349–358. doi: 10.1007/s00285-010-0337-9. [DOI] [PubMed] [Google Scholar]
  20. Miller JC, Slim AC, Volz EM. Edge-based compartmental modelling for infectious disease spread. J R Soc Interface. 2012;9(70):890–906. doi: 10.1098/rsif.2011.0403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Molloy M, Reed B. A critical point for random graphs with a given degree sequence. Random Struct Algorithms. 1995;6(2–3):161–180. doi: 10.1002/rsa.3240060204. [DOI] [Google Scholar]
  22. Newman ME, Strogatz SH, Watts DJ. Random graphs with arbitrary degree distributions and their applications. Phys Rev E. 2001;64(2):026118. doi: 10.1103/PhysRevE.64.026118. [DOI] [PubMed] [Google Scholar]
  23. Rand D. Correlation equations and pair approximations for spatial ecologies. Adv Ecol Theory Principles Appl. 1999 doi: 10.1002/9781444311501. [DOI] [Google Scholar]
  24. Rand DA (1999) Correlation equations and pair approximations for spatial ecologies. In: Advanced ecological theory: principles and applications, pp 100–142. Blackwell Science, Oxford
  25. Risau-Gusmán S, Zanette DH. Contact switching as a control strategy for epidemic outbreaks. J Theor Biol. 2009;257(1):52–60. doi: 10.1016/j.jtbi.2008.10.027. [DOI] [PubMed] [Google Scholar]
  26. Van Der Hofstad R (2016) Random graphs and complex networks, vol 43. Cambridge University Press
  27. Volz EM. SIR dynamics in random networks with heterogeneous connectivity. J Math Biol. 2008;56(3):293–310. doi: 10.1007/s00285-007-0116-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Vossler H, Akilimali P, Pan Y, KhudaBukhsh WR, Kenah E, Rempała GA. Analysis of individual-level data from 2018–2020 Ebola outbreak in Democratic Republic of the Congo. Sci Rep. 2022;12(1):1–10. doi: 10.1038/s41598-022-09564-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The datasets generated and/or analysed during the current study are available upon reasonable request from the corresponding author.


Articles from Journal of Mathematical Biology are provided here courtesy of Springer

RESOURCES