Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Oct 1.
Published in final edited form as: J Math Biol. 2012 Aug 22;67(4):869–899. doi: 10.1007/s00285-012-0572-3

Model Hierarchies in Edge-Based Compartmental Modeling for Infectious Disease Spread

Joel C Miller 1, Erik M Volz 2
PMCID: PMC3552133  NIHMSID: NIHMS406442  PMID: 22911242

Abstract

We consider the family of edge-based compartmental models for epidemic spread developed in [11]. These models allow for a range of complex behaviors, and in particular allow us to explicitly incorporate duration of a contact into our mathematical models. Our focus here is to identify conditions under which simpler models may be substituted for more detailed models, and in so doing we define a hierarchy of epidemic models. In particular we provide conditions under which it is appropriate to use the standard mass action SIR model, and we show what happens when these conditions fail. Using our hierarchy, we provide a procedure leading to the choice of the appropriate model for a given population. Our result about the convergence of models to the Mass Action model gives clear, rigorous conditions under which the Mass Action model is accurate.

1 Introduction

The spread of epidemics through populations is affected by many factors such as the infectiousness of the disease, the duration of infection, the distribution of contacts through the population, and the typical duration of contacts. Typically for predicting epidemic spread or intervention effectiveness, we want an accurate, but simple, model that captures the relevant effects.

In our earlier work [11], we introduced the edge-based compartmental modeling approach for the spread of SIR diseases in populations with different contact dynamics. Table 1 summarizes the models and their underlying assumptions. In particular we showed that edge-based compartmental models can capture contact duration and social heterogeneity (variation in contact levels) simultaneously in mathematically and conceptually simple terms. The models we studied all assumed that the population was made up of individuals who were identical except for their contact levels. We modeled the population as a network, with nodes representing individuals joined by edges representing potentially transmitting contacts. We also assumed a simple disease, with transmission occurring at rate β per edge and recovery occurring at rate γ. In this paper we investigate the relationships between models and how to choose the simplest model appropriate for a given population.

Table 1.

The basic models and their underlying assumptions. Contacts are assumed to form and break at some rate which can vary from zero (permanent) to infinite (fleeting). Depending on the process governing contact formation, we may know the actual degree k of an individual or we may know its expected degree.

Model Contact Duration Heterogeneity type
Configuration Model (CM) Permanent Actual degree k
Mixed Poisson (MP) Permanent Expected degree κ
Dynamic Variable-Degree (DVD) Finite Expected degree κ
Dynamic Fixed-Degree (DFD) Finite Actual degree k
Mean Field Social Heterogeneity (MFSH) Fleeting Contact rate k or κ
Dormant Contact (DC) Finite active and dormant Maximum degree km
Mass Action (MA) Fleeting None

The contact duration falls into three possibilities: it can be permanent, finite, or fleeting. In the permanent case, a contact that exists at any time has always existed and will always exist. In the finite case contacts may change over time. In the fleeting case, contacts are so brief that over any macroscopic time scale an individual samples a very large number of neighbors, and so it is safe to assume that the total contact time with infected individuals matches its expected value.

The distribution of contact levels can be split into two types. In the first class of models we discuss (expected degree models), we assign an expected degree κ to a node, with different nodes having different values of κ. The probability that an edge exists between two nodes is proportional to the expected degrees of each node. Edges are created independently of one another, so the existence of an edge between u and v does not alter whether an edge can exist between u and w. Note that the expected degree can take any non-negative real value, and we assume that the distribution of κ is given by the probability density function ρ(κ). If the contacts are permanent, then we arrive at a “Mixed Poisson” network: a node with a given expected degree κ has its actual degree chosen from a Poisson distribution. These are also called Chung-Lu Networks after [4]; these are a type of inhomogeneous random graphs [2, 5], and are almost identical to network classes introduced in [3,17]. If existing contacts break at rate η and (independently) an individual with expected degree κ forms new contacts at rate ηκ, then at any time the networks will be Mixed Poisson networks, but the structure will vary in time. The degree of any individual varies in time, but has average equal to its expected degree. These are the Dynamic Variable Degree networks. If the duration is so short that at every time the contacts are newly selected, then this is the Mean Field Social Heterogeneity model (in the expected degree formulation).

In the second class of models (actual degree models) we assign an actual integer number of contacts k, the degree, to each individual though in the case of the dormant contact model not all of these contacts must be active at all times. We think of an individual as having k stubs (or half-edges) which pair randomly with stubs of other nodes to form edges. In this case the existence of an edge between u and v removes an available stub from u, and so it affects the probability of an edge between u and w. The distribution of degrees is given by the probability mass function P(k). If the contacts are permanent, this is a Configuration Model network [16]. If individuals break contacts in such a way that they immediately form a new contact with other individuals who are simultaneously breaking a contact, then degrees do not change: this is the Dynamic Fixed Degree model. If contacts break so quickly that at each moment the contacts are newly selected, then this is the Mean Field Social Heterogeneity model (in the fixed degree formulation). In a final model, the Dormant Contact model, individuals have a given number km (assigned randomly to each individual) of possible contacts, referred to as “stubs”. A given stub is either active (involved in a contact) or inactive (not in a contact). Inactive stubs join with other stubs at rate η1 and active stubs dissolve their contact at rate η2. This Dormant Contact model reduces to any of the other expected or actual degree models in appropriate limits.

The distinction between the expected degree models and the actual degree models becomes apparent when we calculate the probability that an individual is susceptible. For the expected degree models, there is a continuum of risk levels and so we will have to calculate the per-expected degree probability of not having been infected. In contrast for the actual degree models the risk is discretized. The calculation is slightly different in each case, but the underlying concepts are the same. The expected degree models tend to be marginally more difficult conceptually, but they are simpler mathematically.

K〉 is large while the rate of transmission per edge scales such that β is of order 1 / 〈K〉 and the recovery rate γ is fixed, then the probability that any given edge transmits even once is small. The probability it transmits twice is negligible: the disease “sees” an edge at most once, so whether it is permanent, fleeting, or finite has no impact on disease spread. Thus we may treat the model as if the contacts are fleeting, which simplifies the equations. If further the contact distribution is such that the contact levels are generally close to the mean, then we can neglect variation in contact levels, and turn to the simple mass action model of [6,1]. The precise condition required for this is somewhat technical: we must have |〈K4〉 − 〈K4| / 〈K4 ≪ 1 and 〈K〉 ≫ 1 with β̂ = βK〉 fixed. When this does not hold, there can be a significant deviation away from the mass action model.

In this paper we begin by providing a flow chart leading to selection of the appropriate model for a given population. Following the flow chart, we introduce the hierarchy of the models which underlies the flow chart. We describe the precise assumptions of the models and sketch their derivation. Throughout we assume that the epidemic is started by a very small initial proportion infected. We demonstrate some of the simpler parts of the hierarchy. After that, we consider some of the more difficult aspects of the hierarchy. We finally discuss some of the implications and limitations of our approach. In particular, we note that we give a simple heuristic for when the Mass Action equations are appropriate. In the Supplementary Information (SI) we provide more rigorous justifications for the claims in the hierarchy section. Because of its importance, we include the rigorous proof of conditions under which the Mass Action equations hold in the main text.

2 Model Selection

In Figure 1 we present a flow chart that can be used to select the appropriate model for a given population. The conditions depend on the degree distribution and the rate at which edges change. These are relatively straightforward to measure for a population [14,24,19]. From the observations of population and disease parameters, we can choose the simplest model to accurately represent disease spread in a given population. The equations for each model are developed in Section 3.

Fig. 1. Flow chart for model choice.

Fig. 1

Depending on the how the edge turnover rates compare to the infection and recovery rates, as well as the degree distribution, we arrive at different models. The model abbreviations are as in Table 1. This flow chart assumes that all structure in the population is due to heterogeneity in contact levels.

Most existing modeling of infectious disease spread are based on the Mass Action model [6]. This flow chart gives appropriate conditions under which this model is reasonable. In general, we must have most degrees close to the average degree, but at the same time either contacts must have high turnover or the probability of transmission per contact is very low. If these conditions hold, it is appropriate to use the MA model.

Before using this flow chart it is always prudent to be sure that the population does not violate other assumptions of the models. For example, different contact structure between age groups may require more consideration. If there are important features not captured by these models it may be possible to develop a custom model that captures the relevant detail (see [12,23]). Otherwise we may not be able to rely on edge-based compartmental models and may have to use simulation. Some features, such as non-constant rates of infection or recovery or a latent period can be captured by straightforward generalizations of the models.

Although most of these models are new, some of these (or closely related models) have been applied previously. An early version of the CM model was introduced in [21], and an earlier, closely related approach which only gives final size information has been widely used (e.g., [15,9,10]). The DFD model has been used by [22,20] to study syphilis and HIV infections. The MFSH models have been used widely, primarily to understand HIV spread [1,7,8,13, 18]

The questions asked are somewhat vague in the sense that whether a number is large or not is somewhat a matter of opinion. In Section 3 we address this in more detail. We show how the results of the models converge as the parameter values change.

3 The model hierarchy

In this section we investigate the hierarchy of Figure 2 underlying the flow chart in Figure 1. We first give a brief overview of the standard simple model, the Mass Action model. We then consider the hierarchy of models. We find it convenient to consider the expected degree models before the actual degree models. We consider the models roughly in order of increasing complexity, explaining the underlying assumptions and sketching the derivation of the equations from [11]. If we can reduce the model to another model, we explain why this should happen and give some details of the mathematical explanation. More complete derivations are in the SI.

Fig. 2.

Fig. 2

The hierarchy structure. The edges summarize the relations between the models. A solid arrow from one model to another represents that the target model can be derived as a limiting case of the base model. We give heuristic explanations in the text for these and more rigorous derivations in the SI. A dashed arrow denotes that the target model can be derived as a special case of the base model. The dashed arrows are straightforward and the justification is given in the main text.

The Mean Field Social Heterogeneity models require further attention. We have two formulations, one in terms of expected degree κ and the other in terms of actual degree k. We will not address this initially while we discuss the main structure of the hierarchy, but at the end of this section we show that the two formulations are mathematically equivalent in the sense that each can be derived from the other. So we do not distinguish between the two models in the hierarchy of Figure 2. We will also show that all other models reduce to Mean Field Social Heterogeneity in the large 〈K〉 limit if βK〉 and γ are both constant. In turn we have conditions under which the Mean Field Social Heterogeneity model reduces to the Mass Action model. Given any model with βK〉 and γ fixed, 〈K〉 → ∞ and 〈K4〉 / 〈K4 → 1, we arrive at the Mass Action model1.

3.0.1 The Mass Action model

The Mass Action (MA) model assumes that all individuals have the same rate of contact formation k (or κ) and edges (contacts) are sufficiently short-lived that we may neglect contact duration. The transmission rate per contact is β, and so the combined transmission rate is β̂ = βk. If we take S, I, and R to be the proportion of the population that is infected, then figure 3 leads to

Fig. 3.

Fig. 3

The flow diagram for the MA model. S is the proportion susceptible, I the proportion infected, and R the proportion recovered. The flux from I to R is γI, and the flux from S to I is βIS.

S.=-β^IS,I.=β^IS-γI,R.=γI

However, this may be simplified somewhat by noting that S + I + R = 1. We replace the equation for İ with I = 1 − R

S.=-β^IS,I=1-S-R,R.=γI

3.1 Expected degree models

We now study the expected degree models for which each node has an expected degree κ assigned using the probability distribution function ρ(κ). We allow κ to be a continuous variable. At any given time, the probability that two nodes u and v share an edge is proportional to κu κv, and each edge is assigned independently of all others.

We briefly sketch the approach used to derive equations in these networks. Full details are in [11]. We define Θ as a function of time such that 1 − Θ represents the per-unit κ probability of having been infected. To be precise let u and v be nodes such that v has a slightly larger κ than u: κv = κu + Δκ with Δκ ≪ 1. By calculating the increased risk to v, we can arrive at a formula for s(κ, t) the probability an individual with a given κ is susceptible. The probability that u is susceptible is s(κu, t) and the probability that v is still susceptible is s(κu + Δκ, t) = [1 − (1 − Θ)Δκ]s(κu, t) + Inline graphic(Δ κ2). So in the small Δκ limit, (1 − Θ) Δκ is the probability that the small amount of extra κ v has has ever contributed an edge that has transmitted to v. Taking Δκ → 0, we have ∂s / ∂κ = (1 − Θ)s. So s(κ, t) = exp[−κ (1 − Θ)] and the probability a random node is susceptible is S(t) = Ψ (Θ (t)) where

Ψ(x)=0e-κ(1-x)ρ(κ)dκ

The difference in the various expected degree models is in how long edges last: they may be permanent, fleeting or finite.

For each system, we sketch the derivation of the equations. A full derivation and comparison with simulations for each model appear in [11]. Our focus in this paper is on understanding how the systems relate to one another rather than details of the derivation. We begin by considering permanent edges.

3.1.1 Mixed Poisson

In the Mixed Poisson (MP) model, the population is assumed to be static, so that if a contact ever exists, then it has always existed and will always exist. The relevant flow diagram is in figure 4. We set ΦS, ΦI, and ΦR to be the per-unit κ probabilities of having a susceptible, infected, or recovered neighbor that has not transmitted. For the MP model, Θ = ΦS + ΦI + ΦR. We can solve for ΦS = Ψ′(Θ) / Ψ′(1), and show that ΦR = β(1 − Θ) / γ from which we can find ΦI in terms of Θ. So Θ̇ = −βΦI becomes an equation just in terms of Θ. Putting this all together, the governing equations are

Fig. 4.

Fig. 4

The MP flow diagram. (Left) edge quantities: ΦS, ΦI, ΦR, and 1− Θ are the per-unit κ probabilities of having a neighbor which is susceptible, infected and has not transmitted, recovered and did not transmit, or which has transmitted respectively. (Right) individual quantities: S the probability of being susceptible, I the probability of being infected, and R the probability of being recovered.

Θ.=-βΘ+βΨ(Θ)Ψ(1)+γ(1-Θ)R.=γI,S=Ψ(Θ),I=1-S-R

3.1.2 Mean Field Social Heterogeneity [expected degree formulation]

We now consider the opposite limit in which edges are fleeting. The Mean Field Social Heterogeneity (MFSH) model [1,7,8,13,18] generalizes the MA model by allowing for variations in contact rate among the people. At any given time the node is expected to have κ edges, but they change over rapidly. The relevant flow diagram is shown in figure 5. This introduces some new variables, ΠS, ΠI, and ΠR which are the probabilities a new contact is with a susceptible, infected or recovered individual. Unlike the MP case, ΘΦS + ΦI + ΦR. Through some simplifications similar to the MP case, we find ΠR = γ (1 − Θ) / β, so ΦI = ΠI = 1 − Ψ′ (Θ) / ψ′ (1) − γ(1 − Θ) / β. This leads to a similar system of governing equations

Fig. 5.

Fig. 5

The MFSH flow diagram (for the expected degree formulation). (Left) the variables ΦS, ΦI, and ΦR are the per-unit κ probabilities for an individual to be connected to a susceptible, infected, or recovered node. However, because the edges are fleeting, these change over quickly, and are thus always equal to Pi;S, Pi;I, and Pi;R. The variable 1 − Θ remains the per-unit κ probability of having received infection. (Center) The variables ΠS, ΠI, and ΠR are the proportion of contacts at any given time which are made by susceptible, infected, or recovered individuals respectively. (Right) The variables S, I, and R remain the proportion of individuals who are susceptible, infected, or recovered.

Θ.=-β+βΨ(Θ)Ψ(1)+γ(1-Θ)R.=γI,S=Ψ(Θ),I=1-S-R

These equations differ in only one term from the MP equations.

The only difference in the assumptions of the MFSH model and the MA model is that the MA model assumes all contact rates are the same. Indeed, if all expected degrees are the same κ, then Ψ (Θ) = exp[− κ (1 − Θ)]. We set R = γ (1 − Θ) / β, I = 1 − SR, and S = Ψ (Θ). We first note that with this Ψ (Θ), we find Θ̇ = −βI. Setting β̂ = βκ and taking the time derivatives of S and R, we see that = −β̂IS and = γI. Thus we have arrived at the MA equations.

More generally we expect that if the variation in contact rate is sufficiently small, the MFSH model should behave like the MA model. In Section 3.3.3 we discuss this further.

We note that in [11], the final size derivation for this model has a sign error. Setting Θ̇ = 0, and solving, we find Θ=βγ(-1+Ψ(Θ)Ψ(1))+1. Solving this equation and setting R(∞) = 1 − Ψ (Θ) gives the final size.

3.1.3 Dynamic Variable-Degree

In the Dynamic Variable-Degree (DVD) model, an individual may create or terminate edges at any time. A node with expected degree κ creates edges at rate κη. Any existing edge breaks at rate η, so a node with expected degree κ will on average have κ edges, though the value fluctuates. The flow diagram in figure 6 is similar to the previous diagrams. We can use the flow diagram to find a differential equation for ΦI, which can be solved in terms of ΠR. However, we cannot analytically solve for ΠR in terms of Θ, so we require additional equations. The governing equations are

Fig. 6.

Fig. 6

DVD flow diagram. The variables have the same meanings as in the MFSH case, but because edges have nonzero duration, we must include the possibility that a partner changes status while an edge exists. Note that ΦS = ΠS = Ψ′(Θ) / Ψ′ (1). The fluxes in and out of ΦI lead to a differential equation which can be integrated to give an expression for ΦI in terms of Θ and ΠR. This leads to a differential equation for Θ in terms of Θ and ΠR.

Θ.=-βΘ+βΨ(Θ)Ψ(1)+γ(1-Θ)+η(1-Θ-βγΠR),Π.R=γΠI,ΠS=Ψ(Θ)/Ψ(1),ΠI=1-ΠS-ΠR,R.=γI,S=Ψ(Θ),I=1-S-R.

The new variables ΠS, ΠI, and ΠR give the probabilities that a newly formed edge will connect to a susceptible, infected, or recovered node respectively.

If the changeover rate is sufficiently fast we anticipate that the model should reduce to the MFSH model. This statement can be made precise by considering a susceptible node u. The node u is constantly creating new edges and breaking existing edges. Its risk of infection depends on how many infected neighbors it has. If edges are long-lasting, then knowing that u has not been infected suggests that its neighbors may still be susceptible. However, If an edge to an infected neighbor is likely to break before the neighbor transmits, then knowing that u is still susceptible says little about current neighbors. In mathematical terms this means if η / β is large, we anticipate that the DVD model reproduces the MFSH model. This is made more precise in the SI.

In the opposite limit, we would expect that having small η leads to an effectively static network, so the MP model should result. In fact this is true, but the precise condition is somewhat more subtle than might be anticipated. It is not enough that η / β and η / γ be very small because the epidemic can last for many generations. In practice the static model will work well at early times, but may fail at later times as the contact structure accumulates changes. We set t0 to be a time early enough that the number of infections by time t0 is very small, but large enough that shortly thereafter the total number of infections is no longer negligible. At some later time t the MP model will be reasonable if η (tt0) is small. A more precise condition that the MP model is reasonable if η(ΠI + ΠR) / r ≪ 1 where r is the early exponential growth rate is also described in the SI.

We show convergence of the DVD model to the MP and MFSH heterogeneity model as η → 0 or η → ∞ in figure 7. The technical mathematical details showing this convergence are in the SI.

Fig. 7. Convergence of DVD to MP and MFSH models.

Fig. 7

We consider the DVD model with varying values of η. The probability density function for the expected degree is ρ(κ) = eκ / [exp(10) −1] for κ ∈ (0, 10) and 0 otherwise. This yields Ψ (x) = [1 − exp(− 10(1 − x))] / [10(1 − x)]. We take β = 0.2 and γ = 1. As η decreases (top), the MP model results, while as η increases (bottom) the MFSH model results. Note the difference in axes from top to bottom.

3.2 Actual degree models

For our second class of models, we assume that each node has an actual degree k assigned using the probability mass function P (k). The degree must be a non-negative integer. Each individual is given k stubs, and at any time those stubs may join in pairs with stubs of other nodes to form edges. In most models we assume that the stubs are always in pairs (though the partner may change), so the degree is k. In the Dormant Contact model, we allow stubs to be active or dormant, and thus take km [distributed according to P (km)] to be the maximum degree of a node, with ka and kd the active and dormant degrees respectively ka + kd = km. In all of these models, we use θ(t) to denote the probability that a stub has not transmitted infection from a neighbor to its node by time t, and φS, φI, and φR to denote the probabilities that a stub has not transmitted infection and currently connects to a node of the given status. The probability a node with a given k is susceptible is θk, and taking a weighted average over all k, we find S(t) = ψ (θ (t)) where

ψ(x)=kP(k)xk

As above, we sketch the derivations of the equations. Full details of the derivations and comparison with simulation are in [11]. We begin again with permanent contacts.

3.2.1 Configuration Model

The Configuration Model (CM) networks are similar to the MP networks. In a CM network, the exact degree of an individual is assigned. A node is given k stubs, assigned using the probability mass function P (k). Once all stubs are assigned to nodes, stubs are randomly paired into edges. The resulting network is static. We define θ to be the probability that the neighbor along a random stub from u has not transmitted infection to u (so 1 − θ is the probability the neighbor has transmitted). Using the flow diagram in figure 8, we find that

Fig. 8.

Fig. 8

The flow diagram for the CM model. (Left) edge quantities: The variables φS, φI, and φ are the probabilities that a neighbor of a given node has not yet transmitted to that node and has the given status. The variable θ = φS + φI + φR is the probability that the neighbor has not yet transmitted. We can solve for φI in terms of θ, which leads to a simple differential equation for θ in terms of θ. (Right) individual quantities S, I, and R are the proportions in each state.

θ.=-βθ+βψ(θ)ψ(1)+γ(1-θ)R.=γI,S=ψ(θ),I=1-S-R

This is similar to the MP model. In fact, it is possible to show that a MP network is a special case of the CM networks. In MP networks, a node u with expected degree κu has its actual degree chosen from a Poisson distribution of mean κu (in the limit of a large network). Thus in an MP network the probability a node has degree k is P(k)=0[e-κκκ/k!]ρ(κ)dκ. Thus ψ(x)=kP(k)xk=0e-κ(κkxk/k!)ρ(κ)dκ=Ψ(x), and so the MP model emerges as a special case of the CM model.2

3.2.2 Mean Field Social Heterogeneity [actual degree formulation]

In the actual degree version of Mean Field Social Heterogeneity (MFSH), each individual has some number of stubs k assigned independently of other individuals. Stubs change edges quickly so that the neighbor at any given time has no bearing on who the neighbor is later. We use the flow diagram in figure 9. The new variables πS, πI, and πR represent the proportion of all stubs that belong to susceptible, infected, or recovered nodes. A newly formed edge connects a node to a susceptible, infected, or recovered node with probabilities πS, πI, and πR respectively. We can find that π̇R = −γθ̇ / βθ from which we can find πR in terms of θ. This gives πI in terms of θ which allows us to find a differential equation for θ in terms of θ. The governing equations are

Fig. 9.

Fig. 9

The MFSH flow diagram (for the actual degree formulation). (Left) the variables φS, φI, and φR denote the probabilities that a stub has not yet transmitted infection to an individual and is currently connected to a partner of a given type. These sum to θ, the probability the stub has not yet transmitted infection. Each of these is θ times the corresponding πS, πI or πR. (Center) the proportion of stubs that belong to susceptible, infected, or recovered individuals or equivalently, the probability a new edge is with an individual of each type. (Right) The proportion of the population in each state.

θ.=-βθ+βθθψ(θ)ψ(1)-γθlnθR.=γI,S=ψ(θ),I=1-S-R

Using techniques similar to those for the expected degree formulation of the MFSH model, we can show that if all degrees are the same, then this reduces to the MA model. Similarly, if the degrees are sufficiently close to the mean degree in the sense that 〈K4〉 / 〈K4 → 1 and both γ and βK〉 are fixed, then the solution again converges to that of the MA model. In section 3.3.1 we show that in fact this model is equivalent to the expected degree formulation of the MFSH model.

3.2.3 Dynamic Fixed-Degree

In the Dynamic Fixed-Degree (DFD) model a node is given k stubs, which are paired with stubs of other nodes into edges. As time progresses, an edge may break, and the freed stubs immediately form edges with stubs from other edges that break at the same time, a process we refer to as “edge swapping”. The rate any edge breaks is η. From figure 10, the resulting equations are

Fig. 10.

Fig. 10

The DFD flow diagram. (Left) the arrows at the top of the diagram represent breaking edges. The total rate at which stubs that have not yet transmitted infection break is ηθ. The proportions of these that then join up with individuals of a given type is proportion to the corresponding π variable. We cannot find a simple relation giving φI, so our equations cannot simplify as much as in previous models.

θ.=-βφI,φ.S=-βφIφSψ(θ)ψ(θ)+ηθπS-ηφS,φ.I=βφIφSψ(θ)ψ(θ)+ηθπI-(β+γ+η)φI,π.R=γπI,πS=θψ(θ)ψ(1),πI=1-πR-πS,R.=γI,S(t)=ψ(θ),I(t)=1-S-R.

Unlike previous models, these equations cannot be simplified into a single equation for θ̇.

The DFD model plays the same role in the actual degree case that DVD model played in the expected degree case. It experiences similar limiting behavior. If η / β is large, we recover the MFSH model. Alternately the CM is an accurate approximation so long as η (tt0) is small where t0 is a time around when the epidemic begins to infect significant numbers. Again, a more precise condition that η (πI + πR) / r ≪ 1 where r is the early exponential growth rate is described in the SI.

Figure 11 shows the convergence of this model to the CM and MFSH models as η → 0 or η → ∞.

Fig. 11. Convergence of DFD to CM and MFSH models.

Fig. 11

We consider the DFD model with varying values of η. The degrees are k = 6, k = 8, and k = 10, with probability 1/3 each. This yields ψ(x) = (x6 + x8 + x10)/3. We take β = 0.2 and γ = 1. As η decreases (top), the CM model results, while as η increases (bottom) the MFSH model results. Note the difference in axes from top to bottom.

3.2.4 Dormant Contacts

We finally move to the Dormant Contact (DC) model which captures all the previous expected and actual degree models as limiting cases. In the DC model, each node is given km stubs [with km chosen using P (km)]. However, only a fraction of them are active. At any given time, the node will have ka active stubs and kd dormant stubs, so km = ka + kd is the maximum number of active stubs. Active stubs become dormant at rate η2 and dormant stubs become active at rate η1. We define ψ(x) = Σkm P(km)xkm. Using figure 12, the governing equations are

Fig. 12.

Fig. 12

The flow diagram for the DC model. Now when a stub ceases to be in an edge, it enters a dormant phase until it finds a new partner. The new variable φD represents the probability a stub has not yet transmitted infection to an individual and is dormant. The new ξ variables represent the proportion of all stubs which belong to nodes of a given type and are part of edges (i.e., are active). We set ξ = ξS + ξI + ξR to be the proportion of stubs that are active. The π variables are now the proportion of stubs which belong to nodes of a given type and are available to form edges (i.e., are dormant). We set π = πS + πI + πR to be the proportion of stubs that are dormant. The probabilities a new edge is with an individual of each type are πS / π, πI / π and πR / π. Again, we are not able to significantly simplify the resulting equations.

θ.=-βφIφ.S=-βφIφSψ(θ)ψ(θ)+η1πSπφD-η2φSφ.I=βφIφSψ(θ)ψ(θ)+η1πIπφD-(η2+β+γ)φIφ.D=η2(θ-φD)-η1φDξ.R=-η2ξR+η1πR+γξI,ξS=(θ-φD)ψ(θ)ψ(1),ξI=ξ-ξS-ξRπ.R=η2ξR-η1πR+γπI,πS=φDψ(θ)ψ(1),πI=π-πS-πRξ=η1η1+η2,π=η2η1+η2R.=γI,S=ψ(θ),I=1-S-R

The new variable φD represents the probability that a stub has not transmitted infection to its node and is currently dormant. The variables πS, πI, and πR now measure the proportion of all stubs which are both dormant and belong to a susceptible, infected, or recovered node, with π = πS +πI +πR = η2 / (η1 +η2) the probability a stub is dormant. The ratios πS / π, πI / π, and πR / π give the probabilities that a newly formed edge connects to a susceptible, infected, or recovered node. The variables ξS, ξI, and ξR give the probabilities that a stub is active and belongs to a susceptible, infected, or recovered node, with ξ = ξS + ξI + ξR = η1 / (η1 + η2) the probability a stub is active.

It is relatively straightforward to see that if η1 is much larger than η2, then the proportion of time a stub is dormant is tiny, π ≪ 1. Consequently at any moment a node is expected to have ka = (1 − π)kmkm active contacts. As η2 / η1 shrinks, this approximation improves. So in this limit, the DC model should reduce to the DFD model with edges breaking at rate η2. Indeed, in the equations above, if we take η = η2 and assume η1η, then φD is negligibly small, so the φD terms drop out of the φS and φI equations. The values of πS and πI both go to zero as η1 grows, but a little more care shows that πS / π approaches ξS and πI / π approaches ξI and the ξ variables solve the same equations in this limit as the π variables in the DFD equations. Thus the DC equations reduce to the DFD equations in this limit. More details are in the SI.

In the opposite limit, if η2 is large compared to η1, then stubs spend most time dormant, ξ ≪ 1. If the number of stubs is sufficiently large, the expected number of active stubs κ = Inline graphic[ka] = ξkm will not be negligible. The variation in the number of active stubs will be significant relative to its expected value κ. However, κkm so the number of dormant stubs kd = kmka can be approximated as kmπ. The rate that dormant stubs become active is η1, so the rate at which the node forms new edges is approximately η1kd = η1kmπ which itself is η2κ, while each existing edge breaks at rate η2. This is the assumption underlying the DVD model. A careful analysis of the equations show that indeed we can reduce them to the DVD model if η2η1. More details are in the SI.

Alternately, we can have the DC model converge to the MFSH model if 〈K〉 → ∞, as long as βK〉 remains fixed. In this case we find that the MFSH model is a good approximation, using β̂ = βξ as the transmission rate. The underlying argument of this is that as 〈K〉 increases, the probability of transmission per edge decreases. Consequently, it becomes unimportant whether the edge has long or short duration because it is incredibly rare for the disease to try to transmit along the same edge twice. The total infectiousness of an individual is then given by the number of active edges times the per-edge infection rate. We can assume that only a proportion ξ of the stubs are active at any time, and we can absorb this into β. So an individual with k stubs causes transmission at rate β̂k where β̂ = ξβ. This is independent of how (or if) the edges are changing in time.

Figure 13 shows how the DC model reduces in the various limits.

Fig. 13. Convergence of DC to DFD, DVD, and MFSH models.

Fig. 13

We take β = γ = 1. (top) We consider the convergence of the DC model to the DFD model as η1 increases. We take ψ(x) = (x2 + x8) / 2. For the DC model we take η2 = 1 and vary η1. As η1 increases, the DC model converges to the DFD model result for η = 1. (middle) We consider the convergence of the DC model to the DVD model as η1 decreases and the degrees increase to compensate. For the DC model, we have ψ(x) = (x1 / ξ + x4 / ξ)/2 where ξ = η1 / (η1 + η2). We use η2 = 1 and vary η1. For the DVD model, we take η = 1 and Ψ(x) = (exp[−(1 − x)] + exp[−4(1 − x)])/2. (bottom) We consider the convergence of the DC model to the MFSH model with η1 = η2 and η1, η2 → ∞. We take ψ(x) = (x2 + x8)/2. The MFSH model has the same ψ, but uses βξ = β/2 as the transmission rate. Note that for all three calculations, the η1 = 1 curves are the same.

3.3 Further Analysis of Mean Field Social Heterogeneity models

In this section we show that the two MFSH models are equivalent. We also show that all the non-Mass Action models described here reduce to the MFSH model if the average number of contacts is large, and the probability of infection per edge scales like 1 / 〈K〉. Technically, our results show that there is no difference between the results of any of the (non-MA) models in this limit, so any of these models would be appropriate, but generally the equations of the MFSH model are simplest so we use it.

3.3.1 Equivalence of Mean Field Social Heterogeneity formulations

There are two formulations of the MFSH model, one with expected degrees and the other with actual degrees.

In the expected degree formulation, each individual has a given expected degree κ (which varies by individual). On average, at any given moment of time the individual has κ contacts. The exact number of contacts may vary from moment to moment, but how many contacts exist at one moment and who those contacts are with are independent from any other moment. Thus over a short period of time, we may safely assume that the total number of contacts with infected individuals matches the expected number.

In the actual degree formulation, each individual is assigned an actual degree k and given k stubs. At any time each of those k stubs forms an edge with a stub from another node, but who the stub connects to changes rapidly. Again, over a short period of time, we may assume that the number of contacts with infected individuals matches the expected number.

Actual degree formulation as a special case of expected degree formulation

Because contacts are very short, over any time interval, the total contact time is well-approximated based on the expected amount of contact at any given moment. Thus whether the individuals have the same number of contacts at each moment, or whether the amount varies from moment to moment, the effect over any macroscopic time interval is the same. Thus the actual degree model should be a special case of the expected degree model. In the expected degree formulation, the probability that a node with κ expected contacts is susceptible is exp[− κ (1 − Θ)] and in the actual degree formulation the probability that a node with k contacts is susceptible is θk. If κ = k, we expect these to equal, and so we anticipate θ = exp(Θ −1), or Θ = 1 + ln θ. Plugging this into the expected degree equations, we arrive at the actual degree equations. So the two methods are equivalent subject to a change in variables.

Expected degree formulation as a limiting case of the actual degree formulation

The fact that the actual degree formulation leads to the expected degree formulation is based on similar reasoning. The underlying additional idea is that any continuous distribution can be approximated by a sufficiently well-refined discrete distribution. We then need a way to take a well-refined discrete distribution and rescale it so that all the probability is massed at integer values. To do this we make the observation that for the MFSH models, we can multiply every individual’s (expected or actual) degree by L with no impact on the epidemic so long as we also divide the transmission rate by L. The resulting equations remain unchanged.

To make this more precise, consider a given β and ρ(κ) and assume L is large. We approximate the continuous distribution of κ using a discrete distribution where the probability of k / L is k/L(k+1)/Lρ(κ)dκ. Increasing L gives a finer scale approximation. We then multiply every k / L by L, to get P(k)=k/L(k+1)/Lρ(κ)dκ where k is an integer. So long as we divide β by L, the spread of the epidemic on a population with the given P (k) and the rescaled β will closely approximate the epidemic spread in the original expected degree population, with the approximation improving as L → ∞. We find that the actual degree equations converge to the expected degree equations as L → ∞. Full details are in the SI.

As an example we take the uniform distribution from 0 to 10 for κ. So Ψ(x) = (1 − exp[−10(1 − Θ)]) / [2(1 − Θ)]. We initially start with a discrete distribution on the integers 0 through 9 with weight 1 / 10 on each. So our initial ψ (x) is k=09xk/10=(x10-1)/10(x-1) with β = β̂. We then refine the distribution. For given L we have ψ(x)=k=010L-1xk/10L=(x10L-1)/[10L(x-1)], and we take β = β̂ / L. Convergence is shown in figure 14.

Fig. 14. Convergence of actual degree formulation of MFSH to expected degree formulation.

Fig. 14

The actual degree formulation of the MFSH model for the appropriate discrete distribution (described in text) converges to the expected degree formulation of the MFSH model with contact rates chosen uniformly from 0 to 10.

3.3.2 Reduction of all models to Mean Field Social Heterogeneity model at large average degree

One particularly important limit corresponds to people having many contacts, but a low probability of transmitting per contact before recovering. That is, 〈K〉 is large, and β / γ is comparably small. In this limit, we expect that the duration of contact becomes unimportant, because the infection is unlikely to cross an edge more than once so the disease has no way to know how long the edge lasts. Heterogeneity in contact levels may still play an important role. If the heterogeneity is sufficiently small, it is reasonable to expect that the MA SIR model is appropriate.

In section 4.2 we rigorously derive the MFSH model from the DC model assuming 〈K〉 → ∞ with fixed βK〉 and fixed γ. A similar proof will apply for the other models, or we can simply argue that the DC model reduces to all of the others, and careful attention to detail shows that they inherit this limiting case.

3.3.3 Reduction of MFSH to MA model if 〈K4〉 / 〈K4 → 1 and βK〉 fixed

It is straightforward to show that the MFSH model becomes the MA SIR model if every node has the same contact rate. However more generally, we would expect that if the contact rates are sufficiently close together, the model should behave like the SIR model. In fact this holds if 〈K4〉 / 〈K4 → 1 with βK〉 fixed. Intuitively, if the number of nodes with higher or lower contact rates is very small, their contribution to the spread of the disease is not significantly different from the contribution of an average node, and so the disease should spread as if the contact rate were homogeneous. We do not have a good intuitive explanation for why the precise condition relies on 〈K4〉 / 〈K4. However, we prove it rigorously in section 4.3.

We expect this case to be particularly relevant if the average degree is large and infectiousness is low. Combining this with the previous result, we conclude that all models converge to the MA model under appropriate conditions as 〈K〉 → ∞.

4 Rigorous proof of convergence to Mass Action equations

In this section, we give a rigorous proof of one of the most significant results. Namely, if 〈K〉 is large, but 〈K4〉 / 〈K4 is approximately 1 and βK〉 and γ are small compared to 〈K〉, then regardless of which model we use, the result is well-approximated by the MA equations.

We first show some examples, and then provide the details needed to prove the result for the DC model. The same result for simpler models can be derived by the same manner (though in some cases the proof will be simpler). For our proof, we assume we have a sequence of populations and diseases indexed by n such that 〈K〉 → ∞, 〈K4〉 / 〈K4 → 1 with ξβK〉 fixed. We show that this converges to the mass action equations using β̂ = ξβK〉.

We do the proof in two steps. First, we show that in the limit of large 〈K〉 and constant βξK〉, the DC equations converge to the MFSH equations. We then show that the MFSH equations converge to the MA equations if additionally 〈K4〉 / 〈K4 → 1.

4.1 Example

As examples, we consider several different population structures using the MP model in figure 16. In the first, we take an exponential distribution of expected degrees. To vary the average degree, we change the decay rate of the exponential.

Fig. 16.

Fig. 16

The expected degree distributions for our examples of convergence as 〈K〉 → ∞, with and without 〈K4〉 / 〈K4 → 1.

ρ(κ)=e-κ/KK

This does not satisfy the conditions that 〈K4〉 / 〈K4 → 1. In the other examples, we take a uniform distribution, with expected degrees chosen uniformly from (〈K〉 − 〈Kα, 〈K〉 + 〈Kα). That is,

ρ(κ)={0κ<K-Kα12KαK-Kα<κ<K+Kα0κ>K+K

This satisfies the condition for any α < 1. Using 〈 4〉 to denote the average of the 4th power of the expected degree3, we have 〈4〉 = 〈K4 +2 〈K2+2α + 〈K4α / 5. If α = 1, then 〈K4〉 / 〈K4 does not approach 1. The distribution would not make sense for α > 1 since some nodes would have negative expected degree.

For all distributions, we take β = 2 / 〈K〉 and γ = 1. The corresponding MA model has β̂ = 2. In figure 17 we plot the results for the exponential distribution. This distribution does not satisfy the conditions, and we see that the solutions do converge, but not to the MA model. In figure 18, we take the uniform distribution from 〈K〉 − 〈Kα to 〈K〉 + 〈Kα. We again see that when the conditions are not satisfied (the exponential distribution and the distribution with α = 1), the solutions may still converge, but not to the corresponding MA model. However, when the conditions are satisfied, the solutions converge to the MA model.

Fig. 17. Sample convergence as.

Fig. 17

K〉 → ∞. We use the exponential distribution described in figure 16. The distribution of expected degrees decays exponentially and 〈K4〉 / 〈K4 does not approach 1 as 〈K〉 grows. The solutions appear to converge, but they do not converge to the MA SIR model.

Fig. 18. Sample convergence as.

Fig. 18

K〉 → ∞. We use the uniform distribution described in figure 16. In the first, second, and third rows, we use α = 1, 0.85, 0.5 respectively. The condition fails for α = 1 and the solutions do not converge to the MA model, but they converge to the MA model for smaller α.

4.2 DC to MFSH

We now begin our proof that the DC model converges to the MA model. We begin by showing that it converges to the MFSH model. We will use the following result several times in the proof: Assume g(t) is a function of time and

g.=-c(t)g+f(t)

If c(t) ≥ C > 0 for all time and |f| ≤ F, and further |g(0)| ≤ F / C, then for all time |g(t)|F / C. To see this, assume that f and g are positive. If g is increasing beyond F / C, then −cg + f must be negative, violating the assumption that g is increasing. Similar results hold for negative values of f and g. Our proofs skip some of the intermediate steps. Complete details of all the algebra is in the SI.

We want to prove that if degrees increase while the infection rate simultaneously decreases, with γ constant, then we must arrive at the Mass Action equations. We do this for the Dormant Contact model. We introduce the notation o(1) to denote a function whose value goes to zero as 〈K〉 → ∞. Our result above implies that if f = o(1) and c is bounded away from 0 as 〈K〉 changes, then g = o(1). We will show that neglecting small terms in the large degree limit the DC model becomes

θ.=-βθζR.=γI,S=ψ(θ),I=1-S-R

where ζ = 1 − θψ′(θ) / ψ′(1) + (γ / β̃) ln θ.

We define ζ = πI + ξI This is the proportion of all stubs which belong to infected nodes. We prove a series of results:

  1. θ = 1 + o(1).

  2. φD = π + o(1).

  3. η1πR = η2ξR + o(1)

  4. φS=ξψ(θ)ψ(1)+o(1).

  5. πI = πζ + o(1) ξI = ξζ + o(1).

  6. φI = ξθζ + o(1)

  7. θ̇ = −β̃ζ + βo(1).

  8. ζ=1-θψ(θ)ψ(1)+(γ/β)lnθ+to(1)

If we drop the error terms in the final two results, then we have the equations governing the Mean Field Social Heterogeneity model in the actual degree case, with β̃ = βξ playing the role of the transmission rate. The existence of the error terms means that at earlier time, the approximation is better, and it can deviate more as time increases. The solution converges uniformly for any t less than any arbitrary chosen value T as 〈K〉 grows. So for large enough 〈K〉, the region over which the MFSH model gives an accurate approximation will include the entire period of the epidemic.

1. θ = 1 + o(1)

To estimate the probability a stub receives at least one transmission, we first estimate the expected number of transmissions a stub receives, which gives a bound on the probability of at least one transmission. Each time a stub receives infection, this corresponds to a transmission from the neighboring node through one of its stubs. So the expected number of inward transmissions equals the expected number of outward transmissions. This is an upper bound on the probability a stub receives an inward transmission, 1 − θ. After some straightforward calculations, we can show that

ββ+η2+γ11-η1η2(η2+γ)(η1+γ)

is a bound on 1 − θ. So long as γ ≠ 0, this is o(1) since β = o(1). Thus θ = 1 + o(1).

If γ = 0, this argument can be salvaged by a cruder bound. At time t after the initial time, the expected number of transmissions is bounded by βt = to(1). So choosing T as described above, we will still be able to prove uniform convergence for tT since θ = 1 + T o(1) = 1 + o(1) if T is held fixed. If T is chosen to be large enough initially, we have uniform convergence during the dynamic phase of the epidemic.

2. φD = π + o(1)

We show φDπ = o(1):

ddt(φD-π)=η2(θ-φD)-η1φD=η2[1+o(1)]-(η1+η2)φD=(η1+η2)π-(η1+η2)φD+o(1)=-(η1+η2)(φD-π)+o(1)

In the next to last step we used the fact that π = η2 / (η1 + η2) so η2 = (η1 + η2)π. So φDπ is at most o(1) / (η1 + η2) = o(1).

3. η1πR = η2ξR + o(1)

We set xS = η1πSη2ξS, xI = η1πIη2ξI, and xR = η1πRη2ξR. Using the previous results we can show that xS = o(1). Since the sum xS +xI +xR equals η1πη2ξ which is zero, we conclude xI = −xR + o(1). Some algebra shows R = −(η1 + η2)xR + γxI = − (η1 + η2 + γ)xR + o(1), from which the result follows quickly.

4. φS=ξψ(θ)ψ(1)+o(1)

We have

ddt[φS-ξψ(θ)ψ(1)]=φ.S-ξθ.ψ(θ)ψ(1)=-η2φS+η1πSπφD-βφIψ(θ)ψ(θ)(φS-ξψ(θ)ψ(1))=-η2φS+η1πψ(θ)ψ(1)-βφIψ(θ)ψ(θ)(φS-ξψ(θ)ψ(1))+o(1)

where we used the fact that πS = φD ψ′(θ) / ψ′(1) and φD = π + o(1) and that ψ′(θ) / ψ′(1) ≤ 1. Since η1π = η2ξ, we have

ddt[φS-ξψ(θ)ψ(1)]=-η2(φS-ξψ(θ)ψ(1))-βφIψ(θ)ψ(θ)(φS-ξψ(θ)ψ(1))+o(1)=-(η2+βφIψ(θ)ψ(θ))(φS-ξψ(θ)ψ(1))+o(1)

Since ψ″(θ) / ψ′(θ) is nonnegative, the coefficient of φSξψ′(θ) / ψ′(1) in the final expression is at least η2, so we conclude that φS = ξψ′(θ) / ψ′(1)+o(1).

5. πI = πζ + o(1) ξI = ξζ + o(1)

We first define q = (−φD + πθ)ψ′(θ) / ψ′(1). Since θ = 1 + o(1) and φD = π + o(1), we have −φD + πθ = o(1). Since 0 ≤ ψ′(θ) ≤ ψ′(1), we have q = o(1). We will show that πIπζq = o(1) and so πI = πζ + o(1).

ddt(πI-πζ-q)=-π.S-π.R-πζ.-q.=-ddt(φDψ(θ)ψ(1))-γπI-η2ξR+η1πR-π(-ddt(θψ(θ)ψ(1))-γζ)-ddt((-φD+πθ)ψ(θ)ψ(1))=-γ(πI+πζ-q)+o(1)

So πIπζq = o(1), and since q = o(1), πI = πζ + o(1). A similar proof shows that ξI = ξζ + o(1).

6. φI = ξθζ + o(1)

We will actually show that φI + φS = ξθζ + ξθ2ψ′(θ) / ψ′(1) + o(1). Once this result is shown, we use the fact that φS = ξψ′(θ) / ψ′(1) + o(1) = ξθ2ψ′(θ) / ψ′(1) + o(1) to get our final result. We begin by taking the derivative of φI + φSξ [θζ + θ2ψ′(θ) / ψ′(1)]

ddt(φI+φS-ξ[θζ+θ2ψ(θ)ψ(1)])=φ.I+φ.S-ξ[θ.ζ+θζ.+θ.θ2ψ(θ)ψ(1)+2θ.θψ(θ)ψ(1)]

After some manipulations (see SI) we have

ddt(φI+φS-ξ[θζ+θ2ψ(θ)ψ(1)])=-(η2+γ)(φI+φS-ξ[θζ+θ2ψ(θ)ψ(1)])+o(1)

So φI + φSξ[θζ + θ2ψ(θ) / ψ′(1)] = o(1). Because θ = 1 + o(1) and φS = ξψ′(θ) / ψ′(1) + o(1) it follows then that φI = ξθζ + o(1).

7. θ̇ = −β̃θζ + βo(1)

This step is trivial. Since φI = ξθζ + o(1), we have

θ.=-βφI=-βθζ+βo(1)

8. ζ=1-φψ(θ)ψ(1)+(γ/β)lnθ+to(1)

We make the observation that ζ.=-ddt[θψ(θ)/ψ(1)]-γζ. We have

ζ.-ddt[1-θψ(θ)ψ(1)+(γ/β)lnθ]=-ddt[θψ(θ)ψ(1)]-γζ+ddt[θψ(θ)ψ(1)]-γθ.βθ=-γζ-γθ.βθ

From our previous result we have ζ = −[θ̇ + βo(1)] / β̃θ. Substituting this in we have

ζ.-ddt[1-θψ(θ)ψ(1)+(γ/β)lnθ]=-γθ.βθ-γβo(1)βθ-γθ.βθ=o(1)

Our result follows immediately.

So the DC model reduces to the actual degree formulation of the MFSH model. Because the expected degree formulation contains the actual degree formulation as a special case, it suffices now to prove that the expected degree formulation converges to the MA model in the appropriate limit.

4.3 MFSH to MA

Our proof that the MFSH model converges to the MA model does not require that 〈K〉 → ∞, but it does require 〈K4〉 / 〈K4 → 1. However, in order to conclude that the other models approach the MA model, we need the conditions of the previous section to hold as well.

We first make the observation that if 〈K4〉 / 〈K4 → 1, then 〈K2〉 / 〈K2 → 1 as well. By Jensen’s inequality, we have 〈K2 ≤ 〈K2〉 and K2K4. So 1=K2/K2K2/K2K4/K2=K4/K41. So 〈K2〉 / 〈K2 is bounded from below by 1 and from above by something converging to 1.

We first prove two lemmas. We define s1 and s2 by

s1=1KΨ(Θ)Ψ(1)-Ss2=1KΨ(Θ)-S

We show s1 and s2 go to zero. We have

limns1=limn|Ψ(Θ)K2-S|=limn|0(e-κ(1-Θ)-S)(κ2-K2)ρ(κ)dκK2|

(see SI for details). We now use the Cauchy-Schwarz inequality to bound this. We have

|0(e-κ(1-Θ)-S)(κ2-K2)ρ(κ)dκK2|0(e-κ(1-Θ)-S)2ρ(κ)dκ1/20(k2-K2)2ρ(κ)dκ1/2K20(κ2-K2)2ρ(κ)dκ1/2K2

where we use the fact that |exp[− κ (1 − Θ)] − S| ≤ 1 to show that the first term in the numerator of the first equation is at most 1. All that remains is to expand the numerator and bound it.

0(κ2-K2)2ρ(κ)dκ1/2K2=0(κ4-2κ2K2+K22)ρ(κ)dκ1/2K2=|K4-K22K4|1/2

But our assumption is that as n → ∞, both 〈K4〉 / 〈K4and 〈K2〉 / 〈K2 go to 1. Thus this goes to zero as n → ∞. This completes the proof that s1 → 0.

To show that s2 → 0 is similar.

s2=|1KΨ(Θ)-S|=|0(e-κ(1-Θ)-S)(κ-K)ρ(κ)dκK|0(e-κ(1-Θ)-S)ρ(κ)dκ1/20(κ-K)ρ(κ)dκ1/2KK2-K21/2K

and as n → ∞, this tends to 0 as well.

We have our bounds on s1 and s2, so we are now ready to complete the proof that the Mean Field Social Heterogeneity model converges to the Mass Action model as n → ∞.

We define ΠI = − Θ̇ / β = 1 − Ψ′(Θ) / Ψ′(1) − γ (1 − Θ) / β. In [11] we saw that this represents the probability that a new neighbor is infected. We expect ΠI to be very close to I and set y = ΠII. Then

y.=Π.I-I.=-Θ.Ψ(Θ)Ψ(1)+γΘ./β-[-S.-R.]=-Θ.Ψ(Θ)Ψ(1)+γΘ./β-[-Θ.Ψ(Θ)-γI]=βΠIΨ(Θ)Ψ(1)-γΠI-βΠIΨ(Θ)+γI=β(I+y)Ψ(Θ)Ψ(1)-γy-β(I+y)Ψ(Θ)=βK(I+y)[(S+s1)-(S+s2)]-γy=β^(I+y)(s1-s2)-γy

We know that I + y is at most 1, and s1 and s2 both tend to zero as n increases, so we have approaches −γy as n → ∞. Note also that at early time ΠI and I are both very close to 0 so y begins as a very small number. It is straightforward to conclude that y can never grow larger than the maximum value of s1 which tends to zero as n → ∞. Consequently, y → 0 as n → ∞.

We finally have

S.=-βΠIΨ(Θ)=-β^ΠI(S+s2)=-β^IS-β^yS-β^ys2

Since y and s2 both tend to 0 as n → ∞, we have = −β̂IS. This means that we have the Mass Action model in the limit.

5 Discussion

We have shown a number of relations between the various edge-based compartmental models for infectious disease spread originally derived in [11]. We used this to develop a flow chart (figure 1) which can guide the choice of appropriate model for a given population.

We note that while the edge-based compartmental models allow us to capture effects that were previously inaccessible through analytic techniques, there are still many effects that are not captured by the models considered here. Our flow chart does not address these. It is always prudent to consider the disease and population to ensure that the assumptions of our models are not strongly violated. A number of adaptations of the mass action SIR models exist for populations in which the disease has multiple stages, or the population has important substructures. In our next paper [12] we show that the general edge-based compartmental modeling approach can be used to accommodate many of these effects.

There are two interesting open (and related) questions which we call attention to. Firstly, we showed that if 〈K〉 → ∞, 〈K4〉 / 〈K4 → 1 with βK〉 fixed, the models converge to the Mass Action model. However, there are many cases where 〈K4〉 / 〈K4 approaches some other value, and our calculations appear to converge. It would be interesting to identify what the relevant reduced equations are in this limit. Secondly, it is perhaps surprising that the condition for convergence to the Mass Action model depends on 〈K4〉 / 〈K4 → 1 rather than 〈K2〉 / 〈K2 → 1. It can be shown analytically that if 〈K2〉 / 〈K2 → 1 but 〈K4〉 / 〈K4 Inline graphic1 then the early exponential growth rate is higher than the Mass Action model. Our numerical calculations suggest that there is an early phase that deviates from Mass Action, but after that phase the solution is indistinguishable. The early phase becomes shorter as 〈K〉 grows. So we believe that there is pointwise convergence although not uniform convergence. The underlying explanation for this is that in this limit there is a vanishingly small proportion of nodes with very high degree. They are quickly infected in an epidemic and then removed from the active population, leaving behind a population that follows the Mass Action dynamics. We have not investigated either of these questions in any detail.

Supplementary Material

285_2012_572_MOESM1_ESM

Fig. 15. Convergence to MFSH.

Fig. 15

(top) Convergence of the Expected Degree models assuming a uniform distribution from 0 to 2 〈K〉. (bottom) Convergence of the Actual Degree models assuming a third each with 〈K〉 / 2, 〈K〉, and 3 〈K〉 / 2 stubs. The solid curves correspond to 〈K〉 = 1 (top) and 〈K〉 = 2 (bottom). The dotted curves correspond to various larger values of 〈K〉. Because β decreases to balance the increase in 〈K〉, the MFSH curves are the same for all 〈K〉.

Acknowledgments

JCM was supported by 1) the RAPIDD program of the Science and Technology Directorate, Department of Homeland Security and the Fogarty International Center, National Institutes of Health and 2) the Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard School of Public Health under Award Number U54GM088558 from the National Institute Of General Medical Sciences. EMV was supported by NIH K01 AI091440. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute Of General Medical Sciences or the National Institutes of Health.

Footnotes

1

It is worth noting that in the expected degree formulations the average fourth power of the degree and the average fourth power of the expected degrees are not equal, but if the ratio with 〈K4 tends to 1 for either, it does for the other as well.

2

In finite networks, there are some correlations between degrees that are different between the models, but this disappears in the large network limit.

3

Our condition that 〈K4〉 / 〈K4 → 1 applies equally for the fourth power of actual or expected degree since at leading order they are the same.

Contributor Information

Joel C. Miller, Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard School of Public Health; Fogarty International Center, NIH; & Departments of Mathematics and Biology, Penn State University

Erik M. Volz, Department of Epidemiology, University of Michigan, Ann Arbor

References

  • 1.Anderson Roy M, May Robert M. Infectious Diseases of Humans. Oxford University Press; Oxford: 1991. [Google Scholar]
  • 2.Bollobás B, Janson S, Riordan O. The phase transition in inhomogeneous random graphs. Random Structures and Algorithms. 2007;31(1):3. [Google Scholar]
  • 3.Britton T, Deijfen M, Martin-Löf A. Generating simple random graphs with prescribed degree distribution. Journal of Statistical Physics. 2006;124(6):1377–1397. [Google Scholar]
  • 4.Chung F, Lu L. Connected components in random graphs with given expected degree sequences. Annals of Combinatorics. 2002;6(2):125–145. [Google Scholar]
  • 5.van der Hofstad Remco. Critical behavior in inhomogeneous random graphs. Random Structures and Algorithms. To be published. arXiv:0902.0216v2 [math.PR] [Google Scholar]
  • 6.Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Royal Society of London Proceedings Series A. 1927 Aug;115:700–721. [Google Scholar]
  • 7.May Robert M, Anderson RM. The transmission dynamics of human immunodeficiency virus (HIV) Phil Trans R Soc Lond B. 1988;321:565–607. doi: 10.1098/rstb.1988.0108. [DOI] [PubMed] [Google Scholar]
  • 8.May Robert M, Lloyd Alun L. Infection dynamics on scale-free networks. Physical Review E. 2001 Nov;64(6):066112. doi: 10.1103/PhysRevE.64.066112. [DOI] [PubMed] [Google Scholar]
  • 9.Meyers Lauren Ancel. Contact network epidemiology: Bond percolation applied to infectious disease prediction and control. Bulletin of the American Mathematical Society. 2007;44(1):63–86. [Google Scholar]
  • 10.Miller Joel C. Epidemic size and probability in populations with heterogeneous infectivity and susceptibility. Physical Review E. 2007;76(1):010101(R). doi: 10.1103/PhysRevE.76.010101. [DOI] [PubMed] [Google Scholar]
  • 11.Miller Joel C, Slim Anja C, Volz Erik M. Edge-based compartmental modelling for infectious disease spread. Journal of the Royal Society Interface. 2012;9(70):890–906. doi: 10.1098/rsif.2011.0403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Miller Joel C, Volz Erik M. Edge-based compartmental modeling with disease and population structure. 2011 submitted. available at http://arxiv.org/abs/1106.6344.
  • 13.Moreno Y, Pastor-Satorras R, Vespignani A. Epidemic outbreaks in complex heterogeneous networks. The European Physical Journal B-Condensed Matter and Complex Systems. 2002;26(4):521–529. [Google Scholar]
  • 14.Mossong Jöel, Hens Niel, Jit Mark, Beutels Philippe, Auranen Kari, Mikolajczyk Rafael, Massari Marco, Salmaso Stefania, Tomba Gianpaolo Scalia, Wallinga Jacco, Heijne Janneke, Sadkowska-Todys Malgorzata, Rosinska Magdalena, John Edmunds W. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Medicine. 2008;5(3):381–391. doi: 10.1371/journal.pmed.0050074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Newman Mark EJ. Spread of epidemic disease on networks. Physical Review E. 2002;66(1):016128. doi: 10.1103/PhysRevE.66.016128. [DOI] [PubMed] [Google Scholar]
  • 16.Newman Mark EJ. The structure and function of complex networks. SIAM Review. 2003;45:167–256. [Google Scholar]
  • 17.Norros I, Reittu H. On a conditionally poissonian graph process. Advances in Applied Probability. 2006;38(1):59–75. [Google Scholar]
  • 18.Pastor-Satorras Romualdo, Vespignani Alessandro. Epidemic spreading in scale-free networks. Physical Review Letters. 2001 Apr;86(14):3200–3203. doi: 10.1103/PhysRevLett.86.3200. [DOI] [PubMed] [Google Scholar]
  • 19.Salathé Marcel, Kazandjiev Maria, Lee Jung Woo, Levis Philip, Feldman Marcus W, Jones James H. A high-resolution human contact network for infectious disease transmission. Proceedings of the National Academy of Sciences. 2010;107(51):22020–22025. doi: 10.1073/pnas.1009094108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Volz E, Frost SDW, Rothenberg R, Meyers LA. Epidemiological bridging by injection drug use drives an early HIV epidemic. Epidemics. 2010 doi: 10.1016/j.epidem.2010.06.003. [DOI] [PubMed] [Google Scholar]
  • 21.Volz Erik M. SIR dynamics in random networks with heterogeneous connectivity. Journal of Mathematical Biology. 2008;56:293–310. doi: 10.1007/s00285-007-0116-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Volz Erik M, Meyers Lauren Ancel. Susceptible–infected–recovered epidemics in dynamic contact networks. Proceedings of the Royal Society B: Biological Sciences. 2007;274(1628):2925–2933. doi: 10.1098/rspb.2007.1159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Volz Erik M, Miller Joel C, Galvani Alison, Meyers Lauren Ancel. Effects of heterogeneous and clustered contact patterns on infectious disease dynamics. PLoS Comput Biol. 2011 Jun;7(6):e1002042. doi: 10.1371/journal.pcbi.1002042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wallinga Jacco, Teunis Peter, Kretzschmar Mirjam. Using data on social contacts to estimate age-specific transmission parameters for respiratory-spread infectious agents. American Journal of Epidemiology. 2006;164(10):936. doi: 10.1093/aje/kwj317. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

285_2012_572_MOESM1_ESM

RESOURCES