Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Jul 3;139:110077. doi: 10.1016/j.chaos.2020.110077

Spreading of infections on random graphs: A percolation-type model for COVID-19

Fabrizio Croccolo a,, H Eduardo Roman b
PMCID: PMC7332959  PMID: 32834619

Abstract

We introduce an epidemic spreading model on a network using concepts from percolation theory. The model is motivated by discussing the standard SIR model, with extensions to describe effects of lockdowns within a population. The underlying ideas and behaviour of the lattice model, implemented using the same lockdown scheme as for the SIR scheme, are discussed in detail and illustrated with extensive simulations. A comparison between both models is presented for the case of COVID-19 data from the USA. Both fits to the empirical data are very good, but some differences emerge between the two approaches which indicate the usefulness of having an alternative approach to the widespread SIR model.

Keywords: SIR Model, Random graphs, Critical percolation, Monte Carlo simulations

1. Introduction

The study of epidemics spreading in human populations has a long history both on the mathematical aspects (see e.g. [2], [4], [14]), as well as on the modelling of outbreak and control of their evolution [1], [6], [11], [12], [20]. Spreading phenomena has been studied extensively also in the realm of statistical physics of disordered systems (e.g. [3], [5], [17]).

In this paper, we introduce a lattice network model for epidemics spreading based in part on concepts taken from percolation theory. To motivate the network approach to spreading, we first discuss the SIR model [14], also extending it to encompass the effects of lockdowns mimicked by using a time decaying reproduction number. To assess the usefulness of the lattice model, we consider COVID-19 data from the USA and compare the results of the simulations with SIR predictions, both in the presence of lockdowns.

The paper is organized as follows: We start out in Section 2 with a brief review of the SIR model, with emphasis on some analytical results and its extension to the description of lockdown effects. Illustrative examples are shown, together with a motivation for the need of going beyond a ‘mean-field’ approach. The network model is then discussed in detail in Section 3, and the percolation ideas, relevant to the present case, are discussed. Extensive (Monte Carlo) simulations are shown to illustrate the advantages and difficulties typically associated with such lattice models. In Section 4, we apply it to the current case of COVID-19 USA data, by comparing the network results with the predictions of the SIR model. The paper ends with our concluding remarks in Section 5.

2. SIR models with extensions to lockdown effects

Infection spreading is typically modelled by the SIR model, introduced in [14]. We briefly review it for the purpose of introducing the notation and presenting extensions to describe lockdown effects. We follow standard terminology in epidemic literature1

We consider a population with a fixed number of individuals, N. To describe the outbreak of an infectious disease, the population can be divided into the following four categories: Susceptible (S), Infected (I), Recovered (R) and Dormant (D). The number of individuals in each of the first three categories depends on time t, so that we will indicate them as S(t), I(t) and R(t), while the number of Dormant D remains constant during the whole process. Susceptible subjects are initially healthy but can become infected; infected people carry the infection and can transmit it to susceptible ones, while recovered people are infected who have healed or have died, thus they do not spread the infection any further. The fourth category, D, includes non-susceptible subjects (perhaps immune individuals to the specific disease), and in general healthy subjects who do not get in contact with infected ones during the whole spreading process (Fig. 1 ). The reason for introducing the fourth category will become clearer when discussing solutions of the SIR system of equations.

Fig. 1.

Fig. 1

Structure of a population of N individuals before the outbreak of the disease: Susceptible, NN0, versus non-susceptible or dormant, N0. The latter are assumed to be inaccessible to the infection and being disseminated uniformly within the population. Although this classification is apparently superfluous within a SIR approach, it becomes useful for spreading phenomena on networks.

At any time t, the number of individuals in each category must obey the conservation equation,

S(t)+I(t)+R(t)+N0=N, (1)

where we have denoted N0=D. We assume the unit of time to be one day.

The SIR model is described by a set of three differential equations for the time variation of the number of individuals in each category, S(t), I(t) and R(t), which, upon taking into account the condition Eq. (1), read,

dSdt=βS(t)IN=β¯S(t)INeff, (2)
dIdt=βI(t)SNγI(t)=β¯I(t)SNeffγI(t), (3)
dRdt=γI(t), (4)

where Neff=NN0=S(t)+I(t)+R(t), is the effective number of individuals taking part in the process, β is the infection (or contact) rate between infected and susceptible subjects, β¯=βNeff/N is the effective infection rate in the presence of dormant, and γ is the healing (or immunization) rate of infected. It is convenient to work with normalized quantities, s(t)=S(t)/Neff, i(t)=I(t)/Neff, and r(t)=R(t)/Neff, so that the SIR equations become,

dsdt=β¯s(t)i(t), (5)
didt=(β¯s(t)γ)i(t)=(R¯0s(t)1)γi(t), (6)
drdt=γi(t). (7)

The above equations have exactly the same form as in the case N0=0. The idea of considering explicitly a fraction of the whole population not taking part in the spreading phenomenon, f0=N0/N (0 ≤ f 0 ≤ 1), allows us to interpret the so-called reproduction number, R 0, as composed of two factors, a purely ‘biological’ one,  ~ β/γ, and a ‘structural’ one (1f0), denoted as, R¯0=(β/γ)(1f0). The second factor represents the effect of dormant individuals non-in-contact with others, thus ‘hindering’ or slowing down the spreading process. The fraction f 0 can change in time, but for simplicity we assume it constant. Within the realm of the SIR model dormant people do not seem necessary, however, they play a prominent role within the context of network models of infection spreading, as we will discuss in detail in Section 3. More generally, dormant subjects can be considered as those individuals who interact very weakly with others, thus representing a subset of the population being in a sort of ‘quarantine’ from others.

One can derive some general relations between the categories by considering ratios between the SIR differential equations (Eqs. ((5), (6), (7))). First, divide (6) by (5), yielding,

dids=1+γβ¯s,i(t)i(0)=s(0)s(t)+1R¯0log(s(t)/s(0)), (8)

and dividing (5) by (7), we find,

dsdr=β¯sγ,log(s(t)/s(0))=R¯0(r(t)r(0)). (9)

By specifying the conditions, i(0)=1/Neff, s(0)=11/Neff, and i()=0, Eq. (8) becomes,

s()=1+1R¯0log(s()/s(0)), (10)

which can be written as,

s()=s(0)eR¯0(1s()). (11)

Notable limits are, s()=s(0) when R¯00, and s()=0 when R¯0. Also Eq. (6) admits a partial solution when di/dt=0, in particular at t=tpeak, i.e. at the peak of the infected curve, yielding,

s(tpeak)=1R¯0,andi(tpeak)+r(tpeak)=11R¯0. (12)

Lockdown effects can be modelled using an exponential time dependence of β (see e.g. [15]). Here, we employ a softer decay that appears to work very well, i.e.

β¯(t)=β¯(τ0t)q,tτ0, (13)

and β¯(t)=β¯, for t ≤ τ 0, where τ 0 is the time at which lockdown starts, and q > 0 is a parameter. This means we deal with a time decaying reproduction number, R¯0(t)=β¯(t)/γ. Using this form in Eq. (6), we can obtain the time t lock at which the infected curve displays its new maximum. The condition is, R¯0(tlock)s(tlock)=1, which, together with Eq. (12), yields,

tlock=τ0(s(tlock)s(tpeak))1/q>τ0, (14)

since s(t lock) > s(t peak), as there are more susceptible subjects in the presence of lockdowns than otherwise. Illustrative examples are reported in Fig. 2 , for R¯0=16.5 (with β=1.1 and γ=1/30) and f0=1/2.

Fig. 2.

Fig. 2

(Upper panel) SIR model for β=1.1,γ=1/30,R0=β/γ=33 and f0=N0/N=0.5, yielding β¯=0.55 and R¯0=16.5. The case f0=N0=0 (dashed lines) is shown for comparison. (Lower panel) SIR model with distancing effects for β¯(t)=β¯(τ0/t)q, with τ0=18 and q=2. The dashed lines represent the solution without lockdowns, and are shown for comparison.

In the upper panel, we show the standard case, with γ representing the inverse of a typical COVID-19 healing period. We show also for comparison the case f0=0 (dashed lines). In this case, Eq. (11) predicts s(∞)≅0, clearly consistent with the numerical data. As is apparent from the upper panel of Fig. 2, the effect of f 0 > 0 is to shift the infected peak at longer times by keeping a still high reproduction number. The effects of lockdowns, using relation Eq. (13), are displayed in the lower panel of the figure, for the typical cases τ0=18 days and q=2. Notice that Eq. (14) predicts tlock180.66/0.0660 days, in agreement with the numerical results, and the infected peak is reduced by a factor of about 4.

Let us return to Eq. (11). In the language of random graph theory [4], [9], [10], Eq. (11) is formally equivalent to the self-consistent relation for the fraction of nodes, P1s(), belonging to the giant component of random graphs with a finite mean node degree  < k > , given by,

P()=(1ekpP()), (15)

where 0 < p < 1 is the probability of occupancy of a link between two nodes. It can be shown that a giant cluster exists when kp=c>1. This model is also related to the mean-field theory of random spin glasses with finite coordination number (see e.g. [13]). Clearly, P(∞) → 1 if c → ∞, while for c1=ε, with 0 ≤ ε ≪ 1 we find,

P()=2pc1c2,whenc1+. (16)

The correspondence with Eq. (11) is achieved if we take R¯0c and assume s(0)=1, which is the case since s(0) ≈ 1 in our spreading model. Using 1s()=r(), we find,

r()=2R¯01R¯02,whenR¯01+. (17)

This correspondence is actually not surprising since the SIR equations are valid in a mean-field sense, where fluctuations and correlations among the categories are neglected. This analogy suggests us that we should go beyond mean-field theory by studying infectious spreading on a network where correlations can be implemented. This is done in the following Section.

3. Spreading phenomena on random graphs: Percolation concepts

Tracing infected people in a population and how they move is essential to make an accurate assessment of the extent a virus has spread in a region, country or the whole world, in order to implement effective lockdowns in each particular place (see e.g. [7]). Here we discuss a simple network model defined on a two dimensional square lattice. The sites of the lattice represent individuals belonging to one of the four categories (S, I, R, D), which we will distinguish with different colours in the plots, i.e. green, red, blue, and yellow, respectively.

Two individuals are said to be connected, i.e. transmission of the disease can occur, if they are nearest-neighbours (NN) on the lattice, representing a ‘short-range’ contact interaction. The NN choice is done just for convenience, and it can be relaxed in other versions of the model. The bonds between sites represent therefore the links in the graph, and the coordination number of 4 gives the maximum node degree, under ‘static’ conditions. The latter means that the individuals are considered to be at rest in their lattice locations, initially, while the virus can move around from site to site if the following rules are obeyed: (1) The virus can cross a bond from an infected site to a susceptible one; (2) no virus transmission occurs otherwise; (3) infected sites heal after τ H days, becoming recovered sites, so that they can neither infect others nor being infected again (immune sites); and (4) dormant sites do not participate in the spreading process.

It is essential to consider additional links ‘dynamically’ as the spreading goes on. This is done in order to describe those individuals who move around for different reasons. Thus, any infected site can reach sites that are not NN to it, and the infection can spread according to the above rules. In this case, the infection is transmitted with a probability βL=t0/τL, with τ L > t 0, and t0=1 day, provided the target site is a susceptible one.

In summary, we have taken a two dimensional lattice to facilitate the visualization of the network, and considering both NN transmissions as well as long-range ones, though with a lower probability. Since the extra links are not determined from the beginning, but are added dynamically, we do not show them in the plots, facilitating the identification of the categories. Another reason for choosing a square lattice is that the percolation threshold for site percolation (in this case the susceptible sites) is about 0.6, meaning that if we take say, f0=1/2, there are not percolating (‘infinite’) clusters of susceptible subjects on the lattice. This ‘hindering’ effect is useful when implementing lockdowns, since the remaining susceptible clusters (of connected NN sites) are disconnected from each other (they are indeed ‘finite’). One can say that ‘long-range’ links can connect different susceptible clusters, which otherwise would remain disconnected.

We show in Fig. 3 and 4 results of simulations for a single configuration, without and with lockdowns, respectively, on a (100x100) lattice for times t ≤ 100 days. Fixed boundary conditions are employed. The starting configuration has a random distribution of either susceptible or dormant sites, chosen with probability f0=1/2. The initial conditions include a single infected site right at the centre of the lattice, and no recovered subject. We keep track of the existing infected sites, each carrying a clock that starts ticking when the site gets infected. After a time τ H it becomes recovered (immune). Death sites are not implemented, but they can be estimated simply as a fraction of recovered ones.

Fig. 3.

Fig. 3

The graph of connected individuals used in the simulation. Each site of the (100x100) square lattice represents an individual belonging to one of the four categories: (S) Susceptible (green), (I) Infected (red), (R) Recovered (blue), (D) Dormant (yellow). Panels: (Upper left) Starting configuration (t=0) for f0=0.5 with S=5042,I=1,R=0 and D=4957; (Upper right) t=50; (Lower left) t=60; (Lower right) t=70. The model parameters are: τI=2 (transmission time) and β=1/τI=1/2,τH=30 (healing time) and γ=1/30, yielding R0=β/γ=15, and τL=10 (long range transmission time) yielding βL=1/10. Times are expressed in days. The average node degree for the starting configuration is k=2, while additional links are added dynamically as the network evolves in time. The newly created links yield an additional mean degree Δk=1110/50430.22, corresponding to an effective mean node degree keff=2.22. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Fig. 4.

Fig. 4

Same as in Fig. 3 in the case of lockdowns: Panels: (Upper left) Starting lockdowns (t=20); (Upper right) t=50; (Lower left) t=60; (Lower right) t=70. The model parameters are: τ0=20 (start of lockdowns) and τL= (no long range transmissions) yielding βL=0. The average node degree for the starting configuration is k=2, while additional links are added dynamically until t=τ0. The newly created links yield an additional mean degree Δk=156/50430.03, corresponding to an effective mean node degree keff=2.03.

We count the number of dynamical links generated during the spreading, from which we can determine, a posteriori, the effective mean node degree,  < k eff > , in our network. We find  < k eff > ≅2.22 without lockdowns (Fig. 3) and  < k eff > ≅2.03 with lockdowns (Fig. 4), indicating an effective reproduction number Reff=keff11. It turns out that the relatively small reduction of the mean node degree in the presence of lockdowns is sufficient to reduce the number of infected subjects considerably, as one can see from the very different structure of recovered clusters from both figures. The time evolution of the three categories are displayed in Fig. 5 , and look qualitatively similar to those from the SIR model in Fig. 2. We should mention that a single lattice simulation takes few seconds on a typical laptop, even for lattices of size (400x400), allowing to obtain accurate mean values by averaging over several configurations if required.

Fig. 5.

Fig. 5

Time evolution of the normalized SIR functions for the lattice network-spreading model. (Upper panel) No lockdowns (Fig. 3). (Lower panel) Lockdowns: tτ0=20 (Fig. 4).

4. Analysis of COVID-19 USA data

As an application of the present ideas we consider COVID-19 USA data (see also [8], [18], [19]), from the point of view of both SIR and network models. In order to do so, and due to the complexity of the data, we need to introduce additional features in particular for the SIR model.

For the USA data (Fig. 6 ), the lockdown regime can be described by the Ansatz, similar to Eq. (13),

β¯(t)=β¯D(τ0t)q(t),tτ0, (18)

where β¯D=(βD/γ)(1f0), with, in general, a time dependent exponent q(t).

Fig. 6.

Fig. 6

Time evolution of COVID-19 in the USA. SIR model (upper panel): N=3106,f0=1/2,β=0.65,γ=1/30,R¯0=9.75; Lockdowns: τ0=36,βD=0.45,q(t)=2.5t/100,d(t)=0.17[0.3+1/(1+t/60)]. Time lag tLag=29. Network model (lower panel): L=400,N=90106,β=0.65,γ=1/30,R¯0=9.75,βL=1/8,k=1.994,Δk=661/799000.008 and keff=2.002; Lockdowns: τ0=20,βD=0.55,βLD=1/32,q=2.5,d=0.06. Time lag tLag=45 for cases and tLag=21 for deaths. Data up to May 25, 2020.

Let us consider first the case of the SIR model (upper panel in Fig. 6). For the latter, we use the decreasing function with time, q(t)=qt/100, with the constant value q=2.5. This feature was needed in order to reproduce the slowly decreasing behaviour of the daily cases (blue circles in Fig. 6). In addition, a rather complicated form for the factor d(t) determining the time evolution of deaths, Deaths=d(t)×R(t), was required. We find that the form d(t)=0.17[0.3+1/(t/60)] reproduces the curve of deaths quite well (black circles in Fig. 6), but the results might be improved using more parameters. This remains to be understood. The whole fitting curves were shifted in time by an amount tLag=29 days, that is, the initial data were actually discarded from the fits.

In the case of the network model (lower panel in Fig. 6), the situation is simpler since all parameters can be taken as constants; the values depending on the regime under consideration. Regarding lockdowns (t ≥ τ 0), the value q=2.5 works rather well, while we take a finite long-range transmission probability βLD=1/32, i.e. 4 times smaller than its value βL=1/8 (t < τ 0), suggesting that indeed the lockdowns are not fully implemented and few additional infected subjects are still moving around. Also the number of deaths can be estimated from the actual recovered ones using a single value d=0.06 (6%). As well as in the case of SIR, also here we used times lags for the fits, i.e. tLag=45 for the cases and tLag=21 for deaths.

5. Conclusions

We have introduced a network model for the spreading of an infectious disease in a population based on random graph and percolation theory concepts. The model is conveniently defined on a square lattice allowing a simple visualization of the four different categories in the problem: Infected, susceptible, recovered and dormant subjects. The first three groups form the core of the widely used SIR model, while the fourth one is introduced here for representing those individuals who are disconnected to some extent from the rest of the population. They do not participate in the spreading phenomena but their presence acts as an effective slowing down of spreading by blocking an otherwise direct transmission between infected and susceptible people. In the language of percolation theory, the ‘connected’ susceptible subjects form finite clusters on the lattice that are separated from each other. To allow the spreading to overcome these ‘connection gaps’ as the process evolves in time, we allow infected people to reach any other susceptible site in the lattice, and infect it with a relatively lower probability than inside a finite cluster via NN contacts. We denote these new links, not present initially in the ‘lattice graph’, dynamical links. This dynamical approach allows us to describe lockdown effects in terms of the slowing down or total lacking of the dynamical links.

We have assessed the performance of the network model by fitting COVID-19 USA data and compared the results with predictions using the SIR model. The network model works very well by just using constant parameters, while the SIR model requires more involved time dependent parameters to achieved similar fitting accuracy. We conclude that the present network model can become a valuable technique to complement the widely used SIR model.

CRediT authorship contribution statement

Fabrizio Croccolo: discussed the results and wrote the paper. H. Eduardo Roman: developed the model, performed the simulations, discussed the results and wrote the paper.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

1

It is widespread usage in epidemiology (see e.g. [16]) to refer to ‘Susceptible’, ‘Infected’ and ‘Recovered’, rather than using longer phrases such as ‘population of susceptible individuals’ or ‘the susceptible category’. Here, we add the ‘Dormant’ category, referring to individuals who temporally do not interact with others.

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.chaos.2020.110077.

Appendix. Supplementary materials

mmc1.zip (22.7MB, zip)
mmc2.zip (31.5MB, zip)

References

  • 1.Anderson R.M., May R.M. Oxford Univ. Press; 1992. Infectious Diseases of Humans: Dynamics and Control. [Google Scholar]
  • 2.Bailey N.T. Tech. Rep. 1957. The mathematical theory of epidemics. [Google Scholar]
  • 3.Ben-Avraham D., Havlin S. Cambridge University Press; 2000. Diffusion and reactions in fractals and disordered systems. [Google Scholar]
  • 4.Bollobás B. Springer Science & Business Media; 2013. Modern graph theory (vol. 184) [Google Scholar]
  • 5.Bunde A., Havlin S. Springer Science & Business Media; 2012. Fractals and disordered systems. [Google Scholar]
  • 6.Chen Y.-C., Lu P.-E., Chang C.-S., Liu T.-H. A Time-dependent SIR model for COVID-19 with Undetectable Infected Persons. 2020 doi: 10.1109/TNSE.2020.3024723. (preprint arXiv: 2003.00122) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Chinazzi M., Davis J.T., Ajelli M., Gioannini C., Gioannini C., Litvinova M., et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science. 2020;368:395–400. doi: 10.1126/science.aba9757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Dong E., Du H., Gardner L. An interactive web-based dashboard to track COVID-19 in real time. The Lancet Infectious Diseases (Elsevier) 2020 doi: 10.1016/S1473-3099(20)30120-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Erdős P. In: Spencer J., editor. MIT Press; Cambridge, MA: 1973. The Art of Counting. [Google Scholar]
  • 10.Erdős P., Rényi A. On the evolution of random graphs. Publ Math Inst Hung Acad Sci. 1960;5(1):17–60. [Google Scholar]
  • 11.Fernández-Villaverde J., Jones C.I. NBER Working Paper 27128. 2020. Estimating and Simulating a SIRD Model of COVID-19 for Many Countries, Sates and Cities. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Fraser C., Riley S., Anderson R.M., Ferguson N.M. Factors that make an infectious disease outbreak controllable. Proceedings of the National Academy of Sciences. 2004;101(16):6146–6151. doi: 10.1073/pnas.0307506101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kanter I., Sompolinsky H. Mean-field theory of spin-glasses with finite coordination number. Phys Rev Lett. 1987;58(2):164. doi: 10.1103/PhysRevLett.58.164. [DOI] [PubMed] [Google Scholar]
  • 14.Kermack W.O., McKendrick A.G. A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society of London Series A, Containing Papers of a Mathematical and Physical Character. 1927;115(772):700–721. [Google Scholar]
  • 15.Palladino A., Nardelli V., Atzeni L.G., Cantatore N., Cataldo M., Croccolo F., et al. Modelling the spread of Covid19 in Italy using a revised version of the SIR model. 2020 preprint, arXiv: 2005.08724. [Google Scholar]
  • 16.Smith D, Moore L. The SIR Model for Spread of Disease. https://www.maaorg/press/periodicals/loci/joma/the-sir-model-for-spread-of-disease-the-differential-equation-model 2001.
  • 17.Stauffer D., Aharony A. CRC Press; 2018. Introduction to percolation theory. [Google Scholar]
  • 18.Wu J.T., Leung K., Bushman M., Kishore N., Niehus R., de Salazar P.M., et al. Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China. Nature Medicine. 2020 doi: 10.1038/s41591-020-0822-7. [DOI] [PMC free article] [PubMed] [Google Scholar]; 1–5 (Nature Publishing Group).
  • 19.Xu Z., Shi L., Wang Y., Zhang J., Huang L., Zhang C., et al. Pathological findings of COVID-19 associated with acute respiratory distress syndrome. The Lancet respiratory medicine (Elsevier) 2020 doi: 10.1016/S2213-2600(20)30076-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Yang Z., Zeng Z., Wang K., Wong S.-S., Liang W., Zanin M., et al. Modified seir and ai prediction of the epidemics trend of covid-19 in china under public health interventions. J Thorac Dis. 2020;12(3):165. doi: 10.21037/jtd.2020.02.64. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.zip (22.7MB, zip)
mmc2.zip (31.5MB, zip)

Articles from Chaos, Solitons, and Fractals are provided here courtesy of Elsevier

RESOURCES