The spread of infectious diseases from a physics perspective

J H V J Brabers

doi:10.1093/biomethods/bpad010

. 2023 Jun 20;8(1):bpad010. doi: 10.1093/biomethods/bpad010

The spread of infectious diseases from a physics perspective

J H V J Brabers ^✉

PMCID: PMC10469146 PMID: 37662617

Abstract

This article deals with the spread of infectious diseases from a physics perspective. It considers a population as a network of nodes representing the population members, linked by network edges representing the (social) contacts of the individual population members. Infections spread along these edges from one node (member) to another. This article presents a novel, modified version of the SIR compartmental model, able to account for typical network effects and percolation phenomena. The model is successfully tested against the results of simulations based on Monte-Carlo methods. Expressions for the (basic) reproduction numbers in terms of the model parameters are presented, and justify some mild criticisms on the widely spread interpretation of reproduction numbers as being the number of secondary infections due to a single active infection. Throughout the article, special emphasis is laid on understanding, and on the interpretation of phenomena in terms of concepts borrowed from condensed-matter and statistical physics, which reveals some interesting analogies. Percolation effects are of particular interest in this respect and they are the subject of a detailed investigation. The concept of herd immunity (its definition and nature) is intensively dealt with as well, also in the context of large-scale vaccination campaigns and waning immunity. This article elucidates how the onset of herd-immunity can be considered as a second-order phase transition in which percolation effects play a crucial role, thus corroborating, in a more pictorial/intuitive way, earlier viewpoints on this matter. An exact criterium for the most relevant form of herd-immunity to occur can be derived in terms of the model parameters. The analyses presented in this article provide insight in how various measures to prevent an epidemic spread of an infection work, how they can be optimized and what potentially deceptive issues have to be considered when such measures are either implemented or scaled down.

Keywords: mathematical epidemiology, Covid-19, SIR-Model, percolation, phase transitions, renormalization

0. Introduction

The Covid-19 pandemic has spurred an enormous scientific interest in the spread of infective diseases and in the evolution of epidemic outbreaks of such diseases. For a long time these subjects had appealed mainly to researchers active in the field of (mathematical) epidemiology and the scientific interest from other disciplines had been somewhat on the modest side. Possibly responsible for the latter is (in part) the enormous progress in medicine, especially during the second half of 20th century, which has provided mankind with effective treatments and prophylactics against a wide variety of severe infections. Large-scale epidemic or even pandemic spread of dangerous pathogens, without a cure being available, was considered a thing of the past by many (in 1969, e.g. Surgeon General William H. Stewart informed the U.S. Congress that it was time ‘to close the book on infectious diseases’ [1]). The general opinion was that chronic diseases would become more dominant and should therefore be the main focus of attention among the scientific community. This drove many decisions on research funding (in favour of research on chronic conditions and diseases). The Covid-19 outbreak has shattered this wide-spread illusion in a most dramatic way however.

By its nature, the subject of epidemic infection outbreaks is one of (at least) substantial complexity. This complexity makes it a highly non-trivial exercise to capture even the most elementary features of the phenomenon in simple mathematical/mechanistic models that can be dealt with by algebraic methods alone. Even one of the earlier, and still frequently used, models for the spread of infectious diseases, known as the SIR-model and proposed by Kermack and McKendrick [2], leads to a set of differential equations that does not allow for an algebraic solution except in the simplest of cases. However, it deals with some essential elements of an epidemic by parting the population in three categories of population members: susceptibles (S) that have not been infected but which are vulnerable to infection, active infections (I) spreading the infection via transmission to susceptibles and removed infections (R) representing those members that have been infected but who are no longer infectious (able to spread the infection). Note that the population is considered to be homogeneous in this context, in the sense that all susceptible population members are equally vulnerable to infection at the beginning of an epidemic. No diversity (heterogeneity) among the population members with respect to differences in susceptibility due to (e.g.) contact patterns, behaviour, age or underlying medical conditions is considered. As such, the entire population is at risk at the start of an outbreak. In its mathematical form introduced by Kermack and McKendrick, to be referred to as the standard SIR-model hereafter, the model is able to reproduce some remarkable features of epidemics like for instance the fact (often observed in real outbreaks) that no matter how easily transmitted the pathogen involved may be, a finite part of a population that falls victim to an epidemic will always remain uninfected (i.e. susceptible). Infection removal turns out to be responsible for this observation. Only when infections are not removed, and each active infection in a population will remain an active infection indefinitely, the entire population gets infected in the end. The standard SIR-model thus combines simplicity with considerable descriptive power, and therefore often serves as a first resort to epidemiologists in the early stages of a new outbreak, especially when the pathogen involved is novel and its properties and characteristics largely unknown (as was the case for Covid-19). However, also the fact that not only the standard SIR-model but also its extended versions require numerical techniques and, depending on the complexity of a particular model, quite the necessary computational power (CPU-time), is a reason to be considered for the fact that mathematical epidemiology is a field with quite some unexplored territories. Only during the last two decades or so, computational power previously only available from main-frame of super computers has become readily available to a wide community of researchers (and some problems in mathematical epidemiology simply do require that power).

Some characteristics of epidemic growth of infections are not dealt with by the standard SIR-model however. When we look closer at the concept of a population from an epidemiological perspective, it is clear that a population actually represents a network of population members (nodes in mathematical terms), with each member being either in contact or involved in some other kind of ‘interaction’ with other members (nodes) in the population. The standard SIR-model does not account for this network structure. It is obvious, however, that this network structure is likely to have an influence on the propagation of an infection through a population. In physical systems consisting of networks or showing network-like structures (such as crystal lattices, porous media, electric circuits, etc.), effects related specifically to the network characteristics of the system are not only common but, actually, the rule. There is no reason to assume that population networks will make an exception in this respect, especially not in the context of infection propagation. In fact, the spread of an infection through a population lattice can be perceived as a physical process analogous to the flow of a liquid through a porous medium or to the (macroscopic) polarization of spins on a lattice or network under the influence of the (microscopic) interactions between the individual spins (for which the Ising model, which we will encounter more than once throughout this article, represents the simplest case [3]). The standard SIR-model, however, treats the spread of infectious diseases in terms of an analogue of the so-called mean-field approaches used in physics to describe systems with collective interactions. The environment of the population members (nodes) of a specific type (S, I, or R) is considered the same for all members of the type and equal to some average over all the members of that type. Local fluctuations and lattice effects are averaged out. This yields a fair approximation in some cases, but generally leads to quantitative and, possibly, even qualitative differences from the exact behaviour of an infection in a given population and for given parameters [especially quantitative risk assessments on the basis of the standard SIR-model alone (for instance when the model is used as a first resort to predict the evolution of the infection rate in the initial phase of an epidemic) should therefore be considered with utmost care and reserve]. Nevertheless, and despite these shortcomings, the standard SIR-model serves as a reference or starting point of a range of variants and more extended models (like the SIS, SIRS, SEIR, or SEIS models [4]), or for even more complex models that account for additional heterogeneities among the population members. However, such models often inherit the above-mentioned shortcomings of the standard SIR-model. In the present article, we will therefore focus on the standard SIR-model as a basis for developing an approach that specifically accounts for the structure and topology of the population network, and therewith allows for the inclusion of the effects of local fluctuations and network correlations. We do this in recognition of the fact that the conceptual shortcomings of the standard SIR-model thus addressed and dealt with also apply to the wider range of more extended compartmental models that are based on mathematical formulations along the same conceptual lines of thought as the one underlying the standard SIR-model. It may be obvious therefore that the method outlined and illustrated in the forthcoming sections may also be of use in the case of these more extended models.

An important phenomenon directly related to physical systems with a network structure is percolation (for a good introduction to the subject, see Ref. [5]). Percolation becomes relevant when larger numbers of nodes in a network are removed (either randomly or according to some spatial distribution function) or become ‘inert’ in the sense that they can no longer pass-on an interaction of some kind. Examples are the removal of spins on an Ising lattice, the replacement of magnetic atoms or ions by nonmagnetic ones in real magnetic systems or the removal of joints in a network of resistors. The essence of percolation is the formation of a single macroscopic cluster of ‘active’ or non-removed nodes that spans throughout the entire population, having a size of the same order of magnitude as the size of the population. Such a cluster is formed when the number of removed or ‘inert’ nodes is sufficiently low. In cases where there are no nodes removed at all, the cluster becomes identical with the population. When a large enough number of nodes is (randomly) removed however, the macroscopic cluster starts breaking up into many smaller isolated clusters, until at some critical value of the removal rate the macroscopic cluster vanishes completely, leaving only so-called secondary clusters of ‘microscopic’ size. The latter is referred to as the percolation transition, which has all the characteristics of a real phase transition from the viewpoint of statistical physics, including universality, scaling laws and critical exponents. Percolation transitions have a huge impact on the properties of real systems. In magnetic systems for instance, they relate to a collapse of the magnetic order (magnetization), whereas in resistor networks they are accompanied by a notable increase in the equivalent resistance of the network. It may be obvious that, by their very nature, percolation phenomena may also play a role in the spread of infectious diseases. There is a conceptual similarity for instance between the (random) replacement of magnetic atoms in a solid by non-magnetic ones and (random) vaccination of susceptible members of a population network. Vaccination with a vaccine that provides full immunity against infection turns a susceptible member of a population into an ‘inert’ member that will not only remain uninfected but, inherently, will also not be able to pass on an infection to another member. The relevance of percolation to the problem of epidemic growth of infections is clear therewith, and has not entirely escaped the attention of both physicists and (mathematical) epidemiologists. As a result, a good deal of literature (a large part of it coming from the mathematical/theoretical physics community) has become available on the topic already (for references specifically dealing with the subject see Refs [6–9] for instance). Despite this, the role of percolation in the spread of an infection through a population network still seems to be somewhat of a ‘niche’ in mathematical epidemiology. For example, algebraic implementations of frequently used mechanistic models (such as the standard SIR-model, but also the more extended models derived from it) do not account for the role of percolation at all, neither explicitly nor implicitly. This may be explained by the fact that a significant part of the reported research efforts dealing with the issue is fairly recent. The reasons for the latter must be sought in the fact that it is notoriously difficult to capture the essence as well as the complexity of percolation phenomena in mathematical formulas: easy as they are to visualize, describing them in mathematical terms is an entirely different matter. Therefore, computer simulations are the most widely used tool for investigating percolation. Monte-Carlo techniques provide a very useful (and frequently deployed) method in this respect (see Ref. [10, pp. 87–94]), since these techniques can simulate the actual process of (random) node-removal and, where appropriate, its evolution with time. They also allow for a comprehensive evaluation of results. However, a disadvantage of Monte-Carlo methods is that they require (very) substantial computational efforts for achieving meaningful results and, correspondingly, a lot of CPU time compared with many other computational exercises. Hence, it was only after (relatively) fast and powerful computing facilities had become readily available on a thus far unprecedented scale during the last few decades, that the possibility of actually using Monte-Carlo techniques became easily accessible to a wider scientific audience. It is therefore not unsurprising that, until recently, the role of percolation in the epidemic spread of infections has attracted only a rather modest level of attention by the wider scientific community, despite the obvious significance of the topic.

Another issue with Monte-Carlo simulations is that the results they generate do not readily provide theoretical insight into the simulated phenomenon. They are experiments in their own right, carried out on a computer instead of in a laboratory, but nevertheless they often require further analysis in the same way data obtained in the real world need to be analyzed before they make sense.

The first aim of this article is to deal with the aforementioned shortcomings of the standard SIR-model and to incorporate the network structure of the population into a more general model. Such a model should thus account for lattice correlations as well as percolation effects. The partition of the population in susceptibles, active infections and removed infections is maintained. Only a fourth partition is introduced as an option, consisting of vaccinated population members who are (partially) immune to infection. However, the purpose of the model is insight, not numbers.¹ It consists of a mathematical framework that incorporates the influence of the network structure through expansion of key physical quantities as a (Taylor) series in the number of cumulative infections. These series expansions replace their simplified (linear) counterparts in the differential equations of the standard SIR-model. This approach leads to formulas that are exact, but only in a strictly formal sense, since the coefficients in the series expansions cannot be calculated ‘ab initio’ in the majority of the cases of interest. Therefore, the framework is less suitable for making accurate predictions on how an existing epidemic in the real world will develop (the results presented in this article make clear however that such predictions are generally highly problematic anyway). Nevertheless, valuable insights can be obtained on the basis of the model, as well as (qualitative) rules of thumb that can be of use during efforts to bring an actual epidemic under control. The model is extensively tested against data provided by Monte-Carlo simulations for both vaccinated and unvaccinated populations.

Another phenomenon addressed extensively in this article is herd-immunity, a concept introduced on the basis of the often made observation that in populations affected by an outbreak, a significant number of population members escape infection, and that after the first major wave of infections has gone through the population and has faded out (after having reached its peak) no subsequent waves will strike that same population again (at least not for quite a while). Among the first to develop a policy based on this kind of observations and to coin the phrase ‘herd-immunity’ was George Potter [11] who, confronted as a veterinarian by an outbreak in 1918 of the bacterial infection Brucella Abortus among cattle, advised farmers not to kill animals befallen by the infection but to leave them in the herd to let them recover, all for the purpose of bringing the outbreak under control therewith. However, herd-immunity has been a slippery subject ever since the early days of its conception. In fact, the concept of herd-immunity is not even well-defined [12]. Within the scientific community there are many definitions circulating. One of the most prominent ones defines herd-immunity as the stage in the evolution of an epidemic where the number of active infections has reached its peak, after which it is dropping down to eventually fade-out. However, the fact that the active infection rate drops does not imply an immediate return to normality. The transmission of the infection to susceptible population members will still go on for a while and new infections will continue to emerge. Another definition of herd-immunity, one that resonates more with intuition, is in a more literal sense and is obtained when we define herd-immunity as the stage in the evolution of an epidemic where the number of active infections has become almost negligible, leaving a population immune to new major (epidemic) outbreaks of the pathogen involved under all circumstances. It is obvious that these two definitions are conceptually related, and therefore probably also related to the same underlying mechanisms, thus illustrating the confusion that surrounds the concept of herd-immunity. As to the underlying mechanisms there is some confusion as well. In the standard SIR-model, herd-immunity (no matter which definition is used of the two definitions mentioned) is a direct result of infection removal. However, the achievement of herd-immunity is often illustrated/explained in a pictorial way on the basis of a (partially immunized) population network in which (fully) immunized members block the routes of the infection towards susceptible members of the population, thus ‘screening’ them from active infections. Obviously, the mechanisms leading to herd-immunity are not entirely clear as well or at least the subject of ambiguity. It does even occur that in the same paper results from the standard SIR-model are quoted (for instance to calculate the herd-immunity threshold), whereas an explanation of herd-immunity is given in terms of the aforementioned pictorial scheme (see, e.g. Ref. [13]). An additional aim of this article is therefore to bring some clarity in these issues. The inclusion, also presented in this article, of the network structure of populations and percolation effects into a generalized SIR-model seems to provide an ideal foundation for such an effort.

Another central aspect of mathematical epidemiology is reproduction numbers, among which the basic reproduction number R₀ has a special status. Reproduction numbers are usually defined as the total number of secondary infections that a single active infection generates from a given moment in time onwards. The basic reproduction number R₀ is a special case here, and represents the total number of secondary infections generated by a single active infection present at the start of an outbreak (t = 0) for the case where the population is fully susceptible at the start of the epidemic. In practice, estimates for reproduction numbers are calculated in many different ways. However, a closer inspection shows that very often such calculations are in fact approximations based on a rather crude translation of the actual concept of reproduction numbers, as it is defined, into an algebraic expression in terms of the parameters of an underlying epidemiological model (such as the SIR-model). For instance, the expression for R₀ on the basis of the standard SIR-model is very often written as $R_{0} = p_{i} / p_{r}$ , where p_i and p_r are the central parameters of the SIR-model, namely the transmission rate (p_i) and the rate of removal, or constant of removal (p_r). The reasoning thereby is that $〈 t 〉 = 1 / p_{r}$ represents the average lifetime of an active infection (p_r has dimension 1/time) and that with a transmission rate p_i (which also has dimension 1/time) the total number of infections generated during the lifetime of an active infection present at t = 0 is simply given by $R_{0} = p_{i} \cdot 〈 t 〉 = p_{i} / p_{r}$ . But what is entirely ignored here is that the number of susceptibles is not a constant during the lifetime of an active infection and may even vary substantially when p_r is small (i.e. $〈 t 〉$ is large) relative to p_i. Since the (temporal) values of reproduction numbers are often considered as indicators for herd-immunity (for instance, the herd-immunity threshold obtained on the basis of the standard SIR-model is often expressed as $1 - 1 / R_{0}$ ), a more detailed discussion about them seems inevitable in the context of this article. The aim is to clarify how exact, non-approximative, expressions for reproduction numbers can be given on the basis of the generalized SIR-model presented here, as well as to see whether (and how) the network structure of a population has an effect on reproduction numbers generally, and what the precise relation is between reproduction numbers and herd-immunity.

This article is primarily written with a readership consisting of physicists and (mathematical) epidemiologists in mind. In line with this objective, much emphasis is laid on analogies between epidemiological phenomena and phenomena in physics. Concepts and notions borrowed from statistical and condensed matter physics make their appearance regularly (though this may not be mentioned explicitly on every occasion). The issues dealt with are addressed therewith from a physics perspective, with the aim of providing a wider viewpoint on the spread of infectious diseases, a topic that not only gained an enormous (instant) relevance when the Covid-19 pandemic broke out, but which will remain of importance in times still to come.

The general outline of the article is as follows. Section 1 presents the generalized SIR-model that accounts for network effects and local fluctuations. Section 2 deals with reproduction numbers and their interpretation. Semi-algebraic solution methods for three relevant approximative examples of the basic differential equations of the generalized SIR-model (presented in Section 1) are discussed in Section 3. In Section 4, the generalized SIR-model is tested against to results of Monte-Carlo simulations of epidemic outbreaks in an SIR context. Section 5 deals with properties and consequences of the model, and with several important insights obtained from it that lay the basis for the more detailed discussion of herd-immunity presented in the next section. Section 6 is entirely devoted to herd-immunity. It addresses the general mechanisms responsible for herd-immunity, the definition of herd-immunity and the herd-immunity threshold. As a logical follow up of Section 6, Section 7 deals with vaccination and vaccine-acquired herd-immunity. Finally, Section 8 gives an in-depth account of the role of percolation effects in the spread of infectious diseases and establishes an interesting link between herd-immunity transitions and phase transitions as they are known from statistical physics. It is also shown how the generalized SIR-model presented in Section 1 is able to account for these percolation effects.

1. SIR-model with network correlations and local fluctuations

We represent the population in which the epidemic is spreading as a network (or lattice) of nodes (lattice points) connected by links (lattice bonds) to other nodes in the network (see Fig. 1). In doing so, we follow a trend that has been ongoing for quite some years already, and which stems from an increased recognition of the profound significance of the network structure of a population to the spread of an infection (see for instance, Refs [8, 14–18]). The nodes in the network represent the individuals belonging to the population, links connecting two nodes the social interaction between the individuals represented by the nodes. As such, the multiple of links connecting a single node to other nodes in the network can be seen as the social network of the individual represented by the central node. It is along these links that the infection is transmitted and the epidemic spreads.

Figure 1: — Schematic representation of a population network. There are three types of nodes: susceptibles (○), active infections ( $•$ ) and removed infections ( $\otimes$ ). The dashed square symbolizes the social network of the node in the centre (central grey dot).

Even the analysis of relatively simple networks and lattices is, in general, a complicated matter however. In most cases, many typical phenomena that may take place in a network, such as cluster formation and percolation transitions, defy an exact (algebraic) treatment, and their full analysis requires numerical methods or even rigorous computer simulations. The purpose of this section, however, is to capture some of the essential features of the spread of an epidemic in a phenomenological algebraic model that offers not only the possibility to obtain semi-quantitative results but, above all, more insight into the mechanisms and phenomena involved.

Following the standard SIR-model [2], we assume three ‘types’ of individuals or nodes: s, susceptible ones (uninfected but vulnerable to infection via social contacts); i, infectious ones (active, transmissible infections) and r, removed infections, that relate to individuals that have either recovered from an infection and acquired (indefinite) full immunity, or individuals that have succumbed to an infection (unlike in real life, there is no difference between these two possibilities from a strictly mathematical viewpoint). The assumption of indefinite (life-long) immunity is quite a strong one and it does in fact not apply in case of a large number of infectious diseases in real-life. Whether acquired by overcoming an infection or through vaccination, immunity tends to decrease with time in these cases (‘waning immunity’), sometimes to such an extent even, that after a while there is no effective immunity left (so that the population members involved thus rejoin the compartment of susceptibles). It is not unrealistic however to assume that an acquired immunity will last at least for the entire duration of an epidemic (or wave of infections) and also for a significant period of time after the fade-out of the epidemic/wave (the latter being of crucial importance to the phenomenon of herd-immunity as we will see later on). In fact there are many practical examples of this [for instance, in connection with the SARS-CoV-2 virus responsible for Covid-19, a recent study of data from 19 countries revealed that for a range of early strains of the virus, a high protection (immunity) against reinfection is obtained for at least a significant period of time after the removal of (i.e. recovery from) an infection [19]].

To avoid unnecessary complications, we assume that each individual keeps contact with the same number ν of other individuals in the population. That is, each node is connected via links to an equal number of other nodes in the network.

As soon as the first active infections occur in the population (either by having been ‘imported’ from outside the population or by any other feasible mechanism) and the epidemic starts spreading, the respective population fractions $s_{s}, s_{i}, s_{r}$ of, respectively, susceptible, infected and removed nodes/individuals start to evolve with time. However, it is easy to see that, by definition:

s_{s} + s_{i} + s_{r} = 1 .

(1.1)

The standard SIR-model is entirely centred around these quantities $s_{s}, s_{i}, s_{r}$ and is based on viewpoints similar to those underlying the mean-field descriptions of thermodynamic phase transitions. Local fluctuations in the environments (social networks) of the individual nodes are neglected (in fact even ignored). The environment of a node (i.e. the nodes linked to a particular node) is supposed to be homogeneous, with each node in the environment being of type s, i or r with a probability given by s_s, s_i and s_r, respectively.

With p_i representing the rate of transmission of infection (per active infection and per susceptible individual) and p_r the rate of infection removal, the following coupled set of (non-)linear differential equations describes the time evolution of the epidemic in the SIR-model:

\dot{s_{i}} = p_{i} s_{i} s_{s} - \dot{s_{r}},

(1.2)

\dot{s_{s}} = - p_{i} s_{i} s_{s},

(1.3)

\dot{s_{r}} = p_{r} s_{i},

(1.4)

where the dotted symbols represent the time derivatives.² We now introduce the cumulative number of infections at a given time:

s = s_{i} + s_{r},

(1.5)

in terms of which s_s can be expressed [via (1.1)] as $s_{s} = 1 - s$ , so that by also using Equation (1.4) we can rewrite Equations (1.2) and (1.3) as:

\dot{s_{i}} = p_{i} s_{i} (1 - s) - p_{r} s_{i},

(1.6)

\dot{s} = p_{i} s_{i} (1 - s) .

(1.7)

The rationale behind the term $p_{i} s_{i} s_{s} = p_{i} s_{i} (1 - s)$ is that a fraction s_i of the population is infected and that each individual contacted by an infected person is susceptible to transmission with equal probability s_s (the fraction of susceptible individuals among the total population). It is mainly in this particular Ansatz that the analogy between mean-field methods in the theory of thermodynamic phase transitions and the SIR-model is rooted (for a brief outline of mean-field methods, see Ref. [20] for instance). However, it is well known that such an approach does not come without significant shortcomings, even in a qualitative sense. Not only the local fluctuations are ignored, but also the correlations that exist between probabilities of finding individual nodes linked to nodes of a certain type. Such correlations nearly always arise and depend on the particular geometry and topology of the lattice or network [the number of surrounding nodes to which each particular node is linked (ν) plays a crucial role in this respect for instance]. This is particularly important in the context of percolation phenomena, which can be seen as a useful paradigm for understanding group- or herd-immunity (as we will see later on).

To obtain an approach that, at least in a formal sense, takes account of local fluctuations and the above-mentioned correlations, we focus on the different type of links (or pairs of nodes) that can be identified in the network. We have links connecting an active infection (i) with a susceptible (s) node, links connecting an active infection with another active infection, links connecting a susceptible node with a removed infection (r) and so on. When the epidemic spreads, the total numbers of these links or pairs in the network ( $n_{x y}, x, y = s, i, r$ ) evolve with time until a stationary (equilibrium) state is reached (marking the end of the epidemic). From simple considerations (adopted from solid-state physics where they are applied to crystal lattices), the following relations between the numbers of pairs of each type and the number ν of social contacts of an individual can be obtained:

2 n_{i i} + n_{i s} + n_{i r} = n ν s_{i},

(1.8)

2 n_{s s} + n_{i s} + n_{s r} = n ν s_{s},

(1.9)

2 n_{r r} + n_{i r} + n_{s r} = n ν s_{r},

(1.10)

where n represents the total number of nodes in the population/network. The idea here is that each node is equally attributed to (divided among) the ν links connecting it to the other nodes in the network. As such, a link connecting a node of type x ( $x = s, i, r$ ) to a node of type y ( $y = s, i, r$ ) accounts for $1 / ν$ of an x-type node and $1 / ν$ of an y-type node [and when x = y for $2 / ν$ of an x-type node). Summing over all the links (which is equivalent to summing over all the pairs of nodes connected via a (single) link] should yield the total number of s-, i- and r-nodes in the network.

We number the nodes of each particular type $x = s, i, r$ from 1 to ns_x. Let $ν_{y x} (l_{x})$ be the number of nodes of type $y = s, i, r$ linked to the l_xth node of type x. We introduce:

< s_{y i} > = \frac{1}{n_{i}} \sum_{l_{i}} ν_{y i} (l_{i}) = \frac{1}{n s_{i}} \sum_{l_{i}} ν_{y i} (l_{i}) .

(1.11)

< s_{y s} > = \frac{1}{n_{s}} \sum_{l_{s}} ν_{y s} (l_{s}) = \frac{1}{n s_{s}} \sum_{l_{s}} ν_{y s} (l_{s}) .

(1.12)

< s_{y r} > = \frac{1}{n_{r}} \sum_{l_{r}} ν_{y r} (l_{r}) = \frac{1}{n s_{r}} \sum_{l_{r}} ν_{y r} (l_{r}) .

(1.13)

The $< s_{y x} >$ represent the average number of y-type nodes linked to an x-type node (where the average is taken over all the nodes of x-type in the network). It is easy to verify that when $x \neq y$ , the numbers of xy-links or pairs in the network are directly related to the averages $< s_{y x} >$ via:

n_{x y} = n s_{x} < s_{y x} > = n s_{y} < s_{x y} >

(1.14)

and when y = x via:

n_{x x} = \frac{1}{2} n s_{x} < s_{x x} >,

(1.15)

where the division by 2 corrects for double-counting x-nodes. By using these identities and dividing out the ns_x, Equations (1.8), (1.9), and (1.10) can be rewritten for those cases where $s_{i}, s_{s}, s_{r} \neq 0$ as:

< s_{i i} > + < s_{s i} > + < s_{r i} > = ν,

(1.16)

< s_{s s} > + < s_{i s} > + < s_{r s} > = ν,

(1.17)

< s_{r r} > + < s_{i r} > + < s_{s r} > = ν,

(1.18)

a result not too surprising in itself.

Transmission of infection may take place only upon contact between individuals with an active infection (i) and a susceptible person (s), i.e. between i-type and s-type nodes in the network (forming an i–s pair). The rate of transmission is therefore proportional to the number of i–s pairs n_is and can be expressed as:

\dot{s} = \frac{p'_{i} n_{i s}}{n},

(1.19)

which may serve as a replacement for Equation (1.7), whereas the rate of change of the active infections can be written as:

\dot{s_{i}} = \frac{p'_{i} n_{i s}}{n} - p_{r} s_{i}

(1.20)

to replace Equation (1.6). The parameter $p'_{i}$ is the rate of transmission per i–s pair. Using (1.11) and (1.14), we can rewrite (1.19) as:

\dot{s} = ({p^{'}}_{i} / n) \sum_{l_{i}} ν_{s i} (l_{i}) = {p^{'}}_{i} s_{i} < s_{s i} >

(1.21)

and (1.20) as:

\dot{s_{i}} = p'_{i} s_{i} < s_{s i} > - p_{r} s_{i} .

(1.22)

It is worth noticing that (1.21) and (1.22) are in fact exact results for infinitely large populations and finite s_i (and as such expected to apply also very well to finite yet sufficiently large populations). They constitute an exact generalization of the standard SIR-model that accounts, at least in principle, for fluctuations and for correlations arising from to the typical network structure of the population.

To demonstrate how the master equations (1.6) and (1.7) of the standard SIR-model relate to their generalization in the form of (1.21) and (1.22) it is useful to focus more closely on the parameters p_i and $p'_{i}$ and to express them in terms of other relevant parameters. For that purpose we introduce the frequency f_cn, which stands for the number of contacts made per node (or individual) per unit of time (i.e. $f_{c n} = 1 / τ_{c n}$ , with τ_cn being the time between 2 successive contacts made by a single node), as well as the transmission probability w_i, which is the probability that the infection is passed on from one person to another upon contact. It is easy to see from its definition implied by Equations (1.6) and (1.7) that:

p_{i} = w_{i} f_{c n}

(1.23)

so that the normalized rate of new infections in the standard SIR-model [Equation (1.7)] can be reexpressed as:

\dot{s} = w_{i} f_{c n} s_{i} (1 - s) .

(1.24)

The total number of pairs (i.e. links) in the network is $n_{p} = ν n / 2$ (each node is connected via ν links, each link is shared by two nodes). Now, let f_cl be the number of contacts per unit time made via a single link. The total number of contacts per unit time made throughout the entire population can now be written straightforwardly as:

f_{c p} = \frac{ν n f_{c l}}{2}

(1.25)

and, since it is easy to see that $f_{c n} = ν f_{c l}$ , alternatively as:

f_{c p} = \frac{n f_{c n}}{2} .

(1.26)

Of all the $n_{p} = ν n / 2$ pairs (i.e. links) in the network, a fraction:

α_{s i} = \frac{n_{s i}}{n_{p}} = \frac{2 n_{s i}}{ν n}

(1.27)

consists of s – i pairs. For the number of s – i contacts per unit time we get:

f_{s i} = α_{s i} f_{c p} = \frac{n_{s i} f_{c n}}{ν} .

(1.28)

Using this result, the normalized rate of new infections in our generalization of the SIR-model is now obtained as:

\dot{s} = \frac{w_{i} f_{s i}}{n} = \frac{w_{i} f_{c n} n_{s i}}{ν n},

(1.29)

which, since $n_{s i} / n = s_{i} 〈 s_{s i} 〉$ [see Equation (1.11)], can be rewritten as:

\dot{s} = \frac{w_{i} f_{c n} s_{i} 〈 s_{s i} 〉}{ν} .

(1.30)

Comparison of (1.21) and (1.30) shows that we can identify $p'_{i}$ as $p'_{i} = w_{i} f_{c n} / ν = p_{i} / ν$ (which is in fact quite a logical result that also follows from the definition of w_i and p_i). In addition, we have from (1.16):

〈 s_{s i} 〉 = ν - (〈 s_{i i} 〉 + 〈 s_{r i} 〉),

(1.31)

so that (1.30) can be reworked into:

\dot{s} = p_{i} s_{i} \frac{〈 s_{s i} 〉}{ν} = p_{i} s_{i} (1 - \frac{〈 s_{i i} 〉 + 〈 s_{r i} 〉}{ν}) .

(1.32)

An alternative (but equivalent) expression for $\dot{s}$ can be obtained by deploying the symmetry relation [see (1.14)]:

n_{s i} = n_{i} 〈 s_{s i} 〉 = n_{s} 〈 s_{i s} 〉

(1.33)

by which we can also write $n_{s i} / n$ as:

n_{s i} / n = s_{s} 〈 s_{i s} 〉 = (1 - s) 〈 s_{i s} 〉 .

(1.34)

Substitution into (1.29) then yields (with $w_{i} f_{c n} / ν = p_{i} / ν$ ):

\dot{s} = \frac{w_{i} f_{c n} (1 - s) 〈 s_{i s} 〉}{ν} = \frac{p_{i} (1 - s) 〈 s_{i s} 〉}{ν} .

(1.35)

And with $〈 s_{i s} 〉$ written as [see Equation (1.17)]:

〈 s_{i s} 〉 = ν - (〈 s_{s s} 〉 + 〈 s_{r s} 〉)

(1.36)

we thus obtain:

\dot{s} = p_{i} (1 - s) (1 - \frac{〈 s_{s s} 〉 + 〈 s_{r s} 〉}{ν}) .

(1.37)

The differential equation relating s_r to s_i remains unchanged in the presence of correlations and is, as before, represented by (1.4). Comparison of Equations (1.32) and (1.7) thus shows that extending the SIR-model by including correlations arising from the typical network structure of the population implies (at least in a mathematical sense) in fact nothing more than replacing the factor $(1 - s)$ in (1.7) by $(1 - \frac{(〈 s_{i i} 〉 + 〈 s_{r i} 〉)}{ν})$ or the factor s_i by $(1 - \frac{(〈 s_{s s} 〉 + 〈 s_{r s} 〉)}{ν})$ . With $\dot{s}$ written as in either (1.32) or (1.37), we thus obtain two equivalent equations to replace (1.6), respectively, given by:

\dot{s_{i}} = p_{i} s_{i} (1 - \frac{〈 s_{i i} 〉 + 〈 s_{r i} 〉}{ν}) - p_{r} s_{i} .

(1.38)

\dot{s_{i}} = p_{i} (1 - s) (1 - \frac{〈 s_{s s} 〉 + 〈 s_{r s} 〉}{ν}) - p_{r} s_{i} .

(1.39)

By arbitrarily combining one of Equations (1.32) and (1.37) with one of Equations (1.38) and (1.39), we obtain a system of two ordinary differential equations by which (in principle) the variation of s and s_i with time is defined [and indirectly, via (1.1) and (1.5), also the variation of s_s and s_r]. However, an actual solution of such a system requires the explicit algebraic form of either $〈 s_{i i} 〉$ and $〈 s_{r i} 〉$ or $〈 s_{s s} 〉$ and $〈 s_{r s} 〉$ to be known. It is for that purpose that we seek an appropriate parameter in terms of which not only the s_i, s_s and s_r but also the averages $〈 s_{x y} 〉$ ( $x, y = i, s, r$ ) can be expressed. That is, we look for an independent quantity in terms of which the entire problem can be parametrized. Strictly speaking, time (t) meets that requirement, but is also an inappropriate/impossible choice since we actually want to solve s_i, s_s and s_r for t. A proper choice however is s. Accounting for the cumulative number of infections, $s = s (t)$ can only increase with time. As a result, s(t) is a bijective function of t (i.e. each t corresponds to a unique value of s). This implies that a parametrization of individual quantities in terms of only s is possible and, moreover, entirely equivalent to a parametrization of those quantities in terms of t (as such, s plays a role similar to that of the state variables in thermodynamic systems).

However, the task of finding a representation of the $〈 s_{x y} 〉$ in terms of s is a tough problem bedevilled with difficulties that also arise in the analysis of Ising problems (or lattice problems in general). Ironically, the root cause of these difficulties is actually the same thing that we seek to incorporate into our present analysis, namely the lattice or network correlations. They very often prevent an easy systematic enumeration of the states of the system that correspond to a specific value of a relevant state variable (as a consequence, the exact solution to the 3D Ising problem is still an open issue for instance).

A pragmatic way out of these difficulties is to consider the exact $〈 s_{x y} 〉$ as series expansions in s:

〈 s_{x y} 〉 = \sum_{k = 0}^{\infty} a_{k}^{x y} s^{k}

(1.40)

to be truncated at will in practice, thus providing us with approximations for the $〈 s_{x y} 〉$ in the form of finite-order polynomials in s. This approach offers an alternative for other methods of describing the influence of the network that have gained popularity over time. One of those methods is based on the assumption that the network of interest is scale free [21, 22], so that the degree-distribution function P(l) of the network asymptotically tends towards a non-integer power law [i.e. $P (l) \sim l^{- γ}$ with $γ \notin N$ and l (the degree of the nodes³) being sufficiently large]. Although of substantial merit in many fields [including epidemiology (for tackling the problem of strong susceptibility heterogeneity [23])], the scale-free hypothesis is nevertheless not without issues to be considered however. The concept of scale-free networks is in fact not only still the subject of intensive research, but also of vivid debate. For instance, the question is still open at present, whether the scale-free property is as universal a feature of (larger classes) of networks as sometimes suggested, and therewith whether scale-free networks are indeed as omnipresent as often believed. More recent research strongly suggests that scale-free properties are not a universal feature of networks in general [24, 25], and that, without a detailed investigation of the network itself, it is not a priori clear whether a particular network is indeed scale-free or not. Moreover, the power law of the degree-distribution function of a scale-free network only applies to sufficiently large values of l. The latter prohibits its use in case of population networks under (very) strict social distancing (or lockdown) measures, when the number of contacts between members has been reduced substantially (sometimes to even a handful only), unless a reasonable assumption about the behaviour of P(l) for low values of l can be made for a specific population network under specific (epidemiological) circumstances. Methods as the one outlined in this article, based on series expansions of the form (1.40), may provide a suitable alternative in this respect, at least in some epidemiological contexts (such as a homogeneous population or populations of limited heterogeneity), since it is (in principle) exact and applies to all population networks, irrespective of their structure and the numbers of connections between the individual nodes in the network.

We now assume a scenario where the initial number of active infections at the onset of an epidemic is finite but almost negligible at the scale of the total population, so that $s_{i} \approx 0$ at t = 0. We furthermore assume that no infections were present before the start of the epidemic, so that there are no recovered and immune individuals at t = 0 and therefore $s_{r} (0) = 0$ (exactly). When considering the number of contacts of a single node to be the same for all nodes and equal to ν (as we did before), we can easily identify the expansion parameters $a_{0}^{s i}$ and $a_{0}^{i s}$ as $a_{0}^{s i} = ν$ and $a_{0}^{i s} = 0$ in this case, so that (see, respectively, (1.31) and (1.36)):

ν - (〈 s_{i i} 〉 + 〈 s_{r i} 〉) = 〈 s_{s i} 〉 = ν + \sum_{k = 1}^{\infty} a_{k}^{s i} s^{k} .

(1.41a)

ν - (〈 s_{s s} 〉 + 〈 s_{r s} 〉) = 〈 s_{i s} 〉 = \sum_{k = 1}^{\infty} a_{k}^{s i} s^{k} .

(1.41b)

Substituting (1.41a) for the terms in brackets, respectively, in (1.38) and (1.32), we get:

\dot{s_{i}} = p_{i} s_{i} [1 + \sum_{k = 1}^{\infty} \frac{a_{k}^{s i} s^{k}}{ν}] - p_{r} s_{i}

(1.42a)

and

\dot{s} = p_{i} s_{i} [1 + \sum_{k = 1}^{\infty} \frac{a_{k}^{s i} s^{k}}{ν}],

(1.42b)

whereas substitution of (1.41b) for the terms in brackets in, respectively, (1.37) and (1.39) yields:

\dot{s_{i}} = p_{i} (1 - s) \sum_{k = 1}^{\infty} \frac{a_{k}^{i s} s^{k}}{ν} - p_{r} s_{i}

(1.43a)

\dot{s} = p_{i} (1 - s) \sum_{k = 1}^{\infty} \frac{a_{k}^{i s} s^{k}}{ν} .

(1.43b)

As to the series expansions in (1.42a,b), it should be emphasized that only cases where $a_{1}^{s i} < 0$ have relevance and ‘physical’ meaning, since generally $〈 s_{s i} 〉$ must decrease with increasing s. This to hold, also when $s ≪ 1$ , specifically requires that $a_{1}^{s i} < 0$ .

Equations (1.42a and b) provide a most insightful example of how correlations and fluctuations enter the mathematics of the problem and into the basic equations of the SIR-model. Comparison with (1.6) and (1.7) shows that the inclusion of correlations and fluctuations into the SIR-model comes down to nothing more than a replacement of the term $(1 - s)$ in (1.6) and (1.7) by a series expansion of the form $(1 + a_{1}^{s i} s / ν + a_{2}^{s i} s^{2} / ν + a_{3}^{s i} s^{3} / ν + \dots)$ . Since the expansion coefficients $a_{k}^{s i}$ are generally expected to be functions of ν, the characteristics of the population network (and more in particular the number ν of social contacts of an individual) thereby enter the problem via the quotients $a_{k}^{s i} / ν$ . Only if the coefficients $a_{k}^{s i}$ were proportional to ν, the influence of ν would cancel out against the ν-dependence of the $a_{k}^{s i}$ . This is generally not the case however, and it is therefore already, that the structures of social networks within a population can be considered as a key factor in the evolution of an epidemic. This is in itself not an entirely new or unexpected insight (in fact it provides the epistemic basis for all practical measures limiting social contacts in order to bring outbreaks of infectious diseases under control). The particular merit of Equations (1.42a,b) however is that they put this already known viewpoint in simple mathematical terms that allow for a better understanding of the phenomena involved. It may be obvious that similar considerations also apply to the expansion coefficients $a_{k}^{i s}$ and the quotients $a_{k}^{i s} / ν$ , and therefore also to Equations (1.43a,b).

2. Reproduction numbers

The main results from the model outlined in this article primarily evolve around the parameters p_i and p_r, with an additional role for the parameters accounting for the network structure of the population. However, the dynamics of the spread of infectious diseases is often described and analysed in terms of different key parameters: the (well-known) reproduction numbers, where a distinction is made between the basic reproduction number R₀ and the effective reproduction number R [26].

Reproduction numbers are defined, within a given setting/context, as the total number of active infections generated (via transmission) by a single active infection in its entire active period. Their concept was introduced in the early 20th century by Sir Ronald Ross in connection with his research on Malaria [27].

The basic reproduction number R₀ applies to the case where the entire (i.e. full) population is susceptible at the start of the epidemic (t = 0) and represents the total number of new active infections generated by a single initial infection in its entire active period (which starts at t = 0). As such, R₀ is a measure for the full epidemic potential of a transmissible pathogen.

Complementary to R₀, the effective reproduction number R applies to situations where, for whatever reason, the population is not fully susceptible (heterogeneous population) at the beginning of an active infections lifespan. One may think in this respect, for instance, of parts of the initial population having some form of immunity (genetic or acquired), or of the progressive depletion of the reservoir of susceptibles that results from the spread of infections during an ongoing epidemic. In the latter case, the effective reproduction number R is given by the total number of new active infections to be generated by a single active infection that comes into existence at a time t > 0 after the start of the epidemic (t = 0), when the reservoir of susceptibles has already been reduced in size by the earlier spread of infections that has taken place since the beginning of the epidemic.

We restrict our discussion to populations which are fully susceptible at t = 0 (except for the initial infections). In such cases, the effective reproduction number R relates, as described in the above, to the depletion of the reservoir of susceptibles (caused by the spread of the infection) as the one and only source of population heterogeneity. As such, the effective reproduction number is time-dependent in these cases: $R = R (t)$ . On the basis of the present model, expressions for R₀ and R can be given in direct accordance with their definitions.

Consider an ensemble of $n_{i} = n_{0}$ active infections at time $t = τ$ . Due to infection removal, the ensemble decays over time according to $\dot{n_{i}} = - p_{r} n_{i}$ so that for $t \geq τ$ :

n_{i} (t) = n_{0} e^{- p_{r} (t - τ)},

(2.1)

where $n_{i} (t)$ represents the remaining number of active infections at an instant t.

The (average) total number of new infections generated after $t = τ$ by an infection active at $t = τ \geq 0$ is thus given by:

R (τ) = \frac{1}{n_{0}} \int_{τ}^{\infty} p_{i} n_{i} (t) \frac{〈 s_{s i} (t) 〉}{ν} d t = p_{i} \int_{τ}^{\infty} e^{- p_{r} (t - τ)} \frac{〈 s_{s i} (t) 〉}{ν} d t

which can be reexpressed as:

R (τ) = - \frac{p_{i}}{p_{r}} \int_{τ}^{\infty} \frac{〈 s_{s i} (t) 〉}{ν} d e^{- p_{r} (t - τ)} .

(2.2)

The difference between the basic and the effective reproduction number mathematically boils down therewith to the value of the lower bound τ of the intervals of integration in (2.1) and (2.2) (i.e. in the context of this discussion). For τ = 0, $R (τ)$ represents the basic reproduction number: $R_{0} = R (0)$ . For $τ > 0, R (τ)$ represents an effective reproduction number.

Introducing:

Q = Q (τ, p_{i}, p_{r}, ν) \equiv - \int_{τ}^{\infty} \frac{〈 s_{s i} (t) 〉}{ν} d e^{- p_{r} (t - τ)},

(2.3)

we can write the identity for $R_{τ}$ given by (2.2) as:

R_{τ} = Q \cdot \frac{p_{i}}{p_{r}} .

(2.4)

The factor Q accounts for the depletion of the reservoir of susceptibles that results from the spread of the infection (i.e. the reduction of s_s and therewith of $〈 s_{s i} 〉$ with increasing s(t)). The basic reproduction number R₀ relates therewith to the special value of $Q = Q (0, p_{i}, p_{r}, ν) \equiv Q_{0}$ for τ = 0, in which case the transmission of the infection starts in a fully susceptible population:

R_{0} = Q_{0} \cdot \frac{p_{i}}{p_{r}} .

(2.5)

In the literature, reproduction numbers are often linked to criteria for epidemic spreading of an infection. For instance, it is often stated that an (exponential) increase in the number of infections will emerge when R > 1 ( $R_{0} > 1$ ), whereas the infection rates will decline and fade-out when R < 1 ( $R_{0} < 1$ ). To illustrate such criteria, a pictorial impression of epidemic evolution is often presented, in which active infections pass on their infection to a total of R susceptibles which, once infected themselves, pass on their infection to another R susceptibles, etc. The result is a ‘tree’ of infections, in which each infection forms a node (branch splitting) from which its infection is passed on along a total of R outgoing branches, so that after N generations (branch splittings) a total of R^N infections can be (indirectly) assigned to a single infection. Although quite illustrative and of considerable educational value, this picture is not entirely correct however. For instance, it ignores the depletion, accounted for by Q, of the reservoir of susceptibles upon progression of the epidemic. This depletion continues to progress for $t > τ$ as well. As such, each generation of new infections will find less susceptibles available to pass the infection on to than the previous generation. Therefore, the frequently drawn image [for the purpose of explaining exponential growth of the number of (cumulative) infections with time] of an infection tree with an equal R-related number of branches ‘growing’ out of each infected node does not give a fully correct representation of what actually happens. Even in the very early stages of the spread of an infection (especially when the social networks of the individual population members are small), an accurate quantitative analysis generally requires the depletion of the reservoir of susceptibles to be taken into account (as will be clearly illustrated later on in Section 8.5), despite the fact that from a qualitative (or semi-quantitative) point of view, early behaviour can actually be described fairly well by stochastic branching processes (see for instance Refs [28, 29]). In Section 5, we will see that, although often taken for granted, also the condition R > 1 ( $R_{0} > 1$ ) for epidemic growth of the number of infections requires serious (re)consideration.

3. Solutions of the differential equations for approximative cases

In their closed (exact) form, the sets of coupled differential equations (1.42a,b) and (1.43a,b) generally require a full numerical solution method. However, by truncating the series expansions down to terms of sufficiently low order, approximate sets of equations can nevertheless be obtained that, on one hand, capture the essentials of the influence of the network structure of the population in both a qualitative and quantitative sense, while on the other allow for a combined algebraic/numerical solution method, such that one of the equations in a set can be solved exactly via algebraic methods, and only the equation in a set thus remaining requires a full numerical approach.

3.1 Second-order polynomial approximation of $〈 s_{s i} 〉$

Truncation of the series expansion of $〈 s_{s i} 〉$ down to the terms of order $\leq$ 2 provides the simplest approximative expression for $〈 s_{i} (s) 〉$ that nevertheless presents an extension of the standard SIR-model and thus provides a very useful ‘toy-model’ for studying some of the fundamentals of the spread of infectious diseases through a population network.

Upon truncation of terms of order >2 Equations (1.42a,b) reduce to:

\dot{s_{i}} \approx p_{i} s_{i} (1 + \frac{a_{1}^{s i}}{ν} s + \frac{a_{2}^{s i}}{ν} s^{2}) - p_{r} s_{i} .

(3.1a)

\dot{s} \approx p_{i} s_{i} (1 + \frac{a_{1}^{s i}}{ν} s + \frac{a_{2}^{s i}}{ν} s^{2}) .

(3.1b)

With $\dot{s_{i}} = \dot{s} (\partial s_{i} / \partial s)$ , we can rewrite (3.1a) as:

\dot{s} (\frac{\partial s_{i}}{\partial s}) = \dot{s} - p_{r} s_{i}

which, upon writing (3.1b) as $\dot{s} = s_{i} P_{i} (s)$ , can be reworked into:

(\frac{\partial s_{i}}{\partial s}) = 1 - \frac{p_{r}}{P_{i} (s)}

(3.2)

with $P_{i} (s)$ representing the right-hand part of (3.1b): $P_{i} (s) = p_{i} (ν + a_{1}^{s i} s + a_{2}^{s i} s^{2}) / ν$ . We thus obtain:

(\frac{\partial s_{i}}{\partial s}) = 1 - \frac{ν p_{r} / p_{i}}{ν + a_{1}^{s i} s + a_{2}^{s i} s^{2}}

(3.3a)

which we rewrite as:

(\frac{\partial s_{i}}{\partial s}) = 1 - \frac{ν p_{r} / a_{2}^{s i} p_{i}}{(s - s_{+}) (s - s_{-})}

(3.3b)

with $s_{+}$ and $s_{-}$ representing the roots (real and complex) of the equation $a_{2}^{s i} s^{2} + a_{1}^{s i} s + ν = 0$ , as given by:

s_{\pm} = \frac{a_{1}^{s i}}{2 a_{2}^{s i}} (- 1 \pm \sqrt{1 - 4 ν a_{2}^{s i} / {(a_{1}^{s i})}^{2}})

(3.4)

which have the rather convenient property that $s_{+} s_{-}$ is given by the very simple expression $s_{+} s_{-} = ν / a_{2}^{s i}$ , which will be of use later.

Equation (3.3b) can be rewritten as:

(\frac{\partial s_{i}}{\partial s}) = 1 - (\frac{ν p_{r} / a_{2}^{s i} p_{i}}{s_{+} - s_{-}}) [\frac{1}{(s - s_{+})} - \frac{1}{(s - s_{-})}]

(3.5)

and its integration is straightforward:

s_{i} = \int 1 - (\frac{ν p_{r} / a_{2}^{s i} p_{i}}{s_{+} - s_{-}}) [\frac{1}{(s - s_{+})} - \frac{1}{(s - s_{-})}] d s

= s - (\frac{ν p_{r} / a_{2}^{s i} p_{i}}{s_{+} - s_{-}}) log (\frac{s - s_{+}}{s - s_{-}}) + C,

(3.6)

where the complex logarithm function is implicated and C the constant of integration. The latter follows from the (initial) condition that $s_{i} = s = s_{0}$ when t = 0. We have:

C = (\frac{ν p_{r} / a_{2}^{s i} p_{i}}{s_{+} - s_{-}}) log (\frac{s_{0} - s_{+}}{s_{0} - s_{-}})

(3.7)

so that (3.6) can be rewritten as:

s_{i} = s - (\frac{ν p_{r} / a_{2}^{s i} p_{i}}{s_{+} - s_{-}}) log (\frac{(s_{0} - s_{-}) (s - s_{+})}{(s_{0} - s_{+}) (s - s_{-})}) .

(3.8)

Using this result, the ODE for s in this case can then be integrated, which requires a numerical procedure, thus concluding the solution of the set of coupled ODEs (1.42a,b) in the present approximation.

3.2 Third-order polynomial approximation of $〈 s_{s i} 〉$

Upon truncation of terms of order >3 in the series expansions in (1.42a,b) we have:

\dot{s_{i}} \approx p_{i} s_{i} (1 + \frac{a_{1}^{s i}}{ν} s + \frac{a_{2}^{s i}}{ν} s^{2} + \frac{a_{3}^{s i}}{ν} s^{3}) - p_{r} s_{i} .

(3.9a)

\dot{s} \approx p_{i} s_{i} (1 + \frac{a_{1}^{s i}}{ν} s + \frac{a_{2}^{s i}}{ν} s^{2} + \frac{a_{3}^{s i}}{ν} s^{3}) .

(3.9b)

The most important reason for evaluating this case is that 3 is the largest polynomial order that offers the possibility of a relatively simple analytical solution of (1.42a). An additional benefit is that the number of model parameters is nevertheless kept within limits, while still allowing for a fair to very good description of the exact s-dependence of $〈 s_{s i} 〉$ and $〈 s_{s i} 〉$ . The solution of (3.9a) can be obtained in a closed algebraic form as a function of s (see Appendix 1a):

s_{i} = s - \frac{ν p_{r}}{a_{3}^{s i} p_{i}} \cdot log (\frac{{(s_{0} - s_{1})}^{a} {(s_{0} - s_{2})}^{b} {(s_{0} - s_{3})}^{c}}{{(s - s_{1})}^{a} {(s - s_{2})}^{b} {(s - s_{3})}^{c}}),

(3.10a)

where $s_{1, 2, 3}$ represent the roots (real and complex) of the third-order polynomial $a_{3}^{s i} s^{3} + a_{2}^{s i} s^{2} + a_{1}^{s i} s + ν = 0$ and the complex logarithm is applied (as in the previous section). The exponents a, b and c [defined by (A1.10) in Appendix 1a] are represented by the identities (A1.11) given in Appendix 1a. Via substitution of this result into (3.9b), the following non-linear ordinary differential equation for s as a function of t is then obtained:

\dot{s} \approx p_{i} (1 + \frac{a_{1}^{s i}}{ν} s + \frac{a_{2}^{s i}}{ν} s^{2} + \frac{a_{3}^{s i}}{ν} s^{3}) [s - \frac{ν p_{r}}{a_{3}^{s i} p_{i}} \cdot log (\frac{{(s_{0} - s_{1})}^{a} {(s_{0} - s_{2})}^{b} {(s_{0} - s_{3})}^{c}}{{(s - s_{1})}^{a} {(s - s_{2})}^{b} {(s - s_{3})}^{c}})],

(3.10b)

which can only be solved numerically.

3.3 Third-order polynomial approximation of $〈 s_{i s} 〉$

In addition, the following set of ODE’s is obtained by truncating terms of order >3 in the series expansions for $〈 s_{i s} 〉$ in (1.43a,b):

\dot{s_{i}} = p_{i} (1 - s) \cdot \frac{a_{1}^{i s} s + a_{2}^{i s} s^{2} + a_{3}^{i s} s^{3}}{ν} - p_{r} s_{i} .

(3.11a)

\dot{s} = p_{i} (1 - s) \cdot \frac{(a_{1}^{i s} s + a_{2}^{i s} s^{2} + a_{3}^{i s} s^{3})}{ν} .

(3.11b)

The benefit of truncating terms of order >3 here is again that 3 is the largest polynomial order that offers the possibility of a relatively simple solution of (1.43b). As shown in Appendix 1b, the following equation can be obtained on the basis of (3.11b):

\begin{array}{l} log {{(\frac{s}{s'})}^{\frac{1}{s_{+} s_{-}}} \cdot {(\frac{(s - s_{+}) {(1 - s)}^{- s_{+}^{2}}}{(s' - s_{+}) {(1 - s')}^{- s_{+}^{2}}})}^{\frac{1}{s_{+} (1 - s_{+}) (s_{+} - s_{-})}} \cdot {(\frac{(s - s_{-}) {(1 - s)}^{- s_{-}^{2}}}{(s' - s_{-}) {(1 - s')}^{- s_{-}^{2}}})}^{\frac{1}{s_{-} (1 - s_{-}) (s_{-} - s_{+})}}} \\ = \frac{a_{3}^{i s} p_{i} (t - t')}{ν} \end{array},

(3.12)

where $s'$ represents the value of s at some arbitrary moment in time $t = t' > 0$ (taking an arbitrary moment in time instead of t = 0 as a reference will be of particular use in Section 7). Solving s(t) for given t directly from this equation cannot be done via algebraic methods and requires a numerical procedure. However, the entire s–t curve can be obtained straightforwardly by using (3.12) to calculate t as a function of s and subsequently swap the axes. The ODE (3.11a) for s_i can then be solved via numerical integration under substitution of the appropriate values for s(t) obtained on the basis of (3.12). This then concludes the solution of (1.43a) and (1.43b) in this particular approximation.

As a general remark, it should be mentioned that solutions for s(t) have ‘physical’ meaning only for $0 \leq s \leq 1$ , and when they describe a situation where $\dot{s} \geq 0$ , since (by definition) the cumulative number of infections cannot decrease with time. As such, physical solutions are confined to the specific interval of s-values where

\dot{s} = p_{i} s_{i} \frac{〈 s_{s i} 〉}{ν} = p_{i} (1 - s) \frac{〈 s_{i s} 〉}{ν} > 0,

(3.13)

that is, where

〈 s_{s i} 〉 \equiv a_{3}^{s i} s^{3} + a_{2}^{s i} s^{2} + a_{1}^{s i} s + ν > 0 \land 〈 s_{i s} 〉 \equiv a_{3}^{i s} s^{3} + a_{2}^{i s} s^{2} + a_{1}^{i s} s > 0 .

(3.14)

4. Simulations

The merits of the extension of the standard SIR-model presented in the previous sections [essentially consisting of the introduction of the series expansions (1.11), (1.12) and (1.13)] can be demonstrated very well on the basis of results from simulations of the spread (in an SIR context) of infections through entire populations.

For the purpose of the aforementioned simulations, populations were considered in the form of a two-dimensional (2D) square lattice, each node of a lattice representing a population member, while the edges of the lattice connect the nodes to their four nearest-neighbours contacts. Contacts were not restricted to nearest-neighbour contacts only however. To simulate the effects of a wider variety of restrictive social measures, nodes could also be considered as being at the centre of a $(2 N + 1) \times (2 N + 1)$ square of nodes representing potential contacts (note that there is always an even number of contacts to a single node in such cases). The values of N could be chosen at will. The limit of $N \to \infty$ was approximated through a simulation where the contacts of a node were selected among all other members of the population. Nodes could be labelled as either susceptible (S), infected (I) or removed (R), in accordance with the SIR context chosen as the epidemiological model or setting. Periodic boundary conditions were applied to guarantee that all nodes have a similar ‘social bubble’, i.e. an equal number of contact links connected to it.

The spread of an infection through a population can basically be seen as a stochastic process of a Markov type [30]. Such processes are particularly suited for simulations on the basis of a Monte-Carlo scheme (see [10, p. 17ff]). With code written in Fortran, the algorithm used here was basically as follows. First of all, the population lattice is brought in its initial state by labelling all population members (nodes) as susceptible (S), except for a fixed number of randomly selected population members (nodes) that will be labelled as infected (I) and serve as initial infections. Random selection of nodes is done by calculating their 2D coordinates on the square lattice on the basis of 2 (pseudo-) random numbers provided by the build-in random-number generator of the compiler. Then the simulation of the actual spread of the infections through the population begins and proceeds in the following way. A member of the population (node) is selected at random. Then a second node is selected in the same way from the nodes in the contact environment of the first node (i.e. from the nodes linked to the first node as its possible contacts). If the first node is labelled infected (I) and the second node is susceptible (S), or vice versa, a pseudo-random random number r is generated and compared with the transmission probability $w_{i} = p_{i} / 2$ (chosen by the user as a constant). If $r \leq w_{i}$ , the infection is passed on to the susceptible node by labelling it as infected as well (instead of susceptible). For the purpose of simulating infection removal, another node (population member) is then selected, again at random. If this node turns out to be labelled as infected, a new random number r is generated and compared with the (user-defined) constant of infection removal p_r. If $r \leq p_{r}$ , the node is relabelled as a removed infection (R). This entire procedure of infection and subsequent removal is repeated a vast number of times.

When such a procedure is applied to each population in a large ensemble of N_e populations that are all in same state, an average number of $N_{t} \approx N_{e} p_{i} s_{i} 〈 s_{s i} 〉$ of infections will be transmitted, whereas an average total of $N_{r} \approx N_{e} p_{r} s_{i}$ active infections will be removed. It is easy to see therewith that successive application of this procedure to a single population simulates the process of infection and removal described by the master equations (1.38) and (1.39).

The spread of the infection can be followed at arbitrary time scales by regularly monitoring the status (S, I or R) of all nodes in the population. The unit of time is itself the subject of a certain arbitrariness as well in this respect. In can be defined as corresponding to a fixed but arbitrarily chosen number of successive contacts made. In the simulations presented throughout this article, the unit of time was taken such that it spans a number of contacts equal to the number of nodes/members in the population. So, in a single unit of time each member of the population makes exactly 1 contact on average.

Figure 2 illustrates the results of a simulation for a case where p_r = 0 and contacts of the nodes were selected throughout the entire population. Every node is a potential contact to every other node therewith. Such cases represent the equivalent of the so-called mean- or molecular-field cases in the theory of (magnetic) phase transitions [31]. They are limiting cases, for which the standard SIR-model represented by Equations (1.6) and (1.7) is actually exact. As such, they make an excellent test case to verify whether the simulation scheme described in the above may be a useful validation tool for models that go beyond the standard SIR-model.

Simulations presented throughout this article were generally carried out on population lattices with a number of nodes typically in the order magnitude of 10⁶. The data presented in Fig. 2 for instance were obtained from a simulation where the population was represented by a 2001 × 2001 square lattice (i.e. n = 4 004 001 nodes). These are quite large population sizes indeed, which comes with the advantage that simulations become less prone to the typical finite-size effects that often complicate the interpretation of Monte-Carlo simulations for systems of relatively small size (see [10, p. 35ff]). It should also be noted that such population sizes are actually quite realistic. A 2001 × 2001 square lattice consists of a number of nodes in the order of the size of the population of a country like Norway, e.g. [32]. Tests by running a simulation multiple times for the same input parameters (s₀, p_i, p_r) consistently showed qualitatively and (within negligible margins for the purpose of this article) quantitatively similar results, free from notable ‘statistical noise’ related to (pseudo) stochastic fluctuations inherent to the Monte-Carlo methods used. This might not have been the case when, for instance, the population size had been chosen too small. Output results based on a single simulation run thus provide useful and representative data on the spread of an infection for a given set of input parameters. The simulation results and the simulated data presented in this article are therefore based on single simulation runs only, since there is no compelling need to use averages over several of such runs.

The simulation data presented in Fig. 2 are in perfect agreement with the standard SIR-model. The dashed curve in the main figure shows the cumulative number of infections vs. time as obtained by solving Equation (1.7) for the same initial conditions and parameters used in the simulation (i.e. $n_{0} = 999$ so that $s_{0} = n_{0} / n = 2.4950 \cdot 10^{- 4}, p_{i} = 2 w_{i} = 1$ , p_r = 0). The solutions of the standard SIR-model are given by a so-called logistic function [33] in this case:

s (t) = s_{i} (t) = \frac{1}{a e^{- p_{i} t} + 1},

where $a = (1 - s_{0}) / s_{0}$ . The simulated data follow the dashed curve remarkably well, and create confidence therewith in the adequacy of the implemented simulation scheme. An even more significant match with the standard SIR-model can be observed in the variation of $\dot{s}$ vs. s shown in the inset of Fig. 2. The standard SIR-model yields $\dot{s} = p_{i} s (1 - s)$ for p_r = 0 [see Equations (1.6) and (1.7)], which is represented by the dotted curve in the inset. The datapoints (▽) show $\dot{s}$ as obtained from a numerical evaluation from the simulated data. The (near) perfect agreement between the simulated data and the standard SIR-model is again obvious. As such, we may conclude that the simulations provide very reliable data for this case. It should also be noted in this respect that the large size of the populations used in the simulations already seems to pay off in the absence of any visual stochastic noise in the simulated data (which smoothly follow the dashed/dotted curves).

Simulations of cases where $p_{r} \neq 0$ confirm the adequacy of the simulation scheme even more. When contacts to a single node are again selected from the entire population, the standard SIR-model applies to these cases as well. Figure 3a and b shows the results of simulations for the same (initial) conditions as the results shown in Fig. 2, except that $p_{r} = 0.5$ instead of p_r = 0. Figure 3a shows the simulated number of cumulative infections n_c and Fig. 3b the number of active infections $n_{i} = n s_{i}$ , in both cases as a function of time. The dotted curves represent the corresponding numerical solutions of the system of differential equations (1.6) and (1.7) for, respectively, n_c and n_i (n.b. remind that $n_{c} = n s, n_{i} = n s_{i}$ ).

Figure 3: — Data obtained from a simulation with $p_{r} \neq 0$ and node contacts selected throughout the entire population network (2D square lattice) for (a) the number of cumulative infections *n_c* as a function of time and (b) the number of active infections as a function of time. Dotted curves: standard SIR-model. Parameters: population size $N_{p} = 2001^{2}$ , transmission probability $w_{i} = p_{i} / 2 = 0.5$ , decay/removal constant $p_{r} = 0.5$ , and number of initial infections $n_{0} = 999$ .

Due to the removal of active infections, not the entire population gets infected during the epidemic in this case,⁴ and the cumulative number of infections will reach a final value $n_{e} < n$ (see the dashed horizontal line in Fig. 3a indicating $n = 2001^{2}$ ).

The simulated data in Fig. 3a and b match the curves given by the standard SIR-model to a high degree of accuracy. We may therefore conclude that not only the stochastic nature of infection transmission, but also the stochastics of infection removal (decay) have been implemented correctly and realistically in the simulation scheme. This is further corroborated in Fig. 4, which shows both $\dot{s} / s_{i}$ and ${\dot{s}}_{i} / s_{i}$ as a function of s, as derived on the basis of the data presented in Fig. 3a and b via a simple numerical evaluation of $\dot{s}$ and ${\dot{s}}_{i}$ ( $s_{e} = n_{e} / n$ indicates the maximum rate of cumulative infections reached). The data thus obtained agree very well with the standard SIR-model (where $\dot{s} / s_{i} = p_{i} (1 - s)$ and ${\dot{s}}_{i} / s_{i} = p_{i} (1 - s) - p_{r}$ ): each set of datapoints obviously follows the straight line that the standard SIR-model predicts for it, especially for mid-range values of s. Only at the very edges of the s-interval that applies, some stochastic noise becomes noticeable. This is due to the fact that both at the beginning as well as at the end of any (real) epidemic, the number of active infections is relatively low and therefore subject to (temporal) fluctuations. The fact that this phenomenon apparently presents itself also in the simulation process deserves attention, since it does not reveal any shortcomings in either the algorithms used in the simulation or their implementation. On the contrary, it is rather to be considered as a realistic artefact of an appropriate simulation of the stochastic processes involved in an actual epidemic.

Figure 4: — Simulated data for $\dot{s} / s_{i}$ (left vertical axis) and ${\dot{s}}_{i} / s_{i}$ (right vertical axis), obtained from the same simulations as the data in Fig. 3. Dotted lines: standard SIR-model [i.e. $\dot{s} / s_{i} = p_{i} (1 - s)$ and ${\dot{s}}_{i} / s_{i} = p_{i} (1 - s) = p_{r}$ (extrapolated to s = 1)]. Dashed vertical line s = *s_e*: maximum s reached during the epidemic. Parameters: the same as for Fig. 3.

However, as mentioned earlier, the standard SIR-model is only a (mean-field like) approximation. Its breakdown comes when the social bubble of the nodes is increasingly reduced from an environment that spans the entire population (in which case the standard SIR-model is exact) to smaller environments that contain only a limited number of nodes serving as contacts. This is clearly illustrated in Fig. 5a and b, which, respectively, show $〈 s_{s i} 〉$ and $〈 s_{i s} 〉$ as a function of $s$ for a series of simulations with p_r = 0, so that the entire population becomes infected in the end and s varies between 0 and 1 as a consequence. The contacts of each node were selected from a $(2 N + 1) \times (2 N + 1)$ square of nodes surrounding it.

Figure 5: — $〈 s_{s i} 〉$ (a) and $〈 s_{i s} 〉$ (b) as a function of $s$ , for a series of simulations with social bubbles consisting of $(2 N + 1) \times (2 N + 1)$ squares with $N = 16 (°), N = 12 (⋄), N = 10 (△), N = 8 (□), N = 6 (▽), N = 4 (⋄), N = 2 (+)$ . Parameters: population size $n = 2001^{2}$ , transmission probability $w_{i} = p_{i} / 2 = 0.5$ , decay/removal constant *p_r* = 0, and number of initial infections $n_{0} = 999$ .

For each simulation in Fig. 5b, a different value of N was taken, so that the size of the social bubble of the nodes (given by $ν = {(2 N + 1)}^{2} - 1$ ) differed per simulation. The values of N varied from N = 2 to N = 16 (i.e. the size of social bubbles varied from 24 to 1088). The dotted lines $〈 s_{s i} 〉 = 1 - s$ in Fig. 5a and $〈 s_{i s} 〉 = s$ in Fig. 5b represent the standard SIR-model. The departure [with increasing N (and ν)] in the behaviour of $〈 s_{s i} 〉$ and $〈 s_{i s} 〉$ with s from the mean-field characteristics of the standard SIR-model cannot be missed. This observation strongly indicates that the incorporation of the influence of the structure of the social networks and the size of the social bubbles into the analysis is not merely an exercise, but rather a matter of plain necessity, and that the standard SIR-model has serious shortcomings in this respect.

The usefulness of the method, presented in Section 1, of dealing with the network structure via (truncated) series expansions in s for $〈 s_{s i} 〉$ and $〈 s_{i s} 〉$ can be illustrated well by deriving values of the expansion coefficients from the simulated data for either $〈 s_{s i} 〉$ or $〈 s_{i s} 〉$ , taking these values as input for calculations of s as a function of time (t) [by solving either (1.42a,b) or (1.43a,b)], and comparison of the results with the s–t-data obtained from the simulations. It turns out that the variations of $〈 s_{i s} 〉$ with s shown in Fig. 5b can be described very well by a third-order polynomial of the type $a_{1}^{i s} s + a_{2}^{i s} s^{2} + a_{3}^{i s} s^{3}$ for all cases simulated (note that the standard SIR-model is in fact a limiting case here with $a_{1}^{i s} = 1, a_{2}^{i s} = 0, a_{3}^{i s} = 0$ ). This is clearly illustrated in Fig. 6a and b, in which the results of the best fits of the expansion coefficients $a_{1}^{i s}, a_{2}^{i s}, a_{3}^{i s}$ to the data for $〈 s_{i s} 〉$ vs. s obtained from the simulations for N = 6 and N = 10 are shown as representative examples.

Figure 6: — Third-order polynomial fits (dashed curves) of data (solid curves) for $〈 s_{i s} 〉$ vs. s from simulations with *p_r* = 0 and $N = 6, ν = 168$ (a) and $N = 10, ν = 440$ (b). Other parameters: the same as for Fig. 3. Best-fitting values for $a_{1}^{i s}, a_{2}^{i s}$ and $a_{3}^{i s}$ indicated in each figure.

It is easy to see that, with the best-fitting values taken for $a_{1}^{i s}, a_{2}^{i s}$ and $a_{3}^{i s}$ , the third-order polynomials (dashed curves) describe the simulations (solid curves) to quite an acceptable level indeed. This is also true for the other cases investigated in this respect (i.e. $N = 1, 2…16$ ). That the best-fitting values for $a_{1}^{i s}$ thus obtained provide by themselves an excellent reflection of the breakdown of the standard SIR-model deserves special attention here. Figure 7 shows these values as a function of ν (i.e. the number of contacts per node ν or, equivalently, the social-bubble size). For very large values of ν, the value of $a_{1}^{i s}$ reaches towards its asymptotic value $a_{1}^{i s} = 1$ given by the standard SIR-model (which represents the limiting case for $ν \to \infty$ ). In the lower ν-regime however, the value of $a_{1}^{i s}$ differs significantly from its mean-field value $a_{1}^{i s} = 1$ , and upon decreasing ν well below $ν \approx 300$ there is actually a collapse that disqualifies the standard SIR-model even as an approximation in this regime of ν-values.

Figure 7: — Variation of $a_{1}^{i s}$ with ν.

When p_r = 0 (and therefore s_i = s), the differential equations for s_i become identical to those for s, so that, depending on whether we use an expansion for, respectively, $〈 s_{s i} 〉$ or $〈 s_{i s} 〉$ , we only have to solve either (1.42b) or (1.43b) to obtain s as a function of t. For $N = 12, 10, 8, 6, 4, 2$ ( $ν = 624, 440, 288, 168, 80, 24$ ), the variation of s with t was calculated by numerically solving (1.43b) (see Section 3.3 for details) for the best-fitting values of $a_{1}^{i s}, a_{2}^{i s}$ and $a_{3}^{i s}$ for each N (as obtained from the previously mentioned fits of $〈 s_{i s} 〉$ vs. s). The results are shown in Fig. 8a–f. In each case, the marked datapoints represent the simulations and the dashed curves the respective solutions of (1.43b). The dotted curves relate to the results given by the standard SIR-model for the particular set of parameters used here (i.e. $w_{i} = p_{i} / 2 = 0.5, s_{0} = 2.4950 \cdot 10^{- 4}$ ). The agreement between the solutions of (1.43b) and the simulated data is equally noticeable as the discrepancy that grows (with increasing N) between them and the results from the standard SIR-model. The significance of this observation is 2-fold. On the one hand it shows that the method of expressing $〈 s_{i s} 〉$ as a series expansion in s has its merit. On the other hand, it further corroborates our previous observations about the inadequacies of the standard SIR-model and the mean-field approach that underlies it.

Figure 8: — s vs. t for *p_r* = 0 and $N = 2, 4, 6, 8, 10, 12$ . Dashed curves: fit. Dotted curves: standard SIR-model (not indicated for N = 12).

When $p_{r} \neq 0$ (so that $s_{i} \neq s$ ), we have to solve either both equations (1.42a) and (1.42b) or both equations (1.43a) and (1.43b) simultaneously. Using third-order polynomial approximations for $〈 s_{i s} 〉$ vs. s is not a viable option however. The reason is that $〈 s_{i s} 〉$ drops sharply towards zero upon approaching s = s_e (as a consequence of the removal of infections). At low-to-intermediate values of s, $〈 s_{i s} 〉$ may still be approximated well by a third-order polynomial as a function of s (as in the p_r = 0 case), but the approach of s = s_e is accompanied by a rather steep drop in $〈 s_{i s} 〉$ towards zero (for an example, see Fig. 9). The resulting functional dependence of $〈 s_{i s} 〉$ on s over the entire interval $s_{0} \leq s \leq s_{e}$ can no longer be appropriately described by a third-order polynomial in s, and using (1.43a,b) is therefore not an option. Fortunately, the dependence of $〈 s_{s i} 〉$ on s does show the desired characteristics and can be approximated fairly well in terms of a third-order polynomial, at least for N- and ν-values not too low (see Fig. 9). We can therefore use (1.42a,b) to investigate the cases where $p_{r} \neq 0$ . Such cases are of particular additional interest, since they offer an extra possibility to demonstrate the merits of expressing $〈 s_{s i} 〉$ or $〈 s_{i s} 〉$ as series expansion in s, by showing that they not only allow for an accurate reproduction of the simulated s – t curves (cumulative infections) but of the $s_{i} - t$ curves (active infections) as well. The procedure for this is conceptually similar to the one followed in the above for the p_r = 0 cases. We fit the simulated $〈 s_{s i} 〉$ vs. s data with a polynomial of the type $1 + a_{1}^{s i} s + a_{2}^{s i} s^{2} + a_{3}^{s o} s^{3}$ and take the best-fitting values of the coefficients $a_{1}^{i s}, a_{2}^{i s}$ and $a_{3}^{i s}$ as input for solving the differential equations (1.42a,b) via the algebraic/numerical method outlined in Section 3.2.

Figure 9: — Variation of $〈 s_{s i} 〉$ and $〈 s_{i s} 〉$ with s for $w_{i} = p_{i} / 2 = 0.5, p_{r} = 0.25$ and N = 10.

Figure 10 shows the results obtained in this context for $w_{i} = p_{i} / 2 = 1, p_{r} = 0.25$ and N = 12, 10, 8 ( $ν = 624, 440, 288$ ). The graphs in the left column show the simulated data (marked as grey circles) of s_i vs. t, as well as the corresponding results obtained on the basis of the standard SIR-model for the parameters involved (solid curves). The graphs in the right column show the same simulated data (also marked in grey) as in the graph to their left, but then with the solution of (1.42a,b) (solid curve) for the values of $a_{1}^{i s}, a_{2}^{i s}$ and $a_{3}^{i s}$ best fitting to the respective $〈 s_{s i} 〉$ vs. s data obtained from the simulations. The left column shows again a dramatic failure of the standard SIR-model. In contrast, the column to the right shows an excellent (N = 12) to still quite reasonable (N = 8) match between the simulated data and the solutions of (1.42a,b). This includes the position of the maximum so dramatically and consistently missed by the standard SIR-model in the left column.

In case of the cumulative infection rate s vs. t, the agreement between the simulated data and the corresponding solutions of (1.42a,b) is even slightly better than in the case of the active-infection rate s_i. This is clearly shown in Fig. 11. The solutions of (1.42a,b) (solid lines) follow the simulated data (markers) extremely well. We also see that with decreasing N, the curves show a tendency to shift to the right along the t-axis. A similar tendency can be observed in the curves for p_r = 0 shown in Fig. 8. This tendency can be understood as a direct manifestation of network and correlation effects. For instance, when the social bubbles become smaller, the relative decrease of the number of susceptibles that an active infection has left in its bubble after transmitting its infection to one of its contacts becomes larger. For smaller social bubbles, there is also an increased tendency towards the formation of small clusters of active infections sharing parts of their social bubble with other active infections. This typically leads to the kind of slow-down of the spread of the infection that we see in Fig. 11. The solutions of (1.42a,b) follow this process perfectly well, in contrast to the standard SIR-model which, from its very concept, does not account for network and correlation effects at all.

Figure 11: — s vs. t for $p_{r} = 0.25$ and N = 12, 10, 8 (other parameters: same as in Fig. 3). Simulated data (markers) and solutions of (1.42a,b) (solid curves). Dotted curve in upper figure: standard SIR-model (as indicated in grey).

In conclusion, we can say that the approximation of $〈 s_{s i} 〉$ and $〈 s_{i s} 〉$ as series expansions in s works quite satisfactory, especially in the p_r = 0 cases but also when $p_{r} \neq 0$ , provided N (or, more general ν) is not too small in the latter cases. Simulations show that the effects of a finite size of the social bubbles become noticeable at fairly large sizes already. Even for N = 12, a situation where the total number of possible contacts of a single node is as large as 624, both the variations of s and s_i with time (t) show significant quantitative variations from the mean-field behaviour that applies in the limiting case $N, ν \to \infty$ and for which the standard SIR-model is exact. Now, in real life, a number of 624 is a very large size for the social bubble of an average single member of a population when considered in an epidemiological context. From an epidemiological viewpoint, the social bubble of an individual in a population contains only those members of the population contacted intensively enough by the individual on a regular basis to make a transmission of an infection carried by one of the contacting members to the other possible (albeit not necessarily certain). As such, the social-bubble size depends on the critical exposure/uptake for the pathogen involved, defined as the exposure/uptake necessary for a full blown infection to develop in a population member: a lower critical exposure increases the social-bubble size. Also, the route of transmission affects the social-bubble size (airborne pathogens have their own notoriety in this respect). However, a number well in the hundreds for the (average) epidemiological social-bubble size in a population seems quite on the high side for even the more infectious of pathogens (see, e.g. Refs [34–36]).

Nevertheless, even in the cases of such large social bubbles, there is a substantial discrepancy between the actual time evolution of the infection numbers and the one provided by the standard SIR-model for the applying parameter values. This has serious consequences in relation to the extraction of values for p_i and p_r from field data about the spread of an actual infection. We see from Figs 8, 10 and 11 that the qualitative behaviour of the actual infection data does not differ significantly from that found for the mean-field case on the basis of the standard SIR-model. It may seem tempting therefore to fit the standard SIR-model to the actual data via adjustment of p_i (and optionally p_r where relevant). Especially in cases where only data on s vs. t are used, this may result in fits that reproduce the field data fairly well. However, it is easy to see that the values for p_i or $p_{i} / p_{r}$ thus obtained substantially underestimate the actual values (a reduction of the social-bubble size also reduces the growth rate $\dot{s}$ of the number of cumulative infections for given p_i or $p_{i} / p_{r}$ ). This is a serious problem indeed, since $R_{0} = p_{i} / p_{r}$ is often taken for the basic reproduction number (see Section 5.4), which plays an important role in practice for the assessment of the severeness of an outbreak/epidemic or of the risks associated with a particular pathogen in itself. More reliable estimates for p_i and p_r can be obtained, at least in principle, via the method of series expansions outlined in this article or via direct simulation. The problems do not end there however. It looks like the coefficients of the series expansions for $〈 s_{s i} 〉$ and $〈 s_{i s} 〉$ cannot be calculated easily via simple algebraic methods or easily implemented numerical methods, at least not for any network of arbitrary structure (for the purpose of this section, the coefficients were extracted from data generated by rather CPU intensive simulations for instance). This is a serious issue, since the structure and topology of the population network are expected to have a profound impact on at least the quantitative aspects of the spread of an infection, but perhaps also on the qualitative aspects. In connection to this we may refer to the Ising problem, where the dimensionality of the lattice (which affects, for instance, the number of nearest neighbours to a site/node) has major implications even for the qualitative behaviour of the system under consideration. The one-dimensional Ising model does not show any order at finite temperatures (no matter how low) [37], whereas the two- and three-dimensional versions of the model do exhibit ordering phenomena at temperatures above zero, but with different values of the corresponding critical temperature and critical exponents [38], the latter putting the two versions in different universality classes. Such observations are typical for systems with network or lattice features, and there is no reason to assume that population networks make an exception in this respect, especially since we will see in the coming sections that the spread of infectious diseases may be associated with its own kind of critical phenomena. Therefore, the extraction of $p_{i}$ - and $p_{r}$ -values from field data for the purpose of obtaining highly accurate estimates is quite a problematic affair troubled by fundamental difficulties. In order to obtain reliable values for instance, such an extraction cannot go without obtaining estimates for the coefficients in the series expansions of $〈 s_{i s} 〉$ or $〈 s_{s i} 〉$ as well, either from field or simulated data.

5. Properties of the model and its solutions: conditions for an epidemic, effects of infection removal

Even without solving the sets of differential equations represented by (1.42a,b) and (1.43a,b) completely, certain results key in the evolution of an epidemic can be derived from them.

5.1 Criterion for an epidemic to develop from a limited number of infections

From (1.42a), we immediately see that for ${\dot{s}}_{i}$ to be positive for $s \to 0$ (and thus for sufficiently low values of s):

p_{i} - p_{r} > 0 .

(5.1)

This is an interesting result, which actually implies a basic criterion for the possibility that a number of initial infections, so limited that $s_{i} = s_{0} \approx 0$ , will grow into a rampant epidemic or not. Only when the rate of transmission per active infection (p_i) is higher than the rate of removal (p_r), the number of active infections $n s_{i}$ (and therewith also the cumulative number of infections ns) will grow even on the basis of just a very few initial active infections.

All this may sound plausible, but the inequality (5.1) not only provides a mathematical foundation to common sense in this respect, but also points directly towards general strategies that can be deployed during the onset of an epidemic. To reduce the spread of infection one may first of all refer to protective measures or reducing the frequency of social contact (i.e. reducing f_cn). The effect of these is a reduction of p_i. The more effective they are, the more they will reduce p_i and the slower the infection will propagate at given p_r. On the other hand, one may look at cures and medication (when available). The sooner active infections can be eliminated the larger p_r will become, thus hampering the spread of the infection. In theory, the possibility even exists of smothering a major outbreak well before it even started: if by taking appropriate measures the value of $p_{i} - p_{r}$ can be made negative ( $p_{i} / p_{r} < 1$ ), a large-scale epidemic might be averted. For that to achieve by protective measures alone, it is important to mention that it is not necessary to reduce p_i to zero. Only a reduction of p_i sufficient enough to make $p_{i} - p_{r}$ negative (i.e. $p_{i} / p_{r} < 1$ ) will do. It is stressed however that when such measures fail and (5.1) is actually met, sooner or later the active infection rate will grow vigorously (in fact exponentially) with time, as may be inferred from (1.42a). For $s ≪ 1$ , Equation (1.42a) reduces to ${\dot{s}}_{i} = (p_{i} - p_{r}) s_{i}$ , the solution of which is an exponential function of t, and when $p_{i} - p_{r} > 0$ the result will be an exponential increase in time of the active infection rate that may easily grow to epidemic proportions.

An important observation to be made here is that there is no reference to the structure of the individual social networks in the inequality (5.1): the criterion it represents follows independently of the social fabric of the population. As such, at least within the context of the model, Equation (5.1) is universal and applies to all populations, irrespectively of their (social) network structure.

5.2 The maximum number of active infections reached during an epidemic

Furthermore, Equation (1.42a) allows us to obtain an expression for the value of s at which the overall (global) maximum is reached in the number of active infections. A necessary condition for such a maximum is ${\dot{s}}_{i} (s) = 0$ , which relates, in case of a global maximum, to an extremum both as a function of t and as a function of s. From Equations (1.41a) and (1.42a), we see that ${\dot{s}}_{i} = 0$ either when s_i = 0 or when $P (s) = p_{i} 〈 s_{s i} 〉 / ν - p_{r} = 0$ . The first case (s_i = 0) can be discarded, since it relates to the end of the epidemic which, in strict mathematical terms, is (always) approached asymptotically when $t \to \infty$ .

We will now evaluate the criterion for a maximum in s_i for the case where $〈 s_{s i} 〉$ is approximated by a second-order polynomial in s. This case represents the simplest deviation from the standard SIR-model possible. However, as a toy model it can be quite instructive. The relevant ODE’s and their solutions are given in Section 3.1.

When a global maximum in s_i exists in the context of a second-order polynomial approximation of $〈 s_{s i} 〉$ , it must relate to one of the solutions $σ = σ_{\pm}$ of the quadratic equation $p_{i} 〈 s_{s i} 〉 / ν - p_{r} \equiv p_{i} (ν + a_{1}^{s i} s + a_{2}^{s i} s^{2}) / ν - p_{r} = 0$ . That is:

σ_{\pm} = \frac{- a_{1}^{s i} \pm \sqrt{{(a_{1}^{s i})}^{2} - 4 ν a_{2}^{s i} (p_{i} - p_{r}) / p_{i}}}{2 a_{2}^{s i}} .

(5.2)

A particular solution $σ_{\pm}$ actually corresponds to (global) maximum when:

{\ddot{s}}_{i} = {(\dot{s_{i}} P (s) + s_{i} \dot{s} \frac{\partial P (s)}{\partial s})}_{s = σ_{\pm}} < 0 .

With $\dot{s_{i}} = 0$ for $s (t) = σ_{\pm}$ that is:

{\frac{\partial P (s)}{\partial s} |}_{s = σ_{\pm}} = \frac{p_{i} (a_{1}^{s i} + 2 a_{2}^{s i} σ_{\pm})}{ν} = \pm \frac{p_{i}}{ν} \sqrt{{(a_{1}^{s i})}^{2} - \frac{4 ν a_{2}^{s i} (p_{i} - p_{r})}{p_{i}}} < 0,

(5.3)

which can be met only in case of the minus sign, i.e. by the solution $σ_{-}$ , and (of course) only when the term under the square-root sign is non-negative.

Since it seems fairly evident that $a_{1}^{s i} < 0$ we have to distinguish between only two regimes of the expansion parameters, namely $a_{2}^{s i} < 0$ and $a_{2}^{s i} > 0$ . An epidemic requires $p_{i} - p_{r} > 0$ to start and propagate (see Section 5.1). In such a situation, it can be inferred from Equation (5.2) that when $a_{2}^{s i} < 0$ then $σ_{+} < 0$ and $σ_{-} > 0$ , and that when $a_{2}^{s i} > 0$ then $σ_{+} > 0$ and $σ_{-} > 0$ . Since $σ_{-}$ is positive for both $a_{2}^{s i} < 0$ and $a_{2}^{s i} > 0$ , there can be a local maximum in both cases, its value being obtained by substitution of $s = σ_{-}$ into Equation (3.8):

s_{i, \max} = σ_{-} - (\frac{ν p_{r} / a_{2}^{s i} p_{i}}{s_{+} - s_{-}}) log (\frac{(s_{0} - s_{-}) (σ_{-} - s_{+})}{(s_{0} - s_{+}) (σ_{-} - s_{-})}) .

(5.4)

The maximum in s_i results from a competition between the production of new infections $\dot{s}$ and the removal of already existing infections $- p_{r} s_{i}$ . For s-values below $σ_{-}$ , new infections arise at a higher rate than that existing infections are removed. At $s = σ_{-}$ , the generation of new infections is precisely compensated by the infection removal and a maximum in s_i is reached. For $s > σ_{-}$ , the removal of infections over-compensates the generation of new infections and a reduction of s_i sets in so that the epidemic or outbreak gradually fades out.

It is emphasized that, in general, the maximum in s_i does not mark a simultaneous onset of a decrease in the growth rate of new infections $\dot{s}$ .

With $\dot{s}$ given by (1.32), the maximum in the rate of new infections is given by

\ddot{s} = \frac{p_{i}}{ν} (\dot{s_{i}} 〈 s_{s i} 〉 + s_{i} \frac{\partial 〈 s_{s i} 〉}{\partial t}) = \frac{p_{i}}{ν} (\dot{s_{i}} 〈 s_{s i} 〉 + s_{i} \dot{s} \frac{\partial 〈 s_{s i} 〉}{\partial s}) = 0 .

(5.5)

This equation cannot be solved by algebraical methods in general. However, it is straightforward that when the maximum in s_i (which corresponds to $\dot{s_{i}} = 0$ ) and the maximum in $\dot{s}$ were to occur at to the same value of s, this would require the second time derivative of s:

\ddot{s} = \frac{p_{i}}{ν} \frac{\partial 〈 s_{s i} 〉}{\partial s} = \frac{p_{i} (a_{1}^{s i} + 2 a_{2}^{s i} σ_{-})}{ν}

(5.6)

to vanish. Via substitution of $σ_{-}$ that requirement is easily restated as:

\ddot{s} = - \sqrt{{(a_{1}^{s i})}^{2} - 4 ν a_{2}^{s i} (p_{i} - p_{r}) / p_{i}} = 0,

(5.7)

which holds only for very specific parameter combinations that make the term under the square-root sign zero. Hence, the maxima in s_i and $\dot{s}$ do, in general, not occur at the same s and t. In fact, unless $\ddot{s} = 0$ , the rate of new infections is always in decline already when s_i reaches its maximum at $s = σ_{-}$ , since $\ddot{s} \leq 0$ because of the minus sign in front of the square root in (5.7). The maximum in $\dot{s}$ therefore precedes the maximum in s_i, at least in the present model. This result may be of importance for policy and decision making during an ongoing epidemic. The observation that the rate of new infections has apparently reached its peak still means that the peak in the number of active infections is not there yet. Since the burden on the healthcare system due to an epidemic is largely determined by the number of active infections, this may have its implications, for instance with respect to matters of healthcare capacity.

5.3 The effects of infection removal and the herd-immunity pitfall

The final stage of the epidemic/outbreak is characterized by a stabilization (asymptotic in time) of s at some finite value s_e (n.b. $0 \leq s_{e} \leq 1$ ), whereas the active infections fade out (s_i asymptotically approaches zero when $t \to \infty, s \to s_{e}$ ). More important, the entire spread of the infection gradually comes to a halt, as the rate of infection $\dot{s}$ also becomes zero. The latter is in fact the quintessential feature of a fade-out of an epidemic and the influence of the network structure of the population is crucial in it.

The role of the network structure in the evolution of an epidemic can be described in more detail as follows. When the number of active infections approaches zero ( $s_{i} \to 0$ ), the social network of a susceptible individual will consist more and more of removed infections and (non-infected) susceptibles only. Only a decreasingly small and negligible minority of susceptible individuals is still vulnerable to infection by active infections from within their social network. The root cause of this phenomenon on a ‘microscopic’ scale (i.e. on the level of individual nodes) is the removal of infections ( $p_{r} \neq 0$ ), under the assumption that removed infections either relate to individuals that have overcome the infection and acquired immunity or to individuals that have succumbed to the infection. In both cases, such an individual then corresponds to an ‘inert’ node in the network, unable to become infected again and to pass on the infection to other nodes/individuals. The infection can no longer propagate through the population via such nodes. In fact, active infections may even become surrounded by removed infections (immunized nodes) only, thus providing a shield between that particular infection and the rest of the population (rendering the infection unable to infect other nodes). With time (and therefore with s), the number of inert nodes increases to such an extent (relative to the number of active infections s_i) that at some level the removal rate exceeds the rate of new infections: after having reached a maximum for $s = σ_{-}$ (see previous section), the number of active infections begins a steady decrease and the epidemic gradually fades out, as its propagation is more and more hampered by the mechanism described here. As an important corollary of such a mechanism, the epidemic generally comes to an end even before all the members of the population have been infected: the cumulative number of infections then stabilizes at a value $s_{e} \neq 1$ . The way in which the spread of the infection is hampered by inert nodes in the population lattice via a blockade of infection routes makes that the spread of an infection has all the characteristics of a percolation phenomenon. We will discuss this extensively in Section 8.

In mathematical terms, the end of an epidemic can be defined as the situation where s_i = 0 {which implies also that $\dot{s} = 0$ [see for instance (1.42b)]}. This allows for obtaining an equation for s_e. By setting s_i = 0 in (3.8) (second-order polynomial approximation of $〈 s_{s i} 〉$ ), s_e can be identified as the solution s of the equation:

s = \frac{ν (p_{r} / p_{i})}{a_{2}^{s i} (s_{+} - s_{-})} log (\frac{(s_{0} - s_{-}) (s - s_{+})}{(s_{0} - s_{+}) (s - s_{-})}),

(5.8)

which we reexpress, with $β = \frac{ν}{a_{2}^{s i} (s_{-} - s_{+})}$ , as:

s = β (\frac{p_{r}}{p_{i}}) log (\frac{(s_{0} - s_{+}) (s - s_{-})}{(s_{0} - s_{-}) (s - s_{+})}) = f (s) .

(5.9)

As a condition to be met by s = s_e, (5.9) can be reexpressed even further, via some algebra, in terms of the inverse of the function f(s) as:

s = s_{-} (1 + \frac{(s_{-} - s_{+}) / s_{-}}{α exp (s / η) - 1}) = f^{- 1} (s),

(5.10)

where $η = - β (p_{r} / p_{i})$ and the factor α is given by:

α = \frac{(s_{0} - s_{+})}{(s_{0} - s_{-})} .

(5.11)

Furthermore, it should be noted that when $p_{r} \neq 0$ , the function f(s) actually represents the number of removed infections at given s < 1, which at the end of the epidemic (s = s_e) becomes equal to the cumulative number of infections. It should be kept in mind that this is not the case for $f^{- 1} (s)$ however. The function f(s) differs from its inverse and $f^{- 1} (s)$ has been introduced solely to reexpress the equilibrium condition for s implied by (5.9) and not to explicitly reexpress the middle part of (5.9) in general (i.e. for all s). As such the function $f^{- 1}$ represents the number of removed infections only for s = s_e.

Equations (5.8), (5.9) and (5.10) cannot be solved for s via algebraic methods but require numerical or graphical techniques, the latter being quite instructive however. The function $f^{- 1} (s)$ is easier to handle than f(s) in that respect, mainly because of the divergent behaviour of f(s) at $s = s_{-}$ and $s_{+}$ . Furthermore, $f^{- 1} (s)$ is more appropriate for demonstrating analogies between certain aspects of the dynamics of an epidemic and concepts in thermodynamics and statistical mechanics. It is for these reasons that $f^{- 1} (s)$ was introduced in the first place to serve as a substitute for f(s) in (5.9).

As a consequence if $a_{1}^{s i} < 0$ we have $η \leq 0$ , which can be verified easily by substitution of (3.4) for $s_{\pm}$ :

η = \frac{ν (p_{r} / p_{i})}{a_{2}^{s i} (s_{+} - s_{-})} = \frac{ν (p_{r} / p_{i})}{a_{1}^{s i} \sqrt{(1 - 4 ν a_{2}^{s i} / {(a_{1}^{s i})}^{2}}} .

(5.12)

Also, the following inequalities apply to $s_{\pm}$ , as can be verified easily on the basis of (3.4) as well:

a_{1}^{s i} < 0, a_{2}^{s i} < 0 : s_{-} < 0 < s_{+} .

(5.13a)

a_{1}^{s i} < 0, a_{2}^{s i} > 0 : 0 < s_{+} < s_{-} .

(5.13b)

Focusing on the implications of the requirement expressed by (3.14) in case $a_{3}^{s i} = 0$ , it is straightforward that when $a_{2}^{s i} < 0$ [i.e. when (5.13a) applies] then $\dot{s} = p_{i} (a_{2}^{s i} s^{2} + a_{1}^{s i} s + ν) / ν \geq 0$ only for $0 \leq s \leq s_{+}$ . The same is true when $a_{2}^{s i} > 0$ [i.e. when (5.13b) applies]. We can therefore conclude that the root $s_{+}$ constitutes an upper bound to s_e when $s_{+} \leq 1$ .

We will now investigate the characteristics of the function $f^{- 1} (s)$ in some more detail. For that purpose, we introduce the continuation of the function $f^{- 1} (s)$ on the interval $0 \leq s \leq s_{+}$ to the function g(s) on $(- \infty, \infty)$ :

g (s) = s_{-} (1 + \frac{(s_{-} - s_{+}) / s_{-}}{α exp (s / η) - 1}) s \in R .

(5.14)

That is, we take the middle part of (5.10) as the rule of a function g(s) with domain $R$ instead of the interval $0 \leq s \leq s_{+}$ .

The first derivative of g(s) is readily obtained as:

\frac{\partial g (s)}{\partial s} = \frac{α (s_{+} - s_{-})}{η} (\frac{e^{s / η}}{{(α e^{s / η} - 1)}^{2}})

(5.15a)

and reexpressed via substitution of $η = \frac{ν (p_{r} / p_{i})}{a_{2}^{s i} (s_{+} - s_{-})}$ and $s_{+} s_{-} = ν / a_{2}^{s i}$ as:

\frac{\partial g (s)}{\partial s} = \frac{α {(s_{+} - s_{-})}^{2}}{s_{+} s_{-}} \cdot \frac{p_{i}}{p_{r}} (\frac{e^{s / η}}{{(α e^{s / η} - 1)}^{2}}) \geq 0 .

(5.15b)

Since $η \leq 0$ when $a_{1}^{s i} < 0$ [see (5.12)]:

\lim_{s \to \infty} \frac{\partial g (s)}{\partial s} = 0

(5.16)

so that we can conclude that g(s) approaches a horizontal asymptote as $s \to \infty$ . Furthermore, for those cases relevant, a quick examination⁵ of the second derivative $\partial^{2} g (s) / \partial s^{2}$ shows that the function g(s) may have an inflection point only for negative s. So, as illustrated in Fig. 12, for $s \geq 0$ the function g(s) is therefore a monotonously increasing, concave function of s with a horizontal asymptote:

Figure 12: — $f^{- 1} (s)$ vs. s for $p_{r} / p_{i} = 0.01, 0.10, 0.25, 0.50, 0.75, 1$ (solid curves) and the straight line y = s (dotted). Values of parameters: ν = 8, $a_{1} = - 7.2, a_{2} = - 0.8$ . Solutions of (5.10) correspond to the intersections of the relevant graph of $f^{- 1} (s)$ vs. s and the line y = s. s = 0 is always a solution. For $p_{r} / p_{i} < 1$ , a second solution s > 0 exists. With increasing $p_{r} / p_{i}$ , the second solution gradually moves towards s = 0. For $p_{r} / p_{i} = 1$ , both solutions converge into a single solution s = 0, the line y = s being the tangent of the corresponding graph of $f^{- 1} (s)$ . For $p_{r} / p_{i} > 1$ , only s = 0 remains as a solution.

y = \lim_{s \to \infty} s_{-} (1 + \frac{(s_{-} - s_{+}) / s_{-}}{α exp (s / η) - 1}) = s_{+} .

(5.17)

Furthermore, via straightforward substitution of $s = s_{0} = 0$ into (5.14), we get $g (0) = 0$ . Hence, for s > 0:

0 \leq g (s) < s_{+}

and consequently, for $0 \leq s \leq s_{+}$ :

0 \leq f^{- 1} (s) < s_{+} .

(5.18)

Now, if they exist, the solutions s = s_e of (5.10) are given by those intersections of the graph of $f^{- 1} (s)$ vs. s and the straight line y = s that take place at an s-value in the interval $0 \leq s < s_{+}$ when $s_{+} < 1$ , or in the interval $0 \leq s \leq 1$ when $s_{+} \geq 1$ (see Fig. 12). Besides s = 0 (which is always a solution), there is only a single intersection possible at most for s > 0 [due to the monotonously increasing concave nature of g(s), the absence of inflection points for s > 0 and the horizontal asymptote of g(s)]. So, if a solution $s = s_{e} > 0$ of (5.10) exists [and therefore of (5.9)], then that is the only positive solution.

Furthermore, when $s_{+} < 1$ then $s_{e} < 1$ as a direct consequence of (5.18). The latter mathematically demonstrates the possibility that significant parts of the population may remain uninfected during an epidemic mentioned on page 20. When p_r = 0 however, the entire population actually will become infected in the end (i.e. s_e = 1 when p_r = 0). For the model to be consistent with this, the coefficients $a_{1}^{s i}$ and $a_{2}^{s i}$ are subject to a constraint in that particular case. From (5.12), it follows that η = 0 when p_r = 0. For s = 1 to be a solution of (5.10) then requires:

1 = \lim_{η ↑ 0} s_{-} (1 + \frac{(s_{-} - s_{+}) / s_{-}}{α exp (s / η) - 1}) = s_{+} .

(5.19)

Substitution of (3.4) for $s_{+}$ here yields, after some rearrangements, the following relation between $a_{1}^{s i}$ and $a_{2}^{s i}$ :

a_{2}^{s i} (a_{2}^{s i} + a_{1}^{s i} + ν) = 0

(5.20a)

which implies:

a_{2}^{s i} = 0 \lor a_{2}^{s i} = - (ν + a_{1}^{s i}) .

(5.20b)

The case $a_{2}^{s i} = 0$ thereby corresponds to the standard SIR-model that we seek to replace by a more general approach in this article. In contrast, the case $a_{2}^{s i} = - (ν + a_{1}^{s i})$ does relate to the generalization of the SIR-model that accounts for the network structure of the population (albeit in the simplest approximation possible). In fact, the case $a_{2}^{s i} = 0$ , and therewith the standard SIR-model, can be seen as a special case of $a_{2}^{s i} = - (ν + a_{1}^{s i})$ where $a_{1}^{s i} = - ν$ (also see footnote⁶). It is also worth mentioning that fulfilment of the requirement $a_{2}^{s i} + a_{1}^{s i} + ν = 0$ comes down to $〈 s_{s i} 〉 = 0$ for $s = s_{i} = 1$ , and is therefore in full agreement with the requirement that s_e = 1 for p_r = 0: only when s = 1 and the number of susceptible nodes in the network becomes zero, and therefore the entire population has been infected, will the spread of the infection come to a halt. In fact, demanding that $〈 s_{s i} 〉 = 0$ for s = 1, would as well have lead us, in a totally valid way, to (5.20b).

It is emphasized that the constraint $a_{2}^{s i} = - (a_{1}^{s i} + ν)$ only applies in this particular form in case of an approximative approach where terms of order higher than 2 in the series expansion of $〈 s_{s i} 〉$ have been truncated.

In the exact case, the demand that $〈 s_{s i} 〉$ vanishes for s = 1 requires cancellation of all the expansion coefficients of the series, so that:

ν = - \sum_{k = 1}^{\infty} a_{k}^{s i} .

(5.21)

Based on the full series, and therefore exact, this relation holds in the most general sense. On every population network, no matter its structure, an epidemic will develop in accordance with this rule when p_r = 0, thus ensuring that $〈 s_{s i} 〉 = 0$ for s = 1 and therefore s_e = 1.

In general, the coefficients $a_{k}^{s i}$ not only depend on the structure of the population network, but also on $p_{r} / p_{i}$ . However, in some cases, the dependence on $p_{r} / p_{i}$ is weak, and in cases where the social network of an individual/node consists of the entire population there is even no dependence on $p_{r} / p_{i}$ or the network structure at all. In the latter case, the active infections and the removed infections will be distributed randomly over the population, so that (irrespective of $p_{r} / p_{i}$ ) the standard SIR-model applies, in which $a_{1}^{s i} = - ν$ and $a_{2}^{s o} = 0$ .

The effect of $p_{r} / p_{i} \neq 0$ in a case where the coefficients $a_{1}^{s i}$ and $a_{2}^{s i}$ are constants independent of $p_{r} / p_{i}$ is illustrated in Fig. 12. From (5.15b), it is clear that $\partial g (s) / \partial s$ (and therewith $\partial f^{- 1} (s) / \partial s$ in the s-interval of relevance) decreases with increasing $p_{r} / p_{i}$ for given $s_{+}$ and $s_{-}$ (i.e. for given $a_{1}^{s i}$ and $a_{2}^{s i}$ ). As a result, the intersection of $f^{- 1} (s)$ and the straight line y = s shifts towards lower s-values when $p_{r} / p_{i}$ increases, as can be seen in Fig. 12. So s_e decreases with $p_{i} / p_{r}$ in those cases. Furthermore, as we will see next, it can even be shown now that also when $a_{1}^{s i}$ and $a_{2}^{s i}$ do depend on p_r and p_i, the value of s_e actually becomes zero at a certain critical value of $p_{r} / p_{i}$ .

Since $f^{- 1} (s)$ is a monotonously increasing, convex function on the s-interval of relevance, it is in fact obvious that for all values of $p_{r} / p_{i}$ for which:

{\frac{\partial f^{- 1} (s)}{\partial s} |}_{s = 0} \leq 1

(5.22)

only s = 0 remains as a solution of (5.10), so that in all these cases s_e = 0.

Now, the derivative $\partial f^{- 1} (s) / \partial s$ for s = s₀ is readily obtained from (5.15b) as:

{\frac{\partial f^{- 1} (s)}{\partial s} |}_{s = s_{0}} = \frac{α (s_{0}) {(s_{+} - s_{-})}^{2}}{s_{+} s_{-}} \cdot \frac{p_{i}}{p_{r}} (\frac{e^{s_{0} / η}}{{(α (s_{0}) e^{s_{0} / η} - 1)}^{2}})

with $α = α (s_{0})$ given by (5.11). Substitution of $s_{0} = 0$ and some rearrangements yields:

{\frac{\partial f^{- 1} (s)}{\partial s} |}_{s = 0} = \frac{{(s_{+} - s_{-})}^{2}}{s_{+} s_{-}} \cdot \frac{p_{i}}{p_{r}} (\frac{α (0)}{{(α (0) - 1)}^{2}}) = \frac{p_{i}}{p_{r}} .

That is,

{\frac{\partial f^{- 1} (s)}{\partial s} |}_{s = 0} = {(\frac{p_{r}}{p_{i}})}^{- 1} .

(5.23)

Combining (5.22) and (5.23) we see that a threshold $p_{r} / p_{i} = 1$ exists which marks the boundary between a regime where $s_{e} > 0$ (when $p_{r} / p_{i} < 1$ ) and a regime where s_e = 0 (when $p_{r} / p_{i} \geq 1$ ). It should be noticed that this threshold does not depend on $a_{1}^{s i}$ and $a_{2}^{s i}$ or on ν, and is therefore independent of the structure of the population network and thus quite general in nature (at least in the context of our truncated-series model). Related to this, there is a range of values of $p_{r} / p_{i}$ , where s_e is always zero, given by $p_{r} / p_{i} \geq 1$ , irrespective of whether $a_{1}^{s i}$ and $a_{2}^{s i}$ depend on p_i and p_r or not (although such a dependence may have an influence on the value of s_e itself when $s_{e} > 0$ ). As a corollary, when $p_{r} / p_{i} \geq 1$ a small limited number of infections cannot trigger (and develop into) an epidemic that involves large parts of the population. In essence, Equations (5.22) and (5.23) represent the same result as that represented by (5.1). However, it is obtained here in a completely different way that strongly resembles the analysis of magnetic ordering in (ferro-) magnetic systems in terms of the Weiss molecular field theory [39]. As such, this approach anticipates the revelation of a striking analogy between a transition towards herd-immunity and thermodynamic phase transitions to be presented later on.

According to our analysis, an important general effect of infection removal seems to be a decrease of s_e to values less than 1: infection removal suppresses the propagation of infections through the population, to the extend that even part of the population will escape infection. A similar result has been obtained in the past within the context of the standard SIR-model as well. However, the present analysis not only shows that it also applies in a model where the structure of the population network is explicitly taken into account, thus making it a more general result, but also puts it on more solid mathematical foundations.

A legitimate question now is whether saturation of the cumulative number of infections at a value s = s_e corresponds to a state of herd-immunity. The answer to that question largely depends on how we define herd-immunity. For instance, one might think of a state of herd-immunity in the broadest sense possible, namely as a situation in which an infection is unable to further propagate within a population. The answer to the aforementioned question would be affirmative in that case. Such a definition, however, is too general for practical use, as it leaves too much room for ambiguity and some quintessential issues unaddressed. The whole point in this respect is that the value of s_e depends on the circumstances under which the infection spreads (i.e. on both p_r and p_i, as well as on the structure of the population network, which translates into the $a_{k}^{s i}$ ). As a result of this, there are serious pitfalls when it comes to rolling back measures aimed at preventing the spread of an infection. To demonstrate this, consider a situation in which a clear tendency towards saturation of s at a particular value s_e is observed after an infection has been spreading for a while under a regime of restrictive social measures. The effect of such measures is 2-fold thereby: they reduce p_i and they restrict the size of the social networks of individual members of the population, thus affecting (lowering) the value of $〈 s_{s i} 〉$ at given s. When the epidemic has already reached a stage where s makes an approach to its asymptotic value s_e, the number of active infections s_i may still not be zero but is already over its peak, and therefore:

\dot{s_{i}} = s_{i} (p_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r}) < 0 .

(5.24a)

Rolling back social measures at this stage [for instance on the basis of an (inadequate) judgement regarding the achievement of herd-immunity] means that a new regime is entered however, which replaces p_i by $p_{i}^{'}, 〈 s_{s i} 〉$ by $〈 s_{s i}^{'} 〉$ and ν by $ν^{'}$ , so that the rate of change of s_i now becomes:

{\dot{s}}_{i} = s_{i} (p_{i}^{'} \frac{〈 s_{s i}^{'} 〉}{ν^{'}} - p_{r}),

(5.24b)

where it should be noted that ${\overset{}{s}}_{i}^{'} (t)$ is equal to $s_{i} (t)$ at the moment t = t₀ when measures are rolled back (i.e. ${\overset{}{s}}_{i}^{'} (t_{0}) = s_{i} (t_{0})$ ). The problem here is that it is not at all certain that ${\dot{s}}_{i}^{'}$ is negative as well (like ${\dot{s}}_{i}$ ). Depending on $p_{i}^{'}$ and $〈 s_{s i}^{'} 〉$ , it cannot even be excluded a priori that the term in brackets in (5.24b) is actually positive and consequently ${\dot{s}}_{i}^{'} > 0$ . In that case, the spread of the infection will intensify again into a new wave of (active) infections, which will only attenuate after a new maximum in the number of active infections ( $s_{i}^{'}$ ) has been reached. Such a scenario is illustrated in Fig. 13, where the results are shown for a simulation where a less tight regime of ‘social measures’ follows upon a significantly more restrictive regime. The spread of the infections starts under a tight regime of social restrictions, in which the contacts of a particular (central) node are selected from a relatively small (2N + 1) × (2N + 1) square of nodes closest to (i.e. surrounding) the central node with N = 1 (the number of contacts to a single node thus being equal to ν = 8). The spread of the infection was simulated under this regime to a point where the epidemic had nearly come to a halt (i.e. s only weakly increasing with time and s_i almost zero). Then (at t = 500) a new, less strict, regime with N = 5 was introduced (so that the number of contacts per node increased to ν = 24), under which the few active infections left from the first regime/wave were given the opportunity to pass on their infection to the remaining susceptibles, thus restarting the epidemic. Figure 13 clearly reflects this. After a near fade-out of the active infections after the first regime it takes only a little while after the implementation of the second regime for the epidemic to regain strength, and very soon both the active and cumulative infection rates are clearly on the rise again. The result is that we are confronted with two subsequent waves of infections: one rather modest under a regime of tight social measures (N = 1) that seems to fade out after a while, and a second one of a much fiercer intensity after a partial roll-back of the measures (N = 5), by which almost all members of the population that remained uninfected after the first wave become infected in the end. It is for the possibility of scenarios of this kind alone that some doubts are justified about strategies for coping with an epidemic based on the (assumed) achievement of herd-immunity under a regime of (limited) social measures (which, however, is just what a country like Sweden openly advocated and put into practice during the initial months of the Covid-19 pandemic). It is also clear that a better understanding (coupled to a tighter definition) of the concept of herd-immunity is necessary to make it of (safer) practical use.

Figure 13: — Simulated sequence of infection waves under two different regimes of social measures (see main text). (a) Number of active infections *n_i* as a function of time [time measured in simulation cycles, i.e. the time in which (on average) each member of the population (node) makes exactly 1 contact]. (b) Cumulative infection rate s as a function of time. Dotted lines represent the cross-over of social regimes. Parameters: $p_{i} = 0.5, p_{r} = 0.325, N_{0} = 500$ , population size $N = 1501^{2}$ .

5.4 Reconsidering the meaning of reproduction numbers

It may be clear that, in view of the results presented in this section, the often quoted condition $R_{0} > 1$ for an epidemic to get started requires some reconsideration. Via a similar line of though as the one followed in Section 5.1, both the standard SIR-model and the extended SIR-model presented in this article lead to the criterion $p_{i} / p_{r} > 1$ for an epidemic to evolve from a small number of active infections in an otherwise fully susceptible population. With R₀ given by (2.5), this criterion is equivalent to $R_{0} / Q_{0} > 1$ rather than $R_{0} > 1$ . The criterion $R_{0} > 1$ can only be preserved (as is often done by the way) via the introduction of a rather crude approximation that puts $R_{0} \equiv p_{i} / p_{r}$ (and thus implicitly takes $Q_{0} = 1$ ). The rationale here is that $〈 t 〉 = 1 / p_{r}$ is the average lifetime of an active infection in case of an exponential decay as given by (2.1).⁷ Taking $〈 s_{s i} 〉 = constant = ν$ , the number of new infections per unit time due to a single active infection can be taken as a constant as well, which is equal to $p = p_{i}$ so that the total number of new infections due to a single active infection becomes $R'_{0} = p \cdot 〈 t 〉 = p_{i} / p_{r}$ (variations on this simplified scheme exist [40, 41] for the purpose of generalization, but the basic ideas underlying them are the same). It is clear that such an approach basically comes down to substitution of $〈 s_{s i} 〉 = ν$ into (2.3) and taking $τ = 0$ :

R_{0} = p_{i} \int_{0}^{\infty} e^{- p_{r} t} d t = \frac{p_{i}}{p_{r}}

(5.25)

in which case the factor Q₀, accounting for the s-dependence of $〈 s_{s i} 〉$ , becomes indeed unity and $R'_{0} = R_{0}$ . However, in general $R'_{0} \neq R_{0}$ , and $R'_{0}$ is therefore not the basic reproduction number but an approximation for R₀, suitable only in cases where the lifespan of an active infection is short enough (compared with the typical time scales related to propagation of the infection) for the depletion of the reservoir of susceptibles to be neglected. Similar considerations apply to R and the often mentioned criterion R > 1 for the number of active infections to be on the rise, as we will see at the end of the next section on herd-immunity. As such, the interpretation of the dynamics of epidemic outbreaks in terms of reproduction numbers is not entirely unproblematic.

6. Defining and understanding herd-immunity

6.1 Fundamentals

The aim is now to present a strict mathematical definition of herd immunity, in such a way that the result is not only rooted in the basic physical and mathematical principles of epidemic growth, but also makes sense from a practicle point of view.

For this purpose, we consider an epidemic that has been going on for a while in an SIR context,⁸ and under a regime of social and protective measures, such that at some moment t₀ in time, the cumulative number of infections is $s = s_{0} > 0$ and the number of active infections $s_{i} = s_{i 0} > 0$ . Furthermore,

\lim_{t ↑ t_{0}} {\dot{s}}_{i} (t) < 0

(6.1a)

so that the number of active infections is over its peak and declining immediately before t = t₀. The average number of s-nodes linked to an i-node immediately before t₀ is given by:

\lim_{t ↑ t_{0}} 〈 s_{s i} 〉 = (1 - ξ_{0}) ν .

(6.1b)

At t₀ a new regime is entered, whether by rolling back of the measures taken or by some changes in the properties (for instance the transmissibility) of the pathogen that causes the infection. The structure of the population network thus may change, so that ν has to be replaced by $ν^{'}$ . In addition, possible changes in the protective measures require the replacement of p_i by $p_{i}^{'}$ . Changes in the population network also imply the replacement of $〈 s_{s i} 〉$ by $〈 s_{s i}^{'} 〉$ and of $1 - ξ_{0}$ by $λ^{'} (1 - ξ_{0})$ . The parameter $λ^{'}$ thereby accounts for changes in the structure and the topology of the population network and matches the value of $1 - ξ_{0}$ to the new network: $〈 s_{s i}^{'} 〉 |_{t = t_{0}} = λ^{'} (1 - ξ_{0}) ν^{'}$ . We finally consider $〈 s_{s i}^{'} 〉$ as an expansion around s = s₀:

〈 s_{s i}^{'} 〉 = λ^{'} (1 - ξ_{0}) ν^{'} + \sum_{k = 1}^{\infty} a_{k}^{'} {(s - s_{0})}^{k} = λ^{'} (1 - ξ_{0}) ν^{'} + \sum_{k = 1}^{\infty} a_{k}^{'} s^{' k},

(6.2)

where $s^{'} = s - s_{0}$ .

Substitution of (6.2) into (5.24b) yields for the new regime ( $t \geq t_{0}$ ):

\dot{s_{i}} = p_{i}^{'} s_{i} [λ^{'} (1 - ξ_{0}) + \frac{1}{ν^{'}} \sum_{k = 1}^{\infty} a_{k}^{'} s^{' k}] - p_{r} s_{i} .

That is,

\dot{s_{i}} = λ^{'} (1 - ξ_{0}) p_{i}^{'} s_{i} [1 + \sum_{k = 1}^{\infty} \frac{a_{k}^{'} s^{' k}}{λ^{'} (1 - ξ_{0}) ν^{'}}] - p_{r} s_{i}

(6.3)

which is, in effect, identical to the expression for $\dot{s_{i}}$ in the case of an infection with transmission probability $p_{i}^{″} = λ^{'} (1 - ξ_{0}) p_{i}^{'}$ spreading over a network on which the number of nodes linked to a central node is $ν^{″} = λ^{'} (1 - ξ_{0}) ν^{'}$ (instead of $ν^{'}$ ). Upon truncating terms of order k > 2 in the series expansion in (6.3) we have:

\dot{s_{i}} = λ^{'} (1 - ξ_{0}) p_{i}^{'} s_{i} [1 + \frac{a_{1}^{'} s^{'} + a_{2}^{'} s^{' 2}}{λ^{'} (1 - ξ_{0}) ν^{'}}] - p_{r} s_{i} .

(6.4)

Now, like s, the variable $s^{'}$ is a state variable, so that $\dot{s_{i}} = {\dot{s}}^{'} (\partial s_{i} / \partial s^{'})$ . Therefore, upon making the identifications $p_{i} \equiv p_{i}^{″}, ν \equiv ν^{″}$ and $s \equiv s^{'}$ in (1.42a,b), the differential equation (6.4) can be dealt with in the same manner in which the differential equation of identical form given by (1.42a) was dealt within Section 3.1 for the case of a second-order polynomial approximation of $〈 s_{s i} 〉$ . We thus obtain, straightforwardly from (3.3a):

(\frac{\partial s_{i}}{\partial s^{'}}) = 1 - \frac{ν^{'} p_{r} / p_{i}^{'}}{λ^{'} (1 - ξ_{0}) ν^{'} + a_{1}^{'} s^{'} + a_{2}^{'} s^{' 2}} .

By analogy with (3.3b), this can be reexpressed as:

(\frac{\partial s_{i}}{\partial s^{'}}) = 1 - \frac{ν^{'} p_{r} / a_{2}^{'} p_{i}^{'}}{(s^{'} - s_{+}^{'}) (s^{'} - s_{-}^{'})}

(6.5)

where [see (3.4)]:

s_{\pm}^{'} = \frac{a_{1}^{'}}{2 a_{2}^{'}} (- 1 \pm \sqrt{1 - \frac{4 λ^{'} (1 - ξ_{0}) ν^{'} a_{2}^{'}}{(a_{1}^{' 2})}}) .

(6.6)

Via some minor rearrangements of (6.5) we obtain the following ODE [compare with (3.5)]:

(\frac{\partial s_{i}}{\partial s^{'}}) = 1 - (\frac{ν^{'} p_{r} / p_{i}^{'}}{a_{2}^{'} (s_{+}^{'} - s_{-}^{'})}) [\frac{1}{(s^{'} - s_{+}^{'})} - \frac{1}{(s^{'} - s_{-}^{'})}]

(6.7)

the solutions of which are given by

s_{i} = s^{'} - (\frac{ν^{'} p_{r} / a_{2}^{'} p_{i}^{'}}{s_{+}^{'} - s_{-}^{'}}) log (\frac{s^{'} - s_{+}^{'}}{s^{'} - s_{-}^{'}}) + C^{'} .

(6.8)

The constant of integration $C^{'}$ follows from the condition $s_{i} |_{s = s_{0}} = s_{i} |_{s^{'} = 0} = s_{i 0}$ (remember $s' = s - s_{0}$ ):

C^{'} = s_{i 0} + (\frac{ν^{'} p_{r} / p_{i}^{'}}{a_{2}^{'} (s_{+}^{'} - s_{-}^{'})}) log (\frac{s_{+}^{'}}{s_{-}^{'}}) .

(6.9)

Combining (6.8) and (6.9), we finally obtain

s_{i} = s^{'} - (\frac{ν^{'} p_{r} / p_{i}^{'}}{a_{2}^{'} (s_{+}^{'} - s_{-}^{'})}) log (\frac{s_{-}^{'} (s^{'} - s_{+}^{'})}{s_{+}^{'} (s^{'} - s_{-}^{'})}) + s_{i 0} .

(6.10)

The logarithmic term here on the right-hand side has a clear physical meaning that will prove key to the definition and understanding of herd-immunity. Calling this term s_L we can write (6.10) as:

s_{i} = s^{'} - s_{L} + s_{i 0} .

With $s^{'} = s - s_{0}$ and $s_{i 0} = s_{0} - s_{r 0}$ that is:

s_{i} = s - (s_{L} + s_{r 0}),

(6.11)

where $s_{r 0} = n_{r 0} / n$ relates to the total number of removed infections at t = t₀ (i.e. when s = s₀, $s^{'} = 0$ ). In general, $s_{i} (t) = s (t) - s_{r} (t)$ , so that (6.11) implies:

s_{r} = s_{L} + s_{r 0} .

(6.12)

With the given definition of $s_{r 0}$ it is thus shown that the term

s_{L} = (\frac{ν^{'} p_{r} / a_{2}^{'} p_{i}^{'}}{s_{+}^{'} - s_{-}^{'}}) log (\frac{s_{-}^{'} (s^{'} - s_{+}^{'})}{s_{+}^{'} (s^{'} - s_{-}^{'})})

(6.13)

represents the infections removed under the new regime (i.e. after t = t₀). In view of this, an additional quantity of physical relevance becomes evident as well, namely:

Δ s_{i} = s^{'} - s_{L} = s^{'} - (\frac{ν^{'} p_{r} / p_{i}^{'}}{a_{2}^{'} (s_{+}^{'} - s_{-}^{'})}) log (\frac{s_{-}^{'} (s^{'} - s_{+}^{'})}{s_{+}^{'} (s^{'} - s_{-}^{'})})

(6.14)

representing, for given $s^{'} (t)$ , the net change in the active-infection number since the new regime took effect at t = t₀.

The condition

Δ s_{i} = 0

(6.15)

makes an important physical criterion. As stated earlier, we assume a situation where the number of active infections was in (sharp) decline under the old regime prior to t = t₀. For that to remain the case under the new regime it is required that $Δ s_{i} < 0$ for all $s^{'} > 0$ (all $t > t_{0}$ ). In contrast, when there is an interval $0 < s^{'} < s_{0}^{'}$ of $s^{'}$ -values for which $Δ s_{i} > 0$ , the number of active infections will initially rise again (maybe even strongly) after the new regime has come into effect. It may be obvious that such a situation is at variance with what one would intuitively think of as a state of herd-immunity. However, a completely different situation occurs when for all conceivable regimes to come into effect at t = t₀ (most importantly the regime of social normality) $Δ s_{i}$ is negative definite for $s^{'} > 0$ (i.e. $Δ s_{i} < 0$ for all $s^{'} > 0$ ). A necessary and sufficient condition for such a situation to occur is that solving $Δ s_{i} = 0$ for $s^{'}$ yields $s^{'} = 0$ as the only solution for any conceivable regime. Although there will even be new infections after t = t₀ in such a case, the total number of active infections meanwhile can do nothing then but decrease with t and $s^{'}$ , and the epidemic is inevitably in state of decline and fading out.

To substantiate these viewpoints mathematically we define, by analogy with (5.9):

f^{'} (s^{'}) = (\frac{ν^{'} p_{r} / p_{i}^{'}}{a_{2}^{'} (s_{+}^{'} - s_{-}^{'})}) log (\frac{s_{+}^{'} (s^{'} - s_{-}^{'})}{s_{-}^{'} (s^{'} - s_{+}^{'})}) .

(6.16)

The condition $Δ s_{i} = 0$ can then be recast into the form [see (6.14)]:

s^{'} = f^{'} (s^{'})

(6.17a)

or, equivalently, into:

s^{'} = f^{' - 1} (s^{'}),

(6.17b)

where $f^{' - 1} (s^{'})$ represents the inverse of $f^{'} (s^{'})$ which, being the analogue of $f^{- 1} (s)$ in (5.10), is readily obtained, with $α^{'} = s_{+}^{'} / s_{-}^{'}$ taken as the analogue of α [see (5.11)], as:

f^{' - 1} (s^{'}) = s_{-}^{'} (1 + \frac{(s_{-}^{'} - s_{+}^{'})}{s_{+}^{'} exp (s^{'} / η^{'}) - s_{-}^{'}})

(6.18)

where

η^{'} = \frac{ν^{'} p_{r} / p_{i}^{'}}{a_{2}^{'} (s_{+}^{'} - s_{-}^{'})} .

(6.19)

Being identical in their mathematical form, the functions $f^{' - 1} (s^{'})$ and $f^{- 1} (s)$ behave in a qualitatively similar way in relation to their respective arguments $s^{'}$ and s. Hence, the line of thought followed in Section 5.3 in connection with $f^{- 1} (s)$ applies to $f^{' - 1} (s^{'})$ as well. As such, we find Equations (6.17a) and (6.17b) to have two solutions when $\partial f^{' - 1} (s^{'}) / \partial s^{'} |_{s^{'} = 0} > 1$ , one of them being $s^{'} = 0$ and the other one given by the intersection of the line $y = s^{'}$ . When $\partial f^{' - 1} (s^{'}) / \partial s^{'} |_{s^{'} = 0} \leq 1$ however, only the solution $s^{'} = 0$ remains and $Δ s_{i} < 0$ for $s^{'} > 0$ . When the latter is the case for any regime of social measures, the epidemic is in a stage of inevitable fade-out.

By analogy with (5.15a), the derivative $\partial f^{' - 1} (s^{'}) / \partial s^{'}$ is obtained as:

\frac{\partial f^{' - 1} (s^{'})}{\partial s^{'}} = \frac{s_{+}^{'} s_{-}^{'} (s_{+}^{'} - s_{-}^{'})}{η^{'}} (\frac{e^{s^{'} / η^{'}}}{{(s_{+}^{'} e^{s^{'} / η^{'}} - s_{-}^{'})}^{2}}) .

(6.20)

Substitution of (6.19) for $η^{'}$ here, while also using $s_{+}^{'} s_{-}^{'} = λ^{'} (1 - ξ_{0}) ν^{'} / a_{2}^{'}$ [to be obtained straightforwardly on the basis of (6.6)], then yields:

\frac{\partial f^{' - 1} (s^{'})}{\partial s^{'}} = \frac{λ^{'} (1 - ξ_{0}) {(s_{+}^{'} - s_{-}^{'})}^{2}}{p_{r} / p_{i}^{'}} (\frac{e^{s^{'} / η^{'}}}{{(s_{+}^{'} e^{s^{'} / η^{'}} - s_{-}^{'})}^{2}}) .

(6.21)

For $s^{'} = 0$ , i.e.

{\frac{\partial f^{' - 1} (s^{'})}{\partial s^{'}} |}_{s^{'} = 0} = \frac{λ^{'} (1 - ξ_{0}) p_{i}^{'}}{p_{r}} .

(6.22)

The condition for the epidemic to remain fading out after t = t₀ ( $s^{'} > 0$ ), also under the new regime, becomes therewith:

\frac{λ^{'} (1 - ξ_{0}) p_{i}^{'}}{p_{r}} \leq 1 .

(6.23)

That is,

λ^{'} p_{i}^{'} \leq \frac{p_{r}}{(1 - ξ_{0})} .

(6.24)

In every new regime for which the product $λ^{'} p_{i}^{'}$ meets this (in)equality, the number of active infections will be subject to a monotonous decrease after t = t₀. If the regime of social normality is among these regimes, then it is safe to lift any restrictive and protective measures, in the sense that this will not lead to a new wave of infections: the rate at which ${\dot{s}}_{i}$ decreases may be less than in a regime with measures in place, but the epidemic will continue to fade-out until the last active infections disappear and s_i becomes zero.

It is noteworthy that requiring ${\dot{s}}_{i} \leq 0$ for $s^{'} = 0$ on the basis of (6.3):

\dot{s_{i}} = s_{i} (λ^{'} (1 - ξ_{0}) p_{i}^{'} - p_{r}) \leq 0

(6.25)

directly leads us to (6.23) and (6.24) as well. However, this procedure leaves us with no clue as to whether $Δ s_{i}$ is negative definite or not for $s^{'} > 0$ , and therefore does not exclude the possibility, as observed in Fig. 13, that for some $s^{'}$ the number of active infections will start to rise again (even under the same regime of measures).

One could therefore say that as soon as (6.17a,b) apply, a form of herd-immunity has been achieved, despite the fact that new infections will continue to emerge until s_i = 0, albeit at an increasingly lower rate as $s^{'}$ increases. New infections will cease to emerge as soon as s_i = 0. That is, when [see (6.10)]:

s_{i} = s^{'} - (\frac{ν^{'} p_{r} / p_{i}^{'}}{a_{2}^{'} (s_{+}^{'} - s_{-}^{'})}) log (\frac{s_{-}^{'} (s^{'} - s_{+}^{'})}{s_{+}^{'} (s^{'} - s_{-}^{'})}) + s_{i 0} = s^{'} - f^{'} (s^{'}) + s_{i 0} = 0

and thus when:

s^{'} + s_{i 0} = f (s^{'})

(6.26a)

which is equivalent to:

s^{'} = f^{' - 1} (s^{'} + s_{i 0}) .

(6.26b)

The solution $s^{'} = s_{e}^{'}$ of (6.26a) and (6.26b) relates to s_e in this via $s_{e}^{'} = s_{e} - s_{0}$ , and has to be obtained numerically. The value of $s_{e} = s_{0} + s^{'}$ represents the final cumulative infection rate reached when the epidemic comes to an end. Since at the end of an epidemic all active infections have been removed, it also represents the final rate of removed infections to be reached, i.e. when $s = s_{e} = s_{0} + s^{'}$ then s_r = s_e.

Now that we have captured the criterion for the end of an epidemic in the mathematical form of (6.26a,b), the relevant question is how ‘robust’ the resulting epidemic state after reaching s = s_e is against an influx of new (active) infections from outside of the population, for instance via infected travellers (from outside) or infective population members (re)entering from abroad. In other words: will a new wave of infections start or not, once a (very) small but not insignificant number of new active infections from outside has been introduced into the population?

To answer this question, let $ν^{'} (1 - ξ_{e})$ be the average number of s-nodes linked to an r-node when the epidemic has come to a halt:

〈 s_{s r}^{'} 〉 = ν^{'} (1 - ξ_{e}) .

(6.27a)

Using the symmetry relation $s_{x} 〈 s_{y x}^{'} 〉 = s_{y} 〈 s_{x y}^{'} 〉$ , we then get for this case (for which $s^{'} = s_{e} - s_{0} = s_{e}^{'}$ ):

〈 s_{r s}^{'} 〉 = \frac{s_{e}}{1 - s_{e}} ν^{'} (1 - ξ_{e})

(6.27b)

and via (1.17) and (1.18), which reduce to:

〈 s_{s s}^{'} 〉 + 〈 s_{r s}^{'} 〉 = ν^{'}

(6.28a)

〈 s_{r r}^{'} 〉 + 〈 s_{s r}^{'} 〉 = ν^{'},

(6.28b)

we thus obtain

〈 s_{r r}^{'} 〉 = ν^{'} - 〈 s_{s r}^{'} 〉 = ν^{'} ξ_{e}

(6.29a)

〈 s_{s s}^{'} 〉 = ν^{'} - 〈 s_{r s}^{'} 〉 = ν^{'} (1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}})

(6.29b)

for this case. To distinguish the average coordinations $〈 s_{x y} 〉$ and $〈 s_{x y}^{'} 〉$ in the earlier wave(s) from those in the new wave, we write the latter as $〈 σ_{x y} 〉$ and the coefficients of their corresponding series expansions as α_k (instead of a_k). Similarly, we also introduce σ_i and $σ = s - s_{e}$ as, respectively, the rate of active infections and the rate of cumulative infections in the new wave [where σ accounts for the infections transmitted during the new wave only, in contrast to the total cumulative infection rate s which includes the infections that emerged during the prior wave(s) as well]. We assume the initial number of new active infections that form the precursor to a possible new wave of infections to be very low ( $σ_{0} = σ_{i 0} ≪ 1$ or even $σ_{0} = σ_{i 0} \approx 0$ ). Since we also assume the removed infections to have full immunity, a possible new wave of infections will spread exclusively among those members of the population (nodes) that remained uninfected (i.e. susceptible) during the earlier wave(s). Hence, since each newly introduced active infection is considered to replace a susceptible node at random (so that $〈 σ_{s i} 〉 |_{σ = σ_{0}} = 〈 s_{s i}^{'} 〉 |_{s^{'} = s_{e} - s_{0}}$ ) and because σ₀ is considered small enough that to take $σ_{0} \approx 0$ :

〈 σ_{s i} 〉 = 〈 s_{s i}^{'} 〉 |_{s^{'} = s_{e} - s_{0}} + \sum_{k = 1}^{\infty} α_{k} σ^{k} = ν^{'} (1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}}) + \sum_{k = 1}^{\infty} α_{k} σ^{k} .

(6.30)

We consider no changes in social measures taken after the first appearance of the new active infections, so that the typical rate of transmission remains $p_{i}^{'}$ , the rate of change of σ_i in the new wave therewith becomes:

{\dot{σ}}_{i} = σ_{i} (\frac{p_{i}^{'} 〈 σ_{s i} 〉}{ν^{'}} - p_{r}) = σ_{i} (p_{i}^{'} {(1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}}) + \sum_{k = 1}^{\infty} α_{k} σ^{k}} - p_{r}) .

(6.31)

However, a new wave of infections due to a small (almost negligible) number of initial infections $σ_{0} ≪ 1$ will start only if $\dot{σ_{i}} > 0$ for $s^{'} = s_{e}$ (i.e. for $σ = σ_{0} \approx 0$ ). From (6.31), it is easily inferred that a new wave will therefore not emerge when:

p_{i}^{'} (1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}}) - p_{r} \leq 0 .

That is, when:

\frac{1 - s_{e}}{1 + s_{e} (ξ_{e} - 2)} \cdot \frac{p_{r}}{p_{i}^{'}} \geq 1

(6.32)

which thus provides us with a proper criterion for ‘true’ herd-immunity that is not only rooted in the basic mechanisms and mathematics of epidemic growth but also connects with our intuitive conception of the phenomenon.

An important insight that immediately follows from (6.32) is that apparently the structure and topology of the population network do have an influence in the process of achieving herd-immunity, in contrast to what we have seen earlier in connection with the criterion [expressed by (5.1), (5.23), and (5.24a,b)] for an epidemic to develop from a few initial infections in case of a fully susceptible population (s_s = 1). Key for this observation is the dependence of s_e and ξ_e on the structure of the social networks. Network correlations explicitly make their way into the process via ξ_e (whereas they affect the value of s_e in a more implicit manner). These correlations are a typical artefact of network structure of the population and also the percolative nature of the spread of an infection on a population network. Their influence can be understood as follows. We write, for every $s \leq s_{e}$ :

〈 s_{s r}^{'} (s^{'}) 〉 \equiv ν^{'} (1 - ξ (s^{'})) .

(6.33a)

In case of a fully random distribution of the susceptible individuals/nodes over the network $〈 s_{s r}^{'} (s^{'}) 〉 = ν^{'} (1 - s)$ . We may therefore write:

〈 s_{s r}^{'} (s^{'}) 〉 = ν^{'} (1 - s^{'}) + Δ 〈 s_{s r}^{'} (s^{'}) 〉,

(6.33b)

where $Δ 〈 s_{s r}^{'} (s^{'}) 〉$ accounts for the correlations (i.e. the deviations from the random distribution). Combining (6.33a) and (6.33b), we have:

ξ (s^{'}) = s^{'} - \frac{Δ 〈 s_{s r}^{'} (s^{'}) 〉}{ν^{'}} .

(6.34a)

With $Δ ξ (s^{'}) = - \frac{Δ 〈 s_{s r}^{'} (s^{'}) 〉}{ν^{'}}$ that is:

ξ (s^{'}) = s^{'} + Δ ξ (s^{'})

(6.34b)

so that $ξ_{e} = ξ (s_{e}) = s_{e} + Δ ξ_{e}$ .

We thus obtain:

\frac{1 - s_{e}}{1 + s_{e} (ξ_{e} - 2)} = \frac{1 - s_{e}}{1 + s_{e} ((s_{e} + Δ ξ_{e}) - 2)} = \frac{1}{1 - s_{e} + \frac{s_{e} Δ ξ_{e}}{1 - s_{e}}} .

Substitution of which into (6.32) yields:

\frac{1}{1 - s_{e} + s_{e} Δ ξ_{e} / (1 - s_{e})} \cdot \frac{p_{r}}{p_{i}^{'}} \geq 1 .

(6.35)

We see that the network correlations explicitly enter this (in)equality on the left-hand side via $Δ ξ_{e}$ . The sign of $Δ ξ_{e}$ is indicative of whether such correlations support ( $Δ ξ_{e} < 0$ ) or counteract ( $Δ ξ_{e} > 0$ ) the achievement of herd-immunity. A qualitative argument for the sign of $Δ ξ_{e}$ in general can be given by considering the influence of correlations on the number of s–r pairs and $〈 s_{s r}^{'} 〉$ .

In the absence of correlations (fully random distribution of r-nodes), the number of s–r pairs for $s^{'} = s_{e} - s_{0}$ is [see (1.14)]: $n_{s r}^{'} = n s_{e} 〈 s_{s r}^{'} 〉 = n ν^{'} s_{e} (1 - s_{e})$ , so that $〈 s_{s r}^{'} 〉 = ν^{'} (1 - s_{e})$ . However, due to the percolative nature of the spread of the infection, the r-nodes along its paths are not randomly distributed but form dendritic structures (‘trees’). With $s_{r}^{'} = s_{e}$ , the following inequality then applies to the number of r–r pairs for $s^{'} = s_{e} - s_{0}$ :

n_{r r}^{'} \leq \frac{n s_{e}^{2} ν^{'}}{2},

where the equal sign relates to a random distribution of removed infections (each removed infection having $s_{e} ν^{'}$ other removed infections in its social network) and the inequality applies in case of the (correlated) dendritic structures (the division by 2 corrects for double counting removed infections). With $n_{r r}^{'} = n s_{r}^{'} 〈 s_{r r}^{'} 〉 / 2$ [see (1.15)], we thus obtain (remember $s_{r}^{'} = s_{e}$ ):

〈 s_{r r}^{'} 〉 \leq ν^{'} s_{e}

and with $〈 s_{s r}^{'} 〉 = ν^{'} - 〈 s_{r r}^{'} 〉$ [see (6.28b)]:

〈 s_{s r}^{'} 〉 \geq ν^{'} (1 - s_{e}) .

Via $Δ ξ (s^{'}) = - \frac{Δ 〈 s_{s r}^{'} (s^{'}) 〉}{ν^{'}}$ and (6.33b), this (in)equality can be transformed into:

1 - (s_{e} + Δ ξ_{e}) > 1 - s_{e}

in the presence of correlations (i.e. in cases where $Δ ξ_{e} \neq 0$ ). That is,

Δ ξ_{e} < 0 .

(6.36)

We thus find out that network correlations by themselves always contribute to the achievement of herd-immunity in a positive way: a negative $Δ ξ_{e}$ increases the value of left-hand side of (6.35) with respect to its value for $Δ ξ_{e} = 0$ (i.e. for a random distribution without network correlations). It is emphasized however, that network correlations are a contributing factor, rather than a necessary requirement for herd-immunity. After all, is clear that even in the absence of correlations ( $Δ ξ_{e} = 0$ ), the sheer increase in the cumulative infection number s_e alone already leads to an increase in the left-hand side of (6.35). However, indicative of the role of correlations, (6.35) and (6.36) indirectly emphasize the role of percolation effects as well in the establishment of herd-immunity, given the close relation between network correlations and percolation.

6.2 Different types of herd-immunity: a classification

It is worth noticing that a distinction can be made on the basis of (6.24) and (6.32) between different types of herd-immunity, all of which having clear, but different, practical implications. First of all, it is important whether (6.24) and (6.32) only hold in regimes of strict social measures or in every conceivable regime of social measures (including the regime of social normality). In the first case, we can speak of ‘weak’ herd-immunity, as opposed to ‘strong’ herd-immunity in the second case. Weak herd-immunity is a contextual phenomenon and it occurs only by virtue of the restrictions imposed upon the population by a regime of sufficiently strict social measures. There is no guarantee that the population is safe from a restart of the epidemic as soon as social restrictions are (partially) lifted. Only strong herd-immunity can offer such a guarantee. In fact, weak herd-immunity is not what we intuitively associate with herd-immunity, only strong herd-immunity does. However, for the sake of clarity, making a distinction between the two forms has its benefits.

Another relevant distinction can be made on the basis of something that we might call the ‘degree’ of herd-immunity. As soon as (6.24) is met, the epidemic is in a phase of inevitable fade-out under the imposed regime of social measures. However, the generation of new infections has still not come to a halt (which may still put a burden on the health system for instance). Nevertheless, the risk of the epidemic growing out of control has disappeared. We might call this situation a state of ‘first degree herd-immunity’. Such a situation may precede a state of ‘second degree herd-immunity’, which is entered when the number of active infections actually becomes zero and the resulting state is one in accordance with (6.32), i.e. a state where, although (very) small pockets of new infections may (re)appear, the population at large is ‘immune’ against the build-up of a new wave of infections, at least under the imposed regime of restrictive social measures.

It should be clear that states of weak herd-immunity are deceptive, irrespective of their degree. Although the number of infections may be declining (first degree) or has come to a fade-out (second degree), the risk of a renewed increase in the active infection rate after lifting the social measures is real. A state of weak herd-immunity may therefore be the aim of temporary measures to lift the burden on the health system, but its achievement should by no means taken as a motivation to return to normality. Only when the number of active infections is in decline in a case of strong herd-immunity (of first degree) such a return is safe. By definition, the end of the battle against an epidemic outbreak is then marked by a state of strong herd-immunity of second degree: not only has the active infection rate faded-out in such a case, but as long as the acquired immunity of the previously infected individuals remains intact the population is immune, even under a regime of social normality, against new waves of infections arising out of small contingents of initial infections. Telling the difference between states of weak and strong herd-immunity may not be easy in practice however, especially when the pathogen is (relatively) new and its properties (p_i for instance) insufficiently known.

Is should be emphasized, however, that also a state of strong, second-degree, herd-immunity may not necessarily last indefinitely. The analysis in the previous section clearly shows that a sufficiently strong and sufficiently long-lasting (acquired) immunity of individual population members is in fact the necessary condition for both achieving and sustaining a state of herd-immunity in general. Therefore, in cases of waning immunity, where the immunity of the individual population members goes into decline after a while, the immunity of the population at large against epidemic outbreaks will also be reduced or even entirely disappear.

6.3 Herd-immunity and reproduction numbers

In the literature, conditions for herd-immunity and the herd-immunity threshold are often expressed in terms of reproduction numbers. Considering the critical remarks made in Section 2 and Section 5.4 with respect to the practical use of reproduction numbers, combined with the occasional confusion about the definition of herd-immunity in the literature, a short regression into this subject seems more than appropriate.

Very often, a state of herd-immunity is defined as a state where the number of active infections has reached its peak and is in a state of decline. It may be clear that this is actually what is called a state of first-degree herd-immunity according to the classification in the previous section, as opposed to a state of second-degree herd-immunity to be reached when the active infection rate has finally faded-out to zero. The condition for such a state of first-degree herd-immunity is (when $s_{i} \neq 0$ ):

{\dot{s}}_{i} = p_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r} \leq 0 .

That is,

\frac{〈 s_{s i} 〉}{ν} \cdot \frac{p_{i}}{p_{r}} \leq 1 .

(6.37)

Note that $〈 s_{s i} 〉 = 〈 s_{s i} (t) 〉$ . We define:

R' = R' (t) = \frac{〈 s_{s i} (t) 〉}{ν} \cdot \frac{p_{i}}{p_{r}}

(6.38)

which we reexpress as:

R' (t) = Q' (t) \cdot \frac{p_{i}}{p_{r}}

(6.39a)

where

Q' (t) = \frac{〈 s_{s i} (t) 〉}{ν}

(6.39b)

so that the requirement for first-degree herd-immunity (6.37) can be rewritten as:

R' = Q' \cdot \frac{p_{i}}{p_{r}} \leq 1 .

(6.40)

The similarity between (6.39a) and (2.4) is obvious. However, $Q' \neq Q$ in general [see (2.3)], so that $R' \neq R$ . Therefore, $R'$ is not a reproduction number.

Combining (2.3) and (6.39b), we get:

Q (t) = - \int_{t}^{\infty} \frac{〈 s_{s i} (t') 〉}{ν} d e^{- p_{r} (t' - t)} = p_{r} \int_{t}^{\infty} Q' (t') e^{- p_{r} (t' - t)} d t' .

(6.41)

We introduce $Q_{m} (t)$ :

\begin{matrix} Q_{m} (t) = p_{r} \int_{t}^{\infty} Q' (t) e^{- p_{r} (t' - t)} d t' \\ = p_{r} \cdot Q' (t) \int_{t}^{\infty} e^{- p_{r} (t' - t)} d t' = Q' (t) \end{matrix} .

(6.42)

Since $Q' (t) \geq Q' (t')$ for $t' \geq t$ (because $〈 s_{s i} (t) 〉 \geq 〈 s_{s i} (t') 〉$ for $t' \geq t$ ), it is easy to see that $Q_{m} (t) \geq Q (t)$ . By combining this result and (6.42), we thus obtain:

Q' (t) \geq Q (t) .

(6.43)

Hence [see (2.4) and (6.39a)], for all t > 0: $R' \geq R$ . So, when the criterion $R' \leq 1$ for first-degree herd-immunity is met, the criterion $R \leq 1$ is also met. Vice versa however, $R \leq 1$ is not a sufficient condition for $R' \leq 1$ . Therefore, $R \leq 1$ is not a valid criterion for first-degree herd-immunity.

7. Vaccination

A key lesson from the previous sections is that the pursuit of herd-immunity by allowing the infection to spread through the population may possibly run into serious difficulties. When the infection is allowed to spread under a (strict) regime of social measures there is a risk that the population ends up in a state of apparent herd-immunity that turns out to be false as soon as social restrictions are lifted and the infection rate starts to rise steeply again. On the other hand, letting the infection spread under a normal social regime does lead to herd-immunity, but only at the cost of a very large number of infections, which is unacceptable especially in those cases where the infection is of a kind that causes serious health issues. Therefore, the only way to achieve herd-immunity in a manner that is safe under all circumstances is vaccination.

To describe the effects of large-scale vaccination on the susceptibility of a population to epidemic spreading of an infection, we introduce the effectiveness ϵ of a vaccine, being the relative decrease of the transmission probability w_i or, equivalently, the relative reduction of the transmission constant $p_{i} = 2 w_{i}$ . The constant of transmission $p_{i}^{'}$ from an active infection to a vaccinated individual is thereby related to the transmission constant p_i from an active infection to an unvaccinated (fully susceptible) individual via:

p_{i}^{'} = (1 - ϵ) p_{i} .

(7.1)

The lesser the protection offered by a vaccine, the lower the value of ϵ for that particular vaccine: by definition $0 \leq ϵ \leq 1$ , where ϵ = 1 corresponds to a 100% effective vaccine that gives full protection (immunity), whereas ϵ = 0 effectively relates to a case without any vaccination or to a totally inactive vaccine.

Now, let $〈 σ_{s i} 〉$ and $〈 σ_{s i}^{'} 〉$ , respectively, be the average number of unvaccinated nodes and the number of vaccinated nodes linked to an active infection. The rate of change s_i of the active infections in case of a (partially) vaccinated population can then be expressed as:

\begin{matrix} {\dot{s}}_{i} = s_{i} (\frac{p_{i} 〈 σ_{s i} 〉 + p_{i}^{'} 〈 σ_{s i}^{'} 〉}{ν} - p_{r}) \\ = s_{i} (\frac{p_{i} (〈 σ_{s i} 〉 + (1 - ϵ) 〈 σ_{s i}^{'} 〉)}{ν} - p_{r}) . \end{matrix}

(7.2)

The vaccinated population is assumed to be free from active and removed infections ( $s_{i} = s = 0$ ) prior to t = 0, when a very small number ( $s_{i} ≪ 1$ ) of new active infections, randomly distributed over the population, appears.

We write:

〈 σ_{s i} 〉 = ν (1 - ξ_{v}) + \sum_{k = 1}^{\infty} α_{k} s^{k} 〈 σ_{s i}^{'} 〉 = ν ξ_{v} + \sum_{k = 1}^{\infty} α_{k}^{'} s^{k} .

(7.3)

Here, ξ_v accounts for the reduction of $〈 σ_{s i} 〉$ and $〈 σ_{s i}^{'} 〉$ at t = 0 (s = 0) due to vaccination (compared with ξ₀, ξ_e and $ξ (s)$ in Section 6). Mathematically, nodes representing a vaccinated individual are equivalent to those representing a removed infection. Substitution of (7.3) into (7.2) yields

{\dot{s}}_{i} = s_{i} (p_{i} {(1 - ϵ ξ_{v}) + \frac{1}{ν} \sum_{k = 1}^{\infty} β_{k} (ϵ) s^{k}} - p_{r})

(7.4)

where

β_{k} (ϵ) = α_{k} + (1 - ϵ) α_{k}^{'} .

(7.5)

The condition ${\dot{s}}_{i} (t) |_{t = 0} > 0$ (or equivalently ${\dot{s}}_{i} (s) |_{s = 0} > 0$ ) for an epidemic to develop from a few initial infections then implicitly leads to (compare with Sections 5.1 and 5.2):

\frac{1}{(1 - ϵ ξ_{v})} \cdot \frac{p_{r}}{p_{i}} \geq 1

(7.6)

as a criterion for vaccine-acquired herd-immunity. Possible (network) correlations between vaccinated individuals enter the criterion via ξ_v. Such correlations may arise (in theory) when the vaccination is carried out according to a non-random scheme. Such a situation seems quite unusual however. We therefore assume that the members of the population are vaccinated at random so that $ξ_{v} = s_{v}$ , where s_v is the vaccination rate (fraction of the population vaccinated). In that case, (7.6) becomes

\frac{1}{(1 - ϵ s_{v})} \cdot \frac{p_{r}}{p_{i}} > 1 .

(7.7)

The product $ϵ s_{v} \equiv s_{v}$ can be considered as an effective vaccination rate: vaccinating a fraction s_v of the population with a vaccine having an effectiveness of $ϵ < 1$ is equivalent to vaccinating a population fraction $s_{v} = ϵ s_{v}$ with a vaccine having an effectiveness ϵ = 1. The critical vaccination rate marking the herd-immunity threshold is now straightforwardly obtained from (7.7) as:

s_{v c} = \frac{p_{i} - p_{r}}{ϵ p_{i}} = \frac{1}{ϵ} (1 - \frac{Q_{0}}{R_{0}}),

(7.8)

where $R_{0} = Q_{0} \cdot p_{i} / p_{r}$ is the basic reproduction number, with Q₀ accounting for the s-dependence of $〈 s_{s i} 〉$ (see Section 2). In the literature, the herd-immunity threshold is often given as $s_{v c} = 1 - 1 / R_{0}$ (see Ref. [13] for instance). This is, strictly speaking, incorrect. Expressions of that form relate to an incorrect/incomplete expression for R₀ or, at best, to an approximation for R₀ (see Section 5.4), where the s-dependence of $〈 s_{s i} 〉$ is ignored or neglected (leading to $R_{0} = p_{i} / p_{r}$ ).

The result (7.8) can be reexpressed in terms of the critical effective vaccination rate $s_{v c}$ as:

s_{v c} = \frac{p_{i} - p_{r}}{p_{i}} = 1 - \frac{Q_{0}}{R_{0}} .

(7.9)

Vaccination-acquired herd-immunity is obtained when the vaccination rate is equal to or larger than the critical vaccination rate, which is equivalent to the effective vaccination rate $s_{v} = ϵ s_{v}$ being equal to or larger than the critical effective vaccination rate $s_{v c}$ :

s_{v} \geq \frac{1}{ϵ} (1 - \frac{Q_{0}}{R_{0}})

(7.10a)

s_{v} \geq 1 - \frac{Q_{0}}{R_{0}} .

(7.10b)

Since $0 \leq ϵ \leq 1$ and $0 \leq s_{v} \leq 1$ , it is easy to see that $0 \leq s_{v} \leq 1$ . The critical effective vaccination rate $s_{v c} = 1 - Q_{0} / R_{0}$ is the lowest value of $s_{v}$ for which herd-immunity is obtained for given $p_{i} / p_{r} = R_{0} / Q_{0}$ . However, from (7.10a), it is evident that its value also equals the lowest value of ϵ for which (by vaccinating the entire population so that s_v = 1) herd-immunity can be obtained at given $p_{i} / p_{r}$ (lower values of ϵ do not allow for herd-immunity to be obtained for the value of $p_{i} / p_{r}$ involved, since they would require $s_{v} > 1$ ). We can therefore combine two ‘phase diagrams’ into a single figure. Figure 14a shows the relevant combinations of $p_{i} / p_{r}$ and $s_{v}$ , as well as the combinations of $p_{i} / p_{r}$ and ϵ, for which vaccine-acquired herd-immunity is or (respectively) can be obtained (and for which not). The combinations of $p_{i} / p_{r}$ and $s_{v}$ are represented as points $(p_{i} / p_{r}, s_{v})$ in the $p_{i} / p_{r} - s_{v}$ plane (horizontal axis and right vertical axis), whereas the combinations of $p_{i} / p_{r}$ and ϵ are represented as points $(p_{i} / p_{r}, ϵ)$ in the $p_{i} / p_{r} - ϵ$ plane (horizontal axis and left vertical axis). Vaccine-acquired herd-immunity is possible only for points $(p_{i} / p_{r}, s_{v})$ and $(p_{i} / p_{r}, ϵ)$ in the grey-shaded area of the combined $p_{i} / p_{r} - s_{v}$ and $p_{i} / p_{r} - ϵ$ plane. This area is enclosed by the curves $ϵ, s_{v} = 1, p_{i} / p_{r} = 1$ and the graph of the function $f : p_{i} / p_{r} \to 1 - p_{r} / p_{i}$ . Points $(p_{i} / p_{r}, s_{v})$ in this grey-shaded area correspond to a state of (vaccine-induced) herd-immunity by definition. In contrast, for points $(p_{i} / p_{r}, ϵ)$ in the grey-shaded area, herd-immunity is obtained only when an appropriate vaccination rate $0 \leq s_{v} \leq 1$ consistent with (7.10a) is chosen. It should be noted that the region for $p_{i} / p_{r} < 1$ is in fact irrelevant in the context of vaccination, since herd-immunity is, so to speak, trivial and inherent to the situation here (an epidemic cannot develop at all when p_i < p_r, as outlined previously).

Figure 14: — (a) Combined $p_{i} / p_{r} - ϵ$ and $p_{i} / p_{r} - s$ ‘phase-diagrams’. Points $(p_{i} / p_{r}, s)$ (right vertical axis) in the grey-shaded area correspond to vaccine-acquired herd-immunity, points $(p_{i} / p_{r}, ϵ)$ (left vertical axis) to the *possibility* of vaccine-acquired herd-immunity (via a sufficiently high vaccination rate, the minimum value of which can be read from the diagram in Fig 14b). (b) Contour lines of the vaccination rates necessary for vaccine-acquired herd-immunity as a function of $p_{i} / p_{r}$ and ϵ.

In addition, Fig. 14b shows a contour map of the minimum vaccination rates necessary to obtain herd-immunity [calculated on the basis of (7.10a)] as a function of $p_{i} / p_{r}$ and ϵ. Adjacent contours correspond to a difference $Δ s_{v} = 0.0625$ . The contours for $s_{v} = 0, 0.25, 0.5, 0.75, 1$ have been specially highlighted in black to serve as visual anchors. The progress of the contour lines clearly illustrates how ever higher vaccination rates become necessary to obtain herd-immunity when $p_{i} / p_{r}$ is increased while ϵ remains constant. Where the line $y = ϵ = constant$ intersects the curve $y = 1 - p_{i} / p_{r}$ (which relates to s_v = 1), a critical value $r_{c} = r_{c} (ϵ)$ of $p_{i} / p_{r}$ is reached: for values of $p_{i} / p_{r} > r_{c} (ϵ)$ a vaccine with efficiency ϵ is unable to provide herd-immunity. Conversely, Fig. 14b also shows however that lower values of ϵ necessitate ever higher values of s_v to obtain herd-immunity when R₀ is kept fixed, until a critical value $ϵ_{c} = ϵ_{c} (p_{i} / p_{r}) = 1 - p_{r} / p_{i}$ is reached below which no herd-immunity is possible even for s_v = 1. Points $(p_{i} / p_{r}, ϵ)$ in the segment of the $p_{i} / p_{r} - ϵ$ plane enclosed by the curve $ϵ = 1 - p_{r} / p_{i}$ and the horizontal axis (ϵ = 0) therefore relate to a situation where the pathogen involved becomes ‘endemic’. What is meant by this is that for such combinations of $p_{i} / p_{r}$ and ϵ the spread of the infection cannot be stopped, despite vaccination. When additionally the individuals that have recovered from an infection only obtain a low (partial) immunity and/or loose most of their immunity after longer periods of time, the pathogen will remain circulating among the members of the population. The only way out of this situation is to develop a vaccine with an effectiveness high enough to ensure herd-immunity for a vaccination rate $s_{v} \leq 1$ . As long as such a vaccine is not available, the pathogen has to be considered as ‘endemic’.

It may also happen that a vaccination campaign is undertaken using different vaccines of different effectiveness (as in the case of, for instance, many 2021 vaccination campaigns against Covid-19). In such a situation, (7.2) should be replaced by the more general expression:

{\dot{s}}_{i} = s_{i} (\frac{p_{i}}{ν} {〈 σ_{s i} 〉 + \sum_{m} (1 - ϵ_{m}) 〈 σ_{s i, m}^{'} 〉} - p_{r}),

(7.11)

where the summation runs over the different vaccines, which are labelled by the integer m. We replace the equation on the right in (7.3) by:

〈 σ_{s i, m}^{'} 〉 = ν ξ_{m} + \sum_{k = 1}^{\infty} α_{k, m}^{'} s^{k} .

Correspondingly, the equation on the left in (7.3) is replaced by

〈 σ_{s i, m} 〉 = ν (1 - \sum_{m} ξ_{m}) + \sum_{k = 1}^{\infty} α_{k, m} s^{k} .

We can now write the generalization of (7.4) as:

{\dot{s}}_{i} = s_{i} (p_{i} {(1 - \sum_{m} ϵ_{m} ξ_{m}) + \frac{1}{ν} \sum_{k = 1}^{\infty} β_{k} (ϵ_{1} .. ϵ_{m}) s^{k}} - p_{r}) .

(7.12)

In case of at-random vaccinations that is:

{\dot{s}}_{i} = s_{i} (p_{i} {(1 - \sum_{m} ϵ_{m} s_{m}) + \frac{1}{ν} \sum_{k = 1}^{\infty} β_{k} (ϵ_{1} .. ϵ_{m}) s^{k}} - p_{r}),

(7.13)

where s_m is the (partial) rate of vaccination with vaccine m (i.e. the fraction of the population vaccinated with vaccine m). In analogy with (7.7), the criterion for herd-immunity now becomes:

\frac{1}{(1 - \sum_{m} ϵ_{m} s_{m})} \cdot \frac{p_{r}}{p_{i}} \geq 1 .

(7.14)

Let the vaccine-averaged efficiency be defined as:

\bar{ϵ} = \frac{\sum_{m} ϵ_{m} s_{m}}{\sum_{m} s_{m}},

(7.15)

where $\sum_{m} s_{m} = s_{v}$ can be identified as the total (cumulative) vaccination rate (i.e. the sum of all the partial vaccination rates).

The criterion (7.14) can then be reexpressed as:

\frac{1}{(1 - {\bar{ϵ}}_{m} s_{v})} \cdot \frac{p_{r}}{p_{i}} \geq 1

(7.16)

from which the critical vaccination rate follows as:

s_{v c} = \frac{p_{i} - p_{r}}{\bar{ϵ} p_{i}} .

(7.17)

This result is of the same form as (7.8), except that the (single-vaccine) effectiveness ϵ has been replaced by the vaccine-averaged effectiveness $\bar{ϵ}$ . An effective vaccination rate $s_{v}$ can be defined in the same way as previously, giving $s_{v} = \bar{ϵ} s_{v}$ , the critical effective vaccination rate $s_{v c}$ being given by (7.9).

It should be noted that the obtained critical vaccination rates do not depend on the structure of the social network: as long as the vaccinations are carried out at random, critical vaccination rates are the same for all populations irrespective of their social (network) structure.

However, when a vaccination campaign is undertaken during an ongoing epidemic the situation is different, and the structure of the population network does have an influence on the threshold for vaccine-acquired herd-immunity, even in case of at-random vaccination.

Suppose that the infection has been spreading from t = 0 onwards until at t = t₀ a vaccination campaign is started, and that only one type of vaccine is used in this campaign. As a result of the vaccinations, the susceptible part of the population is divided into a vaccinated and an unvaccinated part from t = t₀ onwards, where the ‘bare’ transmission constant p_i applies to the unvaccinated part and the reduced transmission constant $p_{i}^{'} = (1 - ϵ) p_{i}$ to the vaccinated part. With time, the epidemic comes to a halt under the combined influence of infection removal and the vaccine-related reduction of p_i. This situation differs from the case without a vaccination campaign during the epidemic, since in that case the epidemic comes to a halt due to infection removal only (and its effects on infection percolation).

In general, especially when vaccinations are randomly distributed across the population, the vaccination rates among the susceptibles and the removed infections differ after the end of the epidemic. These rates are represented, respectively, by $s_{v}^{s}$ and $s_{v}^{r}$ . Due to the lower transmission probability that vaccinated susceptibles are subject to compared with non-vaccinated ones ( $p_{i}^{'} < p_{i}$ ), eventually (with time) the inequality $s_{v}^{s} \geq s_{v}^{r}$ will apply. Vaccinated susceptibles will get infected (if at all) in lower numbers than the non-vaccinated ones so that they will become overrepresented among the non-infected individuals and underrepresented among the infected (the opposite being the case for the non-vaccinated susceptibles). The partial vaccination rates $s_{v}^{r}$ and $s_{v}^{s}$ are related to the total vaccination rate s_v for the entire population via:

s_{s} s_{v}^{s} + s_{r} s_{v}^{r} = s_{v} .

(7.18)

Hence, the inequality $s_{v}^{s} \geq s_{v}^{r}$ implies that $s_{v}^{s} \geq s_{v}$ : the total vaccination rate s_v is in fact a lower bound for the partial vaccination rate $s_{v}^{s}$ of the non-infected (s) part of the population after a (first) wave of infections has passed. Since infection removal is considered to leave an individual with full immunity, a potential next wave (due to new pockets of active infections after the fade-out of the preceding wave) will spread exclusively among the members of the non-infected (and therefore still susceptible) part of the population. To describe the dynamics of this next (second) wave we introduce, in addition to $s_{v}^{s}$ , the fraction $s_{s}^{n}$ of non-vaccinated (n) individuals among the susceptible part of the population after the first wave (nb: $s_{v}^{s} + s_{n}^{s} = 1$ ). We also introduce $〈 s_{x y}^{s} 〉$ , representing the average number of nodes of type x linked to a node of type y on the network formed by the nodes still uninfected (susceptible) after the first wave (i.e. before the second wave starts due to newly introduced active infections). After the emergence of new active infections, the node types can be: vaccinated (v), non-vaccinated (n), active infection (i), susceptible (s), and removed infection (r).

Now, let s_e be the final value of s and $〈 s_{s r} 〉 = ν (1 - ξ_{e})$ when the first wave has come to a halt and just before the start of the second wave [compare with (6.27a)]. As shown in the previous section, the value of $〈 s_{s s} 〉$ is then given by [compare with (6.29b)]:

〈 s_{s s} 〉 = ν - 〈 s_{r s} 〉 = ν (1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}}) .

(7.19)

We also have (see Appendix 2):

s_{v}^{s} 〈 s_{v v}^{s} 〉 + s_{n}^{s} 〈 s_{v n}^{s} 〉 = ν^{'} s_{v}^{s},

(7.20a)

s_{v}^{s} 〈 s_{n v}^{s} 〉 + s_{n}^{s} 〈 s_{n n}^{s} 〉 = ν^{'} s_{n}^{s},

(7.20b)

where $ν^{'} = 〈 s_{s s} 〉$ , and as such directly follows from (7.19) as:

ν^{'} = ν (1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}}) .

(7.21)

The left-hand parts of (7.20a,b) can be interpreted, respectively, as the average number of v-nodes linked to an s-node (7.20a) and the average number of n-nodes linked to an s-node (7.20b). Since we assume that the small number of new active infections is distributed randomly among the members of the susceptible part of the population, these new active infections will therefore be linked, on average, to $ν^{'} s_{v}^{s}$ vaccinated susceptibles and to $ν^{'} s_{n}^{s}$ non-vaccinated susceptibles. Hence,

〈 s_{v i}^{s} 〉 = ν s_{v}^{s} (1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}}) + \sum_{k = 1}^{\infty} α_{k}^{v} {(s - s_{e})}^{k}

(7.22a)

〈 s_{n i}^{s} 〉 = ν s_{n}^{s} (1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}}) + \sum_{k = 1}^{\infty} α_{k}^{n} {(s - s_{e})}^{k}

(7.22b)

with s representing the cumulative infection rate over both the first and second wave combined. The rate of change of the active infections is given by

\dot{s_{i}} = s_{i} (\frac{p_{i}}{ν} {〈 s_{n i}^{s} 〉 + (1 - ϵ) 〈 s_{v i}^{s} 〉} - p_{r}) .

(7.23)

Substitution of (7.22a), (7.22b) and $s_{n}^{s} = 1 - s_{v}^{s}$ into (7.23) yields, after some rearrangements,

\dot{s_{i}} = s_{i} (p_{i} {(1 - ϵ s_{v}^{s}) (1 + \frac{s_{e} (ξ_{e} - 1)}{1 - s_{e}}) + \frac{1}{ν} \sum_{k = 1}^{\infty} β_{k}^{s} (ϵ) {(s - s_{e})}^{k}} - p_{r}),

(7.24)

where $β_{k}^{s} (ϵ) = α_{k}^{n} + (1 - ϵ) α_{k}^{v}$ .

Demanding $\dot{s_{i}} < 0$ for $s = s_{e}$ , the criterion for herd-immunity is straightforwardly obtained from (7.24) as:

{\frac{(1 - ϵ s_{v}^{s}) (1 + s_{e} (ξ_{e} - 2))}{1 - s_{e}}} \cdot \frac{p_{i}}{p_{r}} < 1 .

(7.25)

That is, we will have herd-immunity when:

s_{v}^{s} > \frac{1}{ϵ} ({\frac{1 - s_{e}}{1 + s_{e} (ξ_{e} - 2)}} \cdot \frac{p_{r}}{p_{i}} - 1) .

(7.26)

A generalization of this result to the case of multiple vaccines with different ϵ is rather straightforward. Since $s_{v}^{s} > s_{v}$ , the right-hand part of this inequality can be considered as a critical value of the global vaccination rate s_v beyond which herd-immunity is assured.

The network structure manifests itself in this case through the values of s_e and $ξ_{e}$ , and is therewith a decisive factor in the achievement of herd-immunity, with a direct influence on the herd-immunity threshold. As a consequence, problems may thus arise similar to those outlined in the previous section in connection with the achievement of spontaneous herd-immunity. A fade-out of the number of active infections under a regime of restrictive measures, even when combined with a vaccination campaign, is not a guarantee that herd-immunity is being achieved. Lifting the restrictive measures to regain a regime of social normality may be accompanied by a new rise in the active-infection numbers to such an extent that even a new wave of infections cannot be ruled out in advance. Everything will depend on the values of $ξ_{e}^{'}$ and $p_{i}^{'}$ that replace ξ_e and p_i in the new regime entered after lifting the restrictions. If (7.26) is not met for $ξ_{e}^{'}$ and $p_{i}^{'}$ then a new wave of infections is inevitable upon lifting restrictions, despite the vaccinations administered so far (which will simply be too low in number for the establishment of herd-immunity in such a case). This has important consequences for efforts to prevent epidemic outbreaks by means of vaccination. A prophylactic vaccination campaign will provide the herd-immunity it is aiming at only when the resulting vaccination rate exceeds the herd-immunity threshold for a situation of social normality. If the latter is not the case, a transition from a regime of social restrictions to social normality (or a milder regime) may be followed by (significant) increases in the infection numbers, despite the vaccination campaign and an apparent fade-out of the infection rates prior to the moment of rolling back the restrictions.

It may also be clear that waning immunity poses a potential risk also in connection with vaccine-acquired (herd-)immunity, in the same way as it does in cases where immunity is a result of prior infection. As in the latter cases, vaccine-acquired immunity may also become less with time, so that vaccinated individuals become more vulnerable to infection (i.e. less immune). Such a situation is mathematically equivalent to a reduction of the vaccine-efficiency ϵ. The implications of such a scenario for the immunity of the population at large against epidemic outbreaks are evident, and can be illustrated graphically, and in quite an instructive way as such, via Fig. 14a and b. For that purpose we consider a population for which herd-immunity has been established through vaccination beyond the critical vaccination rate for the specific combination of infection characteristics (i.e. $p_{i} / p_{r}$ ) and (initial) vaccine efficiency (ϵ) involved. The point $(p_{i} / p_{r}, ϵ)$ representing this combination must therefore be in the grey-shaded area in Fig. 14a (which relates to the combinations of $p_{i} / p_{r}$ and ϵ for which herd-immunity can be obtained, provided the vaccination rate exceeds its critical threshold for the combination involved). The critical threshold is given by the particular value of s_v for which the corresponding contour line in Fig. 14b goes through the point $(p_{i} / p_{r}, ϵ)$ . It can be seen directly from Fig. 14b now that upon decreasing ϵ at a given value of $p_{i} / p_{r}$ , the value of the critical threshold necessary to maintain a state of herd-immunity increases. That means that upon a continuing decrease of ϵ (starting in a state of herd-immunity), a situation will finally be reached where the critical vaccination threshold exceeds the actual vaccination rate and herd-immunity will be lost.

8. Percolation

8.1 The percolation transition and its relevance in the context of vaccination

So far, herd-immunity has been presented purely as a consequence (or, merely, a side-effect) of infection removal, even when network effects are involved. However, an additional independent mechanism for herd-immunity is brought about in the form of percolation by the network structure and topology typical of populations. Percolation on a lattice or network can be understood as the formation of paths along nodes of a particular type, or as the formation of (isolated) clusters of nodes of a certain type (either enclosed by nodes of a different type or cut-off from the rest of the network). The formation of paths or clusters can be the result either of (random) removal/replacement of nodes or the (random) removal of links/bonds. The first case is referred to as site-percolation whereas the second case is called bond-percolation (see Ref. [5] for a basic but detailed outline of concepts and theory). Percolation phenomena play a role in many branches of the natural sciences and technology, ranging from solid-state physics (magnetic dilution) and chemistry (polymerization) to electrical engineering (random electrical networks). It is by the very nature of the problem that the relevance of percolation appears almost self-evident in the context of epidemic infection growth as well. Surprising it is therefore that the subject has been given fairly little attention in the epidemiological literature, despite the fact that it has been demonstrated that the percolation paradigm has its (potential) merit for the field [e.g. through the analysis by Davis et al. [9] of the spread of yersinia pestis (plague) among populations of great gerbils]. However, a conceptually simple phenomenon at first glance, percolation is a notoriously difficult subject for mathematical analysis. Despite the fact that seminal results have been achieved during the 1950s and onwards (see Ref. [42] for an in-depth review), specific problems often defy solution by analytical means and can only be dealt with through the use of computational methods [in particular (statistical) simulations]. Therefore, the emphasis of this section will, out of necessity, for a significant part be on computational results.

Percolation seems a particularly relevant concept in relation to (random) vaccination, especially when, as from now on, an ‘ideal’ vaccine is considered with 100% efficiency [i.e. ϵ = 1 (see previous section)]. Nodes in the population network are randomly immunized and are no longer susceptible to infection. They can no longer become infected and, equally important, they can no longer pass on the infection to other (susceptible) nodes. They are, so to say, ‘inert’ nodes in the network, in contrast to the ‘active’ nodes which are either already infected or still susceptible to infection. It is easy to see that this situation corresponds, in fact, to nothing less than a genuine case of site-percolation.

An essential phenomenon to be considered now is the so-called percolation transition. A general feature of both bond- and site-percolation on a lattice or network, the percolation transition marks a sharp change, upon increasing the number of inert nodes, between a regime where a majority of the active nodes forms a ‘macroscopic’ cluster of proportions comparable to those of the entire network, to a regime where the active nodes are split up in clusters of much smaller dimensions (of, for instance, no more than a few nodes). The critical value x_c of the fraction of inert nodes in the network at which the transition takes place is commonly referred to as the percolation threshold. A phase transition in the true physical and thermodynamic sense, the percolation transition comes with all the typical characteristics of a thermodynamic phase transition, such as universality and scaling invariance (see Ref. [5], Section 7). Its relevance to the problem of epidemic infection growth and herd-immunity is evident. Below the percolation threshold, the infection is easily passed on throughout the entire population network. Even when the initial number of active infections is low, a vast number of nodes may eventually be reached by the infection. However, for values of x_v above the percolation threshold, the infection chains sooner or later run into an inert node that blocks any further propagation of the infection along that particular chain. Especially when the number of initial infections is very low, only a (very) minor fraction of the nodes will be reached by the infection and the number of accumulative infections will remain low.

The role of percolation in an epidemiological setting and its relation to herd-immunity can be demonstrated quite well via carefully thought-out simulations of the evolution of an epidemic on a simplified network. A convenient choice for such a network is the 2D square lattice already introduced in Section 4, with the nodes representing the individual members of the network. Links between nodes, representing the possibility of contact and, inherently, a route of infection transmission, can be chosen in any arbitrary way in order to simulate the effects of differences in the size and structure of the social networks of the individual members of the population. A particular benefit of such simulations is that different mechanisms can be ‘turned on and of’ at will by an appropriate choice of their corresponding parameters, thus enabling a targeted investigation of their particular role and influence (or those of other mechanisms).

To separate the influence of infection removal from that of percolation phenomena, the evolution of four epidemics was simulated for different values of the rate of (random) vaccination x_v on a 2D square lattice of 2001 × 2001 nodes while putting p_r = 0 (i.e. no infection removal). Periodic boundary conditions were again applied. The initial states of the population t = 0 at the start of each epidemic were constructed by randomly labelling nodes as vaccinated until the desired vaccination rate was reached, followed by a random selection of non-vaccinated nodes to be labelled as active infections, thus providing the ‘seeds’ for the epidemic.

Figures 15a–d shows the end-status of the nodes in the population after each epidemic has come to a halt. In these simulations, nodes were considered to be linked only to their direct nearest neighbours, a situation resembling the conditions under a (very) strict lock-down. Nodes are labelled red when infected, black when vaccinated and white when still uninfected. In cases like these, where there is no infection removal, the infection spreads through the entire cluster of susceptible nodes surrounding each initial (seed) infection until the boundary of the cluster (formed by vaccinated nodes) is reached (i.e. when there actually is such a boundary instead of a continuous cluster). The fragmentation of the bulk cluster of infected (red) nodes into clusters of ever smaller size upon increasing x_v is clearly visible. The results shown are consistent with the value of the site-percolation threshold of a 2D square lattice of $x_{c} \approx 0.41$ as reported in the literature for the case of random blocking of sites [43].

Figure 15: — End-status (after fade-out of the epidemic) of the nodes in a model-population consisting of a 2D square lattice with nearest-neighbour contacts for different rates (*x_v*) of random vaccination. The different nodes types are distinguished by the colour of the square unit cell that surrounds them (red: infected nodes, black: vaccinated nodes and white: susceptible nodes). (a) $x_{v} = 0.35$ . (b) $x_{v} = 0.40$ . (c) $x_{v} = 0.425$ . (d) $x_{v} = 0.45$ .

To allow for a more detailed impression of the effect of vaccination at the level of the individual nodes, enlarged smaller sections of the respective network end-states shown in Figs. 15a–d are represented in Figs. 16a–d. It is clearly recognizable how, with increasing x_v, more and more paths along susceptible nodes become interrupted by vaccinated nodes, even to the level that susceptible nodes and actually entire clusters of susceptible nodes become fenced-in by a closed ‘ring’ or even a cluster of vaccinated nodes, thus shielding the susceptible nodes involved from active infections outside the cluster. Only infections from inside such enclosed clusters of susceptibles may lead to a spread of the infection to other members of the cluster. When p_r = 0, eventually all the nodes in a susceptible cluster will become infected in the end. But, whereas below the percolation threshold x_c this implies that a majority if the non-vaccinated part of the population (if not the entire part) will become infected, only a small (possibly negligible) minority of the non-vaccinated individuals will become infected when x_v > x_c, provided that the number of initial infections is (sufficiently) low.

Figure 16: — Close-up of the model populations shown in Fig. 15a and d (red: infected nodes, black: vaccinated nodes and white: susceptible nodes). (a) $x_{v} = 0.35$ . (b) $x_{v} = 0.40$ . (c) $x_{v} = 0.425$ . (d) $x_{v} = 0.45$ .

In a partially vaccinated population, each initial infection will ‘land’ in one of the remaining clusters of (non-vaccinated) susceptibles. There it will start transmitting the infection via its social contacts, thus initiating the spread of infection through the cluster. However, once an entire cluster of susceptibles has become infected, further propagation of the infection will stop at the cluster boundaries formed by the closed ring of vaccinated nodes. Below the percolation threshold (x_v < x_c), cluster sizes (from here onwards defined as the number of nodes in each cluster) are quite large (of the same order of magnitude as the size of the population) so that the final (cumulative) number of infections will be large too. Beyond the percolation threshold however (x_v > x_c), cluster sizes are modest or even very small. Since the infection will be limited only to those clusters embracing one or more initial infections, the final number of infections will remain low when the number of initial infections is low.

In view of these considerations, we expect the cumulative number of infections at the end of an epidemic to scale with a properly weighted average $〈 S_{c} 〉$ of the size S_c of the clusters of susceptibles directly after vaccination (before the start of the epidemic). An appropriate and meaningful choice for such an average is obtained by taking the average over the fractions of the susceptible part of the population accounted for by the individual clusters:

〈 S_{c} 〉 = \sum_{c} \frac{S_{c}}{n_{s}} \cdot S_{c} = \frac{1}{n_{s}} \sum_{S_{c} = 1}^{n_{s}} m (S_{c}) \cdot S_{c}^{2},

(8.1a)

where the first summation runs over all susceptible clusters (c), with n_s representing the total number of susceptible nodes in the population, whereas the second summation runs over all cluster sizes, with $m (S_{c}) \in N$ representing the actual number of clusters of size S_c present in the population network (so that $m (S_{c}) \cdot S_{c}$ is the total number of nodes belonging to clusters of size S_c). A quantity of even more significance is obtained when the cluster sizes S_c and their average $〈 S_{c} 〉$ are themselves considered relative to the total number $n_{s} = n (1 - x_{v})$ of susceptible nodes at a given x_v. That is, when we introduce the relative cluster size $S_{c} / n_{s}$ and its average $〈 S_{c} 〉 / n_{s}$ . Being equal to 1 in case of only one single macroscopic cluster of a size comparable to the size of the population, but approaching zero in cases where there are only very small clusters of a few nodes embedded in a very large population, the relative clusters size $S_{c} / n_{s}$ can be seen as an order parameter (r_p) for percolation on a lattice or network. From (8.1a), we immediately get:

r_{p} = \frac{〈 S_{c} 〉}{n_{s}} = \sum_{c} {(\frac{S_{c}}{n_{s}})}^{2} .

(8.1b)

Figure 17a shows $〈 S_{c} 〉 / n_{s}$ as a function of x_v for a 2D square lattice with nearest-neighbour contacts. Numerical data were obtained by generating a randomly vaccinated population for a series of x_v throughout the entire interval $0 < x_{v} < 1$ and subsequent application of a computational algorithm for cluster identification to each population, thus providing the necessary input for calculating $〈 S_{c} 〉$ and $〈 S_{c} 〉 / n_{s}$ (the cluster-identification algorithm being loosely based on the algorithm described in Ref. [5], p. 171ff). The result is typical of a system showing a percolation transition. Below $x_{v} = x_{c} \approx 0.41$ , the average relative cluster size is close to unity. When x_v approaches $x_{v} = 0.41$ , a gradual decrease in $〈 S_{c} 〉 / n_{s}$ sets in that culminates in a sharp drop at $x_{v} = x_{c} \approx 0.41$ marking a transition to a regime marked by small clusters (of even negligible relative size) at higher values of x_v.

Furthermore, the fact that percolation transitions (including the one shown in Fig. 17a) are true phase transitions in a statistical physical sense is reflected in Figure 17b, showing the standard deviation $\sqrt{〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉} / n_{s}$ of the relative cluster size as a function of x_v. With $\sqrt{〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉} / n_{s}$ being a measure for the (average) size of the ‘fluctuations’ in $S_{c} / n_{s}$ , its behaviour as a function of x_v is typical of physical systems undergoing a phase transition of so-called second order, as reflected in the distinctive lambda-shape of the curve in Figure 17b, with a clear peak at x_v = x_c that also marks a discontinuity in the derivative.

8.2 The correlation length

In the (modern) theory of phase transitions, the size of the fluctuations is intimately related to the correlation length [5], as such that the increase and divergence of the fluctuations (and other quantities) upon approaching the percolation transition is directly connected to a divergence of the correlation length.

For percolation phenomena, the correlation length ξ is defined (see [5], p. 64) as an appropriate average of a typical measure of length R_c of the isolated finite clusters (the large infinite ‘bulk-cluster’ being excluded from the average):

ξ^{2} = \sum_{c^{'}} \frac{S_{c^{'}}}{{n^{'}}_{s}} \cdot R_{c^{'}}^{2},

(8.2a)

where the index $c'$ runs over the finite clusters and $n'_{s}$ represents the total number of susceptible nodes contained in the finite clusters. Assuming only one single bulk cluster and representing its size by S₀, we can write $n'_{s} = n_{s} - S_{0}$ . With $R_{c'} \sim \sqrt{S_{c'}}$ , we can thus rewrite (8.2a) as:

ξ^{2} \sim \sum_{c^{'}} \frac{S_{c^{'}}}{n_{s} - S_{0}} \cdot S_{c^{'}} .

(8.2b)

Calculation of the correlation length from computer simulations of lattices (or networks) randomly filled with susceptible and vaccinated nodes comes with a conceptual problem however. The culprit in this is the bulk cluster. In an infinite lattice, the bulk cluster is infinitely large too. In contrast, lattices simulated on a computer can never be infinite. The amount of available memory imposes an (absolute) upper bound upon their size, and even for sizes significantly lower than the largest size allowed for by the available memory computation times may become impractically long. Hence, computationally simulated lattices and networks are of finite and considerably limited size, and so are the clusters on them to be identified as simulated bulk clusters. The actual core of the problem here resides in the identification of these bulk clusters. Simply taking the largest cluster size for the size of the bulk cluster will not work, since there is always a largest cluster size below as well as above the percolation threshold. A constraint that the largest cluster must be ‘very large’ in order to qualify as bulk clusters will simply replace one problem for another, since ‘very large’ is a highly arbitrary qualification and therewith a complication in itself. Calculation of the average of the (squared) fluctuation size does not come with such difficulties however, as it involves a summation over all clusters, including the bulk cluster (hence, the index c instead of $c'$ ):

〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉 = \sum_{c} \frac{S_{c}}{n_{s}} \cdot {(S_{c} - 〈 S_{c} 〉)}^{2} .

(8.3)

It would therefore be highly significant if we could indeed (and generally) relate every peak in $\sqrt{〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉}$ to a sharp increase (or divergence) in the correlation length, no matter the context or case in which such a peak emerges. Using somewhat pragmatic arguments, it can be shown that this is actually the case.

First of all, note that $〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉 = 〈 S_{c}^{2} 〉 - {〈 S_{c} 〉}^{2}$ . Via its definition, $〈 S_{c}^{2} 〉$ can be expressed as:

〈 S_{c}^{2} 〉 = \sum_{c} \frac{S_{c}}{n_{s}} \cdot S_{c}^{2} = \frac{1}{n_{s}} \sum_{c} S_{c}^{3},

(8.4)

whereas $〈 S_{c} 〉$ follows directly from (8.1a). We fairly assume that there is only one single bulk cluster. We can then reexpress (8.1a) and (8.4) as:

〈 S_{c} 〉 = \frac{S_{0}^{2}}{n_{s}} + \frac{1}{n_{s}} \sum_{c^{'}} S_{c^{'}}^{2}

(8.5a)

〈 S_{c}^{2} 〉 = \frac{S_{0}^{3}}{n_{s}} + \frac{1}{n_{s}} \sum_{c^{'}} S_{c^{'}}^{3}

(8.5b)

with $c'$ running over all non-bulk clusters. Based on considerations similar to those that led us to a scaling relation between $〈 S_{c'} 〉$ and ξ [as expressed by (8.2b)] we expect that $〈 S_{c'}^{2} 〉 \sim ξ^{4}$ . After all S_c ( $S_{c'}$ ) can be considered as the square of a characteristic length of a cluster c ( $c'$ ) and ξ by definition as the average of such a length. Hence, the following scaling relationship applies to the sum in (8.5b):

\frac{1}{n_{s}} \sum_{c^{'}} S_{c^{'}}^{3} = \frac{n_{s} - S_{0}}{n_{s}} \sum_{c^{'}} \frac{S_{c^{'}}}{n_{s} - S_{0}} \cdot S_{c^{'}}^{2} = \frac{n_{s} - S_{0}}{n_{s}} \cdot 〈 S_{c^{'}}^{2} 〉 \sim ξ^{4} .

(8.6)

Note that $0 \leq (n_{s} - S_{0}) / n_{s} \leq 1$ . Restating the scaling relationships (8.2b) and (8.6) as:

\frac{1}{n_{s}} \sum_{c^{'}} S_{c^{'}}^{2} = a ξ^{2} \frac{1}{n_{s}} \sum_{c^{'}} S_{c^{'}}^{3} = b ξ^{4}

(8.7)

(with a and b in the order of unity) we can eventually rewrite Equations (8.5a) and (8.5b) as:

〈 S_{c} 〉 = \frac{S_{0}^{2}}{n_{s}} + a ξ^{2}

(8.8a)

〈 S_{c}^{2} 〉 = \frac{S_{0}^{3}}{n_{s}} + b ξ^{4}

(8.8b)

from which we obtain, via direct substitution and some minor rearrangements:

⟨ S_{c}^{2} ⟩ - {⟨ S_{c} ⟩}^{2} = n_{s}^{2} {(\frac{S_{0}}{n_{s}})}^{3} [1 - \frac{S_{0}}{n_{s}}] - 2 n_{s} a {(\frac{S_{0}}{n_{s}})}^{2} ξ^{2} + (b - a^{2}) ξ^{4} .

(8.9)

The ratio $r = S_{0} / n_{s}$ in (8.9) is generally referred to as the percolation order parameter in the literature (see Ref. [5], p. 151). It equals 1 for complete percolation ( $S_{0} = n_{s}$ ) and vanishes when the bulk cluster collapses at the percolation transition. It is closely related to the order parameter r_p defined via (8.1b). The first term on the right-hand side of (8.9) can be expressed as $n_{s}^{2} f (r)$ , with f(r) a function of r given by $f (r) = r^{3} (1 - r)$ . This function has two roots, respectively, at r = 0 and r = 1, and a local maximum at r = 3/4. Upon approaching the percolation transition, S₀ decreases more and more towards zero, and with it also r and f(r). As a result, the first term on the right-hand side of (8.9) decreases towards zero upon approaching the percolation transition (instead of increasing or even diverging). In addition, it is also straightforward to see that the second term in (8.9), quadratic in ξ, can be neglected against the term 4th-order in ξ in (8.9).

Sufficiently close to the percolation transition we can therefore write:

〈 S_{c}^{2} 〉 - {〈 S_{c} 〉}^{2} \approx (b - a^{2}) ξ^{4}

thus obtaining:

1 / n_{s} \sqrt{〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉} \sim ξ^{2}

(8.10)

for the average size of the fluctuations represented by the standard deviation of the cluster size. Hence, a sharp increase or divergence in $\sqrt{(⟨ {(S_{c} - ⟨ S_{c} ⟩)}^{2} ⟩} / n_{s})$ upon approaching the percolation threshold indeed relates to an increase or divergence in the correlation length ξ (and vice versa). A divergence in the correlation length is considered to be the quintessential feature of the critical phenomena that go with second-order phase transitions. Therefore, the lambda-shaped peak in Fig. 17b can be considered as a direct manifestation of the very nature of the percolation transition, which is that of a second-order phase transition. It is emphasized however that (8.10) is quite a general result and therefore its use is not limited to phenomena entirely driven by percolation (as we will see in the next section).

8.3 The vaccination-induced herd-immunity threshold as the critical point of a second-order phase transition

As mentioned earlier, the benefit of numerical simulations is that mechanisms can be switched on and off, so that their role and influence can be investigated separately. By taking p_r = 0 in the simulations, the influence of vaccination and the role of vaccination-related percolation phenomena in the evolution of an epidemic can be isolated and illustrated (as shown in Figs 15a–d and 16a–d in section 8.1).

Now, let n_e represent the final number of cumulative infections after an epidemic has come to a halt and $n_{s} = n_{p} (1 - x_{v})$ the number of susceptible nodes left after vaccination. Figure 18a shows, for p_r = 0, the relative rate $n_{e} / n_{s} = s_{e} / (1 - x_{v})$ of cumulative infections at the end of an epidemic as a function of the rate of random vaccination x_v for the 2D square lattice with nearest neighbour interactions under consideration.⁹ Data were obtained by (again) generating a randomly vaccinated population for given x_v followed by a random labelling of a fixed number of sites as active infections to serve as seeds for the epidemic, and then letting the epidemic spread until it has faded out and come to a halt. We thereby assume that vaccinated nodes are fully immune to infection. The propagation of the infection itself is simulated in the same way as in the simulations presented in Section 4, namely by repetitive random selection of nodes, checking whether a selected node is infected, randomly selecting one of its nearest neighbours, check its status (s, i or v) and turn it into an active infection as well when it is found to be susceptible and a generator of pseudo random numbers outputs a number lower than the given transmission probability w_i. The number used in the simulations of the initial infections (seeds) was taken to be $n_{0} = 10^{3}$ . Figure 18b shows the data on $s_{e} / (1 - x_{v})$ vs. x_v shown in Fig. 18a together with the relative size $〈 S_{c} 〉 / n_{s}$ of the susceptible clusters after vaccination and prior to the epidemic. The obvious similarity between the curves in Fig. 18b in both a qualitative and quantitative sense cannot be overlooked and is in agreement with the conjecture that $n_{e} / n_{s}$ scales with $〈 S_{c} 〉 / n_{s}$ . This conjecture can also be made plausible through simple though somewhat crude arguments. Consider the population of susceptible nodes to be split-up into N_c clusters of different sizes (S_i) according to a cluster-size distribution of some kind (i.e. prior to the introduction of the initial infections). The number of initial infections n₀ is small but large enough for the initial infections to ‘sample’ the cluster-size distribution when they are randomly distributed over the susceptible nodes. The total size S of all clusters together is equal to the number of susceptible nodes n_s:

Figure 18: — (a) Relative rate of cumulative infections $n_{e} / n_{s} = s_{e} / (1 - x)$ vs. *x_v* and (b) relative rate of cumulative infections $n_{e} / n_{s} = s_{e} / (1 - x)$ vs. *x_v* (black triangles/solid line) compared with the relative average cluster size $〈 S_{c} / n_{s} 〉$ vs. *x_v* (open circles/dashed line). (a) $n_{e} / N_{s}$ vs. *x_v*. (b) $〈 S_{c} 〉 / N_{s}$ and $n_{e} / N_{s}$ vs. *x_v*.

S = n_{s} = \sum_{i = 1}^{N_{c}} S_{i},

(8.11a)

where the summation runs over all clusters (and N_c represents the total number of clusters). The total number of initial infections is equal to the sum of all the occupation numbers¹⁰ $n_{0, i}$ of the individual clusters at t = 0 (note that, by definition, $0 \leq n_{0, i} \leq S_{i}$ ):

n_{0} = \sum_{i = 1}^{N_{c}} n_{0, i} .

(8.11b)

We introduce ${n_{0, i}} = {n_{0, 1}, n_{0, 2}, n_{0, 3} \dots n_{0, N}}$ as the set of occupation numbers of the clusters. For a given set of occupation numbers (i.e. for a given distribution of initial infections), the cumulative number of infections at the end of an epidemic $n_{e} \equiv n_{{n_{0, i}}}$ can be written as (remember that all clusters containing one or more initial infections will become infected):

n_{{n_{0, i}}} = \sum_{i = 1}^{N_{c}} n_{e, i} = \sum_{i = 1}^{N_{c}} (1 - δ_{0, n_{0, i}}) S_{i} .

(8.12)

Here, $n_{e, i}$ represents the number of cumulative infections in the ith cluster at the end of the epidemic and $δ_{0, n_{0, i}}$ the Kronecker-delta:

δ_{0, n_{0, i}} = {\begin{matrix} 1 & n_{0, i} = 0 \\ 0 & n_{0, i} \neq 0 \end{matrix} .

(8.13)

We now introduce ${〈 S_{i} 〉}_{{n_{0, i}}}$ as the average, for given ${n_{0, i}}$ , of the size of the cumulatively infected clusters over the number of cumulatively infected nodes at the end of the epidemic:

{〈 S_{i} 〉}_{{n_{0, i}}} = \sum_{i = 1}^{N_{c}} \frac{(1 - δ_{0, n_{0, i}}) S_{i}}{n_{{n_{0, i}}}} \cdot S_{i}

(8.14a)

which can be reexpressed, by substitution of (8.12) for $n_{{n_{0, i}}}$ , as:

{〈 S_{i} 〉}_{{n_{0, i}}} = \frac{\sum_{i = 1}^{N_{c}} (1 - δ_{0, n_{0, i}}) S_{i}^{2}}{\sum_{i = 1}^{N_{c}} (1 - δ_{0, n_{0, i}}) S_{i}} .

(8.14b)

Since the initial infections sample the cluster-size distribution (when sufficiently large in number), the average ${〈 S_{i} 〉}_{{n_{0, i}}}$ can be considered as a fair approximation of the previously introduced average $〈 S_{c} 〉$ of the size of the susceptible clusters after vaccination and prior to the epidemic. Hence, an approximation for the ratio $s_{e} / (n_{s}^{- 1} 〈 S_{c} 〉)$ of the normalized infection rate $s_{e} = n_{e} / n_{s} = n_{{n_{0, i}}} / n_{s}$ and the normalized average cluster size $〈 S_{c} 〉 / n_{s}$ easily follows. Combining (8.12) and (8.14b) yields

\frac{s_{e}}{n_{s}^{- 1} 〈 S_{c} 〉} = \frac{n_{{n_{0, i}}}}{{〈 S_{i} 〉}_{{n_{0, i}}}} = \frac{{(\sum_{i = 1}^{N_{c}} (1 - δ_{0, n_{0, i}}) S_{i})}^{2}}{\sum_{i = 1}^{N_{c}} (1 - δ_{0, n_{0, i}}) S_{i}^{2}}

so that, via some algebraic rearrangements, we get:

\frac{s_{e}}{n_{s}^{- 1} ⟨ S_{c} ⟩} \approx 1 + \frac{\sum_{i = 1}^{N_{c}} \sum_{j \neq i j = 1}^{N_{c}} (1 - δ_{0, n_{0, i}}) (1 - δ_{0, n_{0, j}}) S_{i} S_{j}}{\sum_{i = 1}^{N_{c}} (1 - δ_{0, n_{0, i}}) S_{i}^{2}} .

(8.15)

At low vaccination rates, a (vast) majority of the susceptible nodes forms a large (bulk) cluster of a size comparable to the size of the population network. With increasing vaccination rates, more and more clusters separate from the bulk cluster and become isolated. However, the size of these ‘secondary’ clusters is much smaller (up to orders of magnitude even) than that of the bulk cluster. Note that the bulk cluster does not contribute to the numerator of (8.15) but does contribute to the nominator of (8.15). As such, the products $S_{i} S_{j}$ in the terms these smaller clusters contribute to the numerator in (8.15) will lose against the $S_{i}^{2}$ term contributed by the bulk cluster to the nominator in (8.15). In addition, due to their small size only a (small) minority of the separated clusters will include an initial infection, especially when x_v approaches the critical region near the percolation transition where the number (but not the typical size) of the separated clusters increases significantly. Moreover, the probability that two specific secondary clusters (to be identified by their indices i, j) both include an initial infection is (very) low. Hence, many of the product terms $(1 - δ_{0, n_{0, i}}) (1 - δ_{0, n_{0, j}})$ in the numerator of (8.15) will be zero over the entire range of x_v-values up to the critical region and beyond. Altogether, it strongly looks as though (8.15) can be reduced to $s_{e} / 〈 S_{c} 〉 \approx 1$ and that s_e scales indeed with the average cluster size $〈 S_{c} 〉$ .

The divergence, as a function of x_v, in the average of the fluctuation in the size $〈 S_{c} 〉$ of the susceptible clusters shown in Fig. 17b is also reflected in the average fluctuation size for the infected clusters present in vaccinated populations after an epidemic has faded out. Figure 19 shows $\sqrt 〈 {(S_{c} - 〈 S 〉)}^{2} 〉 / n_{s}$ for these infected clusters vs. x_v. The divergence clearly stands out and the behaviour of $\sqrt 〈 {(S_{c} - 〈 S 〉)}^{2} 〉 / n_{s}$ as a function of x_v is typical therewith of a phase transition. The dotted lines in Fig. 19 further substantiate this viewpoint. They mainly serve as guides to the eye, but have been calculated by adjusting the parameters $c', c ″$ , ν and x_c of the function:

Figure 19: — Average of the fluctuations in the size of the clusters of cumulative infections after fade-out of the epidemic vs. vaccination rate *x_v*. Dotted curves are guides to the eye.

f (x_{v}) = {\begin{matrix} c' | x_{v} - x_{c} |^{- ν} & x_{v} < x_{c} \\ c ″ | x_{v} - x_{c} |^{- ν} & x_{v} > x_{c} \end{matrix}

(8.16)

to the datapoints. The function $f (x_{v})$ represents a scaling law of the type typically associated with the critical phenomena complementing a second-order phase transition, with $ν$ representing a (so-called) critical exponent. The dotted lines fit the datapoints quite well, and $\sqrt 〈 {(S_{c} - 〈 S 〉)}^{2} 〉 / n_{s}$ shows therewith the appropriate scaling behaviour expected in connection with a second-order phase transition. As such, the onset of vaccination-induced herd-immunity can be perceived in itself as a phase transition of second order [especially since we previously saw that the divergence in $\sqrt 〈 {(S_{c} - 〈 S 〉)}^{2} 〉 / n_{s}$ relates to a divergence in the correlation length]. Figure 19 is based on insufficient data though to obtain an accurate estimate for x_p and $ν$ .

The notion that a critical value of an epidemic control parameter (x_v in the present case), beyond which no epidemic outbreak is possible, relates to a second-order phase transition is not entirely new and was originally advanced by Grassberger [6] on the basis of a search for critical behaviour in simulated data for some special (limiting) cases of the SIR-model, and an estimation of the related critical exponents. The significance of the approach followed in this article by looking at the variation in the standard deviation of the secondary cluster-size, and pointing out its relation to the correlation length, is that it goes deeper therewith into the physics of the phenomenon, thus providing additional insight into what actually happens from a physics perspective in a population during the spread of an infection, especially in the critical region. The simulations and the analysis presented here are also based on a more general realization of the SIR-model, where the removal of infections is a stochastic process that runs parallel to the transmission of infections, unlike in some of the cases simulated and discussed by Grassberger where, for instance, an active infection was removed as soon as it has had an opportunity to pass on its infection. Moreover, whereas Grassberger simulates an epidemic that starts from only a single initial infection in the centre of the population lattice, in the present paper a multitude of initial infections (randomly distributed over the entire population) is considered. This is a more realistic presentation of many epidemic outbreaks in the real world. In the case of Covid-19 for instance, the pathogens were often introduced into larger populations via travellers from abroad (i.e. from outside the population) spreading out in irregular patterns over the entire population (or larger segments of the population) so that the distribution of initial Covid-19 infections became more or less a random one.

8.4 Vaccination-induced percolation transitions in the framework of the SIR-model

The effects of percolation in vaccinated populations can also be dealt with in the context of the SIR-model, provided we modify the standard SIR-model a little further beyond the modifications already presented in Section 1.

Percolation phenomena directly affect the (average) number of susceptible contacts $〈 s_{s i} 〉$ of an active infection and they can be accounted for as such via the coefficients of the expansion of $〈 s_{s i} 〉$ in s. Close to the percolation threshold (i.e. to the critical vaccination rate), $〈 s_{s i} 〉$ decreases sharply with any further increase in s when $s \to s_{e}$ , since the active infections will then approach the cluster boundaries formed by the vaccinated nodes. When s = s_e, the active infections will actually reach the cluster boundaries so that $〈 s_{s i} 〉$ vanishes. For the SIR-model to be consistent with this, constraints must be imposed upon the coefficients of the truncated series expansions of $〈 s_{s i} 〉$ to be used, such that the following condition applies:

〈 s_{s i} 〉 = \sum_{n = 1}^{\infty} a_{n} s_{e}^{n} = 0 .

(8.17)

However, to properly deal with percolation phenomena and describe them correctly in terms of a modified SIR-model, some further adjustments of the model seem to be inevitable. The reason is that we have to deal with two qualitatively different regimes, respectively, below the percolation transition ( $x_{v} < x_{p})$ and above the percolation transition (x_v > x_p). Below the percolation transition the bulk cluster dominates, whereas, in the absence of a bulk cluster, the secondary clusters take over in the regime above the transition. It seems appropriate therefore, to split the clusters into a subset consisting of the cluster(s) of (equal) maximum size (which consists of a single bulk cluster for sufficiently low x_v), and a subset comprised of all other (secondary) clusters. The contributions to s (and s_e) from both respective subsets are then treated separately. For purpose of the latter, we introduce:

{〈 s_{s i} 〉}_{b} = \sum_{n = 0}^{\infty} a_{n}^{(b)} s_{b}^{n}

(8.18a)

{〈 s_{s i} 〉}_{c'} = \sum_{n = 0}^{\infty} a_{n}^{(c')} s_{c'}^{n}

(8.18b)

where ${〈 s_{s i} 〉}_{b}, {〈 s_{s i} 〉}_{c'}$ represent the average number of s nodes in contact with an i node on, respectively, the bulk (maximum-sized) cluster(s) (labelled b), and the secondary clusters (labelled $c'$ ). The variables s_b and $s_{c'}$ represent the respective occupation rates, relative to n_s, of the susceptible nodes in the bulk (maximum-sized) cluster(s), and those in the secondary clusters. Note that, by definition, a node in one cluster/subset has no contact with nodes in the other clusters/subsets, so that we can expand ${〈 s_{s i} 〉}_{b}, {〈 s_{s i} 〉}_{c'}$ in a single variable only (s_b and $s_{c'}$ , respectively).

Now, let S_b represent the combined sizes of the clusters of maximum size (i.e. the size of the single bulk cluster when existent or the sum of the nodes contained in the multiple of maximum-sized clusters) and $〈 S_{c'} 〉$ the average size of the secondary clusters:

〈 S_{c'} 〉 = \sum_{c'} \frac{S_{c'}}{n_{s, c'}} \cdot S_{c'},

(8.19)

where $c'$ runs over the secondary clusters and $n_{s, c'}$ represents the total number of susceptible nodes in the secondary clusters before the infection spreads. Note that $〈 S_{c'} 〉$ is therewith an average over the susceptible nodes in the secondary clusters only. Both S_b and $〈 S_{c'} 〉$ are functions of x_v. The number of nodes in the bulk (maximum-sized) cluster(s) is of course equal to S_b. Following Section 8.1, we also introduce the relative cluster sizes:

{\bar{S}}_{b} = \frac{S_{b}}{n_{s}} 〈 {\bar{S}}_{c'} 〉 = \frac{〈 S_{c'} 〉}{n_{s}} .

(8.20)

The total number of susceptible nodes in the network is given by

n_{s} = n_{s, b} + n_{s, c'},

(8.21)

where $n_{s, b}$ represents the number of susceptibles in the bulk cluster(s) prior to the start of the epidemic (note that $n_{s, b} = S_{b}$ actually). We can rewrite this relation in terms of the relative total sizes ${\bar{n}}_{b} = n_{s, b} / n_{s} = {\bar{S}}_{b}$ and ${\bar{n}}_{c'} = n_{s, c'} / n_{s}$ of, respectively, the bulk (maximum-sized) cluster(s) and the secondary clusters as:

{\bar{n}}_{b} + {\bar{n}}_{c'} = 1 .

(8.22)

It is easy to see that in each separate regime (below or above the percolation threshold), the values ${\bar{n}}_{b}, {\bar{S}}_{b}$ and $〈 {\bar{S}}_{c'} 〉$ will be unique for each value of x_v. Stated differently: ${\bar{n}}_{b}, {\bar{S}}_{b}$ and $〈 {\bar{S}}_{c'} 〉$ are bijective functions of x_v on the intervals $0 \leq x_{v} \leq x_{p}$ and $x_{p} < x_{v} \leq 1$ . They can therefore be considered as state variables for the subsets of clusters to which they relate. As a corollary, the coefficients $a_{n}^{(b)}$ in the series for ${〈 s_{s i} 〉}_{b}$ in (8.18a) can be expanded themselves in either ${\bar{n}}_{b}$ or ${\bar{S}}_{b}$ , just like the coefficients $a_{n}^{(c')}$ in the series for ${〈 s_{s i} 〉}_{c'}$ in (8.18b) can be expanded in $〈 {\bar{S}}_{c'} 〉$ . The variable ${\bar{n}}_{c'}$ does not qualify as a proper state variable, however, since the secondary clusters prevail above the percolation threshold (x_v > x_p) and contain all susceptible nodes, making $n_{c'} / n_{s} = 1$ for all x_v in the interval $x_{p} < x_{v} \leq x_{p}$ . Furthermore, it should be kept in mind that, since we deal with two qualitatively different x_v-regimes, each regime may require its own series expansion of the $a_{n}^{(b)}$ and $a_{n}^{(c')}$ . Expansions of $a_{n}^{(b)}$ and $a_{n}^{(c')}$ in terms of, respectively, ${\bar{n}}_{b}$ and $〈 {\bar{S}}_{c'} 〉$ seem to be the most elegant and convenient path to follow.

It is at this point where constraints of a kind similar to (8.17) come into play. First of all, we note that in case of vaccination at random:

a_{0}^{(b)} = a_{0}^{(c')} = ν (1 - x_{v}) .

(8.23)

The series expansion for ${〈 s_{s i} 〉}_{b}$ can then be written as:

{〈 s_{s i} 〉}_{b} = ν (1 - x_{v}) + \sum_{n = 1}^{\infty} a_{n}^{(b)} s_{b}^{n} .

(8.24)

When the bulk cluster is entirely infected $s_{b} = n_{s, b} / n_{s} = {\bar{n}}_{b}$ [compare with (8.17)]:

{〈 s_{s i} 〉}_{b} = ν (1 - x_{v}) + \sum_{n = 1}^{\infty} a_{n}^{(b)} {\bar{n}}_{b}^{n} = 0

(8.25)

which by introducing a function $a_{n} = a_{n} ({\bar{n}}_{b})$ , such that,

a_{n}^{(b)} = a_{n} \cdot ν (1 - x_{v})

(8.26)

can be reexpressed as:

{〈 s_{s i} 〉}_{b} = ν (1 - x_{v}) {1 + \sum_{n = 1}^{\infty} a_{n} {\bar{n}}_{b}^{n}} = 0 .

(8.27)

For this condition to be met, a power-series expansion of the function $a_{n} ({\bar{n}}_{b})$ in ${\bar{n}}_{b}$ cannot have the form of a Taylor series, since ${〈 s_{s i} 〉}_{b}$ should vanish when ${\bar{n}}_{b}^{n} = 0$ (which will be the case upon reaching the percolation threshold x_v = x_c). A Laurent series is to be used instead:

a_{n} ({\bar{n}}_{b}) = \sum_{m = - \infty}^{\infty} α_{m, n} {\bar{n}}_{b}^{m} = \sum_{m = 1}^{\infty} \frac{α_{- m, n}}{{\bar{n}}_{b}^{m}} + \sum_{m = 0}^{\infty} α_{m, n} {\bar{n}}_{b}^{m},

(8.28)

which also includes negative powers of ${\bar{n}}_{b}$ . How the values of the coefficients $α_{m, n}$ must be taken so that the resulting Laurent series for $a_{n} ({\bar{n}}_{b})$ are in accordance with (8.25) and (8.27) is shown in detail in Appendix 3 for $a_{1} ({\bar{n}}_{b})$ and $a_{2} ({\bar{n}}_{b})$ . According to that analysis, the Laurent series for $a_{1}^{(b)}$ and $a_{1}^{(b)}$ must be of the following form:

a_{1}^{(b)} = ν (1 - x_{v}) [\frac{α_{- 1, 1}}{{\bar{n}}_{b}} + \sum_{j = 0}^{\infty} α_{j, 1} {\bar{n}}_{b}^{j}] .

(8.29a)

a_{2}^{(b)} = ν (x_{v} - 1) [\frac{α_{0, 1}}{{\bar{n}}_{b}} + \frac{(1 + α_{- 1, 1})}{{\bar{n}}_{b}^{2}} + \sum_{j = 0}^{\infty} α_{j + 1, 1} {\bar{n}}_{b}^{j}] .

(8.29b)

Series expansions for the $a_{n}^{c'}$ in (8.18b) follow along similar lines. There is an issue to be considered however. To obtain the counterpart of (8.25) for the secondary clusters, we must be able to specify the total number of nodes in the subset of secondary clusters that will become infected in the absence of infection removal. We will therefore deal with this issue first.

Consider a fairly large number $n_{0} ≫ 1$ of initial infections randomly distributed among the unvaccinated nodes. The most likely distribution of these infections over the unvaccinated clusters that follows is one in which a (sufficiently large) cluster of size S_c will, on average, contain a number of $n_{0, c} = n_{0} \cdot (S_{c} / n_{s})$ initial infections. When there is a bulk cluster of unvaccinated nodes present (x_v < x_p) it will therefore almost certainly contain more than a single initial infection, since S_b is of the same order of magnitude as n_s [or even (almost) equal n_s], and $n_{0} ≫ 1$ . Without infection removal, a contribution of a total of $n_{s, b} = S_{b}$ cumulative infections due to infection of the entire bulk cluster is certain therewith.

Estimating the contribution to the cumulative infection rate from secondary clusters is a different matter however. Since secondary clusters can be very small, the equation $n_{0, c} = n_{0} \cdot (S_{c} / n_{s})$ for the most likely number of initial infections in a cluster does not necessarily apply to them. For instance, although $n_{0} \cdot (S_{c} / n_{s}) \neq 0$ , it is highly conceivable that a significant part of the smaller secondary clusters will in fact remain without any (initial) infections (especially when $n_{0} ≪ n_{s}$ ). And although, provided the distribution of cluster sizes is known, the combinatorics of the problem is tractable,¹¹ it does not provide us with concise algebraic results.

A useful alternative can be based though on the fact that $〈 {\bar{S}}_{c'} 〉$ is a state variable: the cluster distribution is unique for each x_v and so is the average (relative) cluster size. The importance of this is that the average cluster size directly affects the expectation value ${\bar{S}}_{c'}$ of the relative total size of the secondary clusters that contain at least one single initial infection. The value of ${\bar{S}}_{c'}$ is crucial, since it represents the number, relative against n_s, of all the nodes that will get infected upon the spread of the infection (when p_r = 0) in the secondary clusters. That is, it represents the end-value $s_{c', e}$ of the cumulative relative infection rate $s_{c'}$ of the secondary clusters after the epidemic has come to a halt ( $0 \leq s_{c'} \leq s_{c', e}$ with $s_{c', e} = {\bar{S}}_{c'}$ ). In addition, ${\bar{S}}_{c'}$ is also determined by $n_{0, c'}$ , the number of initial infections distributed among the nodes of the secondary clusters. We can therefore consider ${\bar{S}}_{c'}$ to be a function of both $〈 {\bar{S}}_{c'} 〉$ and $s_{0, c'} = n_{0, c'} / n_{s}$ . That is, ${\bar{S}}_{c'} = {\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉)$ . As such, we will express the coefficients $a_{n}^{c'}$ of the series expansion of ${〈 s_{s i} 〉}_{c'}$ in (8.18b) in terms of ${\bar{S}}_{c'}$ , and subsequently rework the obtained results into expressions in terms of both $〈 {\bar{S}}_{c'} 〉$ and $s_{0, c'}$ . For purpose of the latter we also need to obtain an (approximative) expression for ${\bar{S}}_{c'}$ in terms of $〈 {\bar{S}}_{c'} 〉$ and $s_{0, c'}$ .

The derivation of the coefficients of the Laurent series of the $a_{n}^{c'}$ in ${\bar{S}}_{c'}$ :

a_{n}^{(c')} \equiv a_{n}^{'} \cdot ν (1 - x_{v}) = ν (1 - x_{v}) \sum_{m = - \infty}^{\infty} α_{m, n}^{'} {\bar{S}}_{c'}^{m}

(8.30)

is somewhat tedious but similar in line to the derivation of the coefficients $a_{n}^{b}$ as a function of ${\bar{n}}_{b}$ . It basically comes down to replacing $a_{n}^{b}$ by $a_{n}^{c'}, {\bar{n}}_{b}$ by ${\bar{S}}_{c'}$ and the coefficients α_n by the coefficients $α_{n}^{'}$ related to the series expansion of the $a_{n}^{c'}$ in ${\bar{S}}_{c'}$ . A detailed outline is presented in Appendix 4. The result for $a_{1}^{c'}$ and $a_{2}^{c'}$ reads:

a_{1}^{(c')} = ν (1 - x_{v}) [\frac{α_{- 1, 1}^{'}}{{\bar{S}}_{c'}} + \sum_{j = 0}^{\infty} α_{j, 1}^{'} {\bar{S}}_{c'}^{j}]

(8.31a)

a_{2}^{(c')} = ν (x_{v} - 1) [\frac{α_{0, 1}^{'}}{{\bar{S}}_{c'}} + \frac{(1 + α_{- 1, 1}^{'})}{{\bar{S}}_{c'}^{2}} + \sum_{j = 0}^{\infty} α_{j + 1, 1}^{'} {\bar{S}}_{c'}^{j}] .

(8.31b)

For obtaining an approximative expression for ${\bar{S}}_{c'}$ in terms of $〈 {\bar{S}}_{c'} 〉$ and $s_{0, c'}$ , we reasonably assume ${\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉)$ to be a continuous function on the relevant part $0 \leq s_{0, c'} \leq n_{s, c'} \land 0 \leq 〈 {\bar{S}}_{c'} 〉 \leq 1$ of its domain, and that the Taylor series of ${\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉)$ in $〈 {\bar{S}}_{c'} 〉$ and $s_{0, c'}$ exists on this domain part, so that we can write:

{\bar{S}}_{c'} (n_{0, c'}, 〈 {\bar{S}}_{c'} 〉) = \sum_{n = 0}^{\infty} \sum_{m = 0}^{n} c_{n, m} s_{0, c'}^{n - m} {〈 {\bar{S}}_{c'} 〉}^{m} .

(8.32)

As pointed out in Appendix 5, we can fairly approximate this series as:

{\bar{S}}_{c'} \equiv c_{2, 1} s_{0, c'} 〈 {\bar{S}}_{c'} 〉 + c_{3, 2} s_{0, c'} {〈 {\bar{S}}_{c'} 〉}^{2} = c_{2, 1} s_{0, c'} 〈 {\bar{S}}_{c'} 〉 (1 + (c_{3, 2} / c_{2, 1}) 〈 {\bar{S}}_{c'} 〉) .

(8.33)

With $c = c_{2, 1} \cdot (1 + (c_{3, 2} / c_{2, 1}) 〈 {\bar{S}}_{c'} 〉)$ , substitution of (8.33) here for ${\bar{S}}_{c'}$ finally yields $a_{1}^{(c')}$ and $a_{2}^{(c')}$ in terms of $s_{0, c'}$ and $〈 {\bar{S}}_{c'} 〉$ :

a_{1}^{(c')} = ν (1 - x_{v}) [\frac{c^{- 1} α_{- 1, 1}^{'}}{s_{0, c'} 〈 {\bar{S}}_{c'} 〉} + \sum_{j = 0}^{\infty} c^{j} α_{j, 1}^{'} \cdot {(s_{0, c'} 〈 {\bar{S}}_{c'} 〉)}^{j}]

(8.34a)

and

a_{2}^{(c')} = ν (x_{v} - 1) [\frac{c^{- 1} α_{0, 1}^{'}}{s_{0, c'} 〈 {\bar{S}}_{c'} 〉} + \frac{c^{- 2} (1 + α_{- 1, 1}^{'})}{{(s_{0, c'} 〈 {\bar{S}}_{c'} 〉)}^{2}} + \sum_{j = 0}^{\infty} c^{j} α_{j + 1, 1}^{'} \cdot {(s_{0, c'} 〈 {\bar{S}}_{c'} 〉)}^{j}] .

(8.34b)

When we neglect terms in the summations over j with j > 1 (which implies that we assume ${\bar{S}}_{c'} \approx c s_{0, c'} 〈 {\bar{S}}_{c'} 〉$ ), Equations (8.34a,b) reduce to:

a_{1}^{(c')} = ν (1 - x_{v}) [\frac{c^{- 1} α_{- 1, 1}^{'}}{s_{0, c'} 〈 {\bar{S}}_{c'} 〉} + α_{0, 1}^{'} + c α_{1, 1}^{'} \cdot s_{0, c'} 〈 {\bar{S}}_{c'} 〉]

(8.35a)

and

a_{2}^{(c')} = ν (x_{v} - 1) [\frac{c^{- 1} α_{0, 1}^{'}}{s_{0, c'} 〈 {\bar{S}}_{c'} 〉} + \frac{c^{- 2} (1 + α_{- 1, 1}^{'})}{{(s_{0, c'} 〈 {\bar{S}}_{c'} 〉)}^{2}} + α_{1, 1}^{'} + c α_{2, 1}^{'} \cdot s_{0, c'} 〈 {\bar{S}}_{c'} 〉] .

(8.35b)

Furthermore, we can also neglect/drop the terms linear in $s_{0, c'} 〈 {\bar{S}}_{c'} 〉$ here, since $s_{c'}$ is positive, and $s_{c'} \leq {\bar{S}}_{c'} \approx c s_{0, c'} 〈 {\bar{S}}_{c'} 〉$ . As a result of that (8.35a) and (8.35b) then reduce to:

a_{1}^{(c')} = ν (1 - x_{v}) [\frac{c^{- 1} α_{- 1, 1}^{'}}{s_{0, c'} 〈 {\bar{S}}_{c'} 〉} + α_{0, 1}^{'}]

(8.36a)

a_{2}^{(c')} = ν (x_{v} - 1) [\frac{c^{- 1} α_{0, 1}^{'}}{s_{0, c'} 〈 {\bar{S}}_{c'} 〉} + \frac{c^{- 2} (1 + α_{- 1, 1}^{'})}{{(s_{0, c'} 〈 {\bar{S}}_{c'} 〉)}^{2}} + α_{1, 1}^{'}] .

(8.36b)

The approximative/truncated series expansions for ${〈 s_{s i} 〉}_{b}$ and ${〈 s_{s i} 〉}_{c'}$ can now readily be obtained. Combining (8.25) and (8.29a,b), we get:

{⟨ s_{s i} ⟩}_{b} = ν (1 - x_{v}) \cdot [1 + (\frac{α_{- 1, 1}}{{\bar{n}}_{b}} + \sum_{j = 0}^{\infty} α_{j, 1} {\bar{n}}_{b}^{j}) \cdot s_{b} - (\frac{α_{0, 1}}{{\bar{n}}_{b}} + \frac{(1 + α_{- 1, 1})}{n_{b}^{2}} + \sum_{j = 0}^{\infty} α_{j + 1, 1} {\bar{n}}_{b}^{j}) \cdot s_{b}^{2}] .

(8.37)

And, consistent with the condition that for $s_{c'} = {\bar{S}}_{c'} \equiv c s_{0, c'} 〈 {\bar{S}}_{c'} 〉$ (see Appendix 4):

{〈 s_{s i} 〉}_{c'} = ν (1 - x_{v}) + \sum_{n = 1}^{\infty} a_{n}^{(c')} {\bar{S}}_{c'}^{n} = 0

(8.38)

combining (8.18b) and (8.36a,b) yields for ${〈 s_{s i} 〉}_{c'}$ , up to second-order in $s_{c'}$ :

{⟨ s_{s i} ⟩}_{c'} = ν (1 - x_{v}) \cdot [1 + (\frac{c^{- 1} α_{- 1, 1}^{'}}{s_{0, c'} ⟨ {\bar{S}}_{c'} ⟩} + α_{0, 1}^{'}) \cdot s_{c'} - (\frac{c^{- 1} α_{0, 1}^{'}}{s_{0, c'} ⟨ {\bar{S}}_{c'} ⟩} + \frac{c^{- 2} (1 + α_{- 1, 1}^{'})}{{(s_{0, c'} ⟨ {\bar{S}}_{c'} ⟩)}^{2}} + α_{1, 1}^{'}) \cdot s_{c'}^{2}] .

(8.39)

With ${〈 s_{s i} 〉}_{b}$ and ${〈 s_{s i} 〉}_{c'}$ given by (8.37) and (8.39), respectively, the sets of differential equations describing the temporal evolution of the respective infection rates of the bulk cluster (s_b) and the secondary clusters ( $s_{c'}$ ) can be written as:

{\dot{s}}_{i, b} = p_{i} s_{i, b} \cdot \frac{{〈 s_{s i} 〉}_{b}}{ν} - p_{r} s_{i, b}

(8.40a)

{\dot{s}}_{b} = p_{i} s_{i, b} \cdot \frac{{〈 s_{s i} 〉}_{b}}{ν},

(8.40b)

where $s_{i, b}$ represents the contribution to the active infection rate composed of active infections in the bulk cluster, and

{\dot{s}}_{i, c'} = p_{i} s_{i, c'} \cdot \frac{{〈 s_{s i} 〉}_{c'}}{ν} - p_{r} s_{i, c'}

(8.41a)

{\dot{s}}_{c'} = p_{i} s_{i, c'} \cdot \frac{{〈 s_{s i} 〉}_{c'}}{ν}

(8.41b)

with $s_{i, c'}$ representing the contribution to the active infection rate composed of active infections in the secondary clusters. The solutions of these differential equations are controlled by the total number of initial infections $s_{0}$ , and by the ratio according to which these infections are (randomly) distributed among the sites of the bulk and the secondary clusters. Consider s₀ being split-up into a part $s_{0, b}$ , accounting for the initial infections that ‘land’ in the bulk cluster, and a part $s_{0, c'}$ that accounts for those initial infections in the secondary clusters: $s_{0} = s_{0, b} + s_{0, c'}$ . Since the bulk cluster is (very) large relative to s₀, we can write:

s_{0, b} = s_{0} \cdot {\bar{n}}_{b}

(8.42a)

which immediately leads us also to:

s_{0, c'} = 1 - s_{0, b} = s_{0} \cdot (1 - {\bar{n}}_{b}) .

(8.42b)

For any choice of parameters, and with (8.42a,b) serving as initial conditions, the pairs of differential equations (8.40a,b) and (8.41a,b) can be solved independently for, respectively, $s_{b}, s_{i, b}$ and $s_{c'}, s_{i, c'}$ via a numerical procedure, provided that n_b and $n_{c'} = 〈 S_{c'} 〉 = n_{s} 〈 {\bar{S}}_{c'} 〉$ are known as a function of x_v. The latter requirement can be fulfilled by determining the bulk-cluster size for a very large number of x_v-values between 0 and 1, thus resulting in a dataset to be used as input from which, for any value of x_v, proper estimates for n_b and ${〈 s_{s i} 〉}_{c'}$ can be obtained via interpolation (note that $n_{c'} = n_{s} - n_{b}$ ).

Figures 20a and b, respectively, show, as a function of x_v, the absolute size of the bulk cluster n_b and the absolute size of the combined secondary clusters $n_{c'}$ as obtained from simulations for a population network consisting of 2D square lattice of 751 × 751 nodes with nearest-neighbour contacts (in which case the percolation threshold is approximately $x_{c} = 0.41$ ). The variation of n_b and $n_{c'}$ with x_v clearly reflects the existence of a percolation threshold. The size of the bulk cluster decreases linearly with x_v at low vaccination rates. Upon approaching the percolation threshold $x_{v} = x_{c} \approx 0.41$ , the decrease of n_b with x_v becomes increasingly nonlinear until, as a typical feature of a percolation transition, the size of the bulk cluster vanishes completely at x_v = x_c. The variation of $n_{c'}$ with x_v is complementary to this. At low values of x_v, the combined size of the secondary clusters is almost zero. Upon approaching the percolation threshold, $n_{c'}$ starts to increase, and at x_v = x_c there is a steep jump as a corollary of the percolation threshold, which is then followed by a transition towards a regime of linear decrease at higher values that persists up to x_v = 1. Figures 20c and d, respectively, show the relative variations ${\bar{n}}_{b} = n_{b} / n_{s}$ and ${\bar{n}}_{c'} = n_{c'} / n_{s}$ of the bulk and the combined secondary cluster sizes. The figures are quite illustrative, since they show that below the percolation threshold the vast majority of the susceptibles is in the bulk cluster, whereas for x_v > x_c the susceptibles are predominantly (if not entirely) in the secondary clusters. The simulations not only provide us with the required relation between n_b and x_v necessary as input for solving the differential Equations (8.40a, b) and (8.41a,b), but also with the relation between $〈 {\bar{S}}_{c'} 〉 = n_{c'} / n_{s}$ and x_v required in that respect (since $n_{c'} = n_{s} - n_{b}$ ). The dashed curve in Fig. 21a shows the end-value s_e of the cumulative infection rate $s = s_{b} + s_{c'}$ as a function of x_v for the case p_r = 0, as obtained from (numerical) solutions of (8.40b) and (8.41b) [remember that s_i = s when p_r = 0, so that solving (8.40a) and (8.41a) for $s_{i, b}$ and $s_{i, c'}$ is not necessary in this case]. The relevant parameters $α_{n, m}, α'_{n, m}$ and c were adjusted such that the solutions of the ODE’s fit well to the s_e values directly obtained from the simulations (indicated by markers): it is obvious that the agreement is very acceptable indeed. To illustrate the influence of the bulk cluster size and the total size of the secondary clusters on s_e, Fig. 21b shows the dashed curve in Fig. 21a (obtained from the ODEs) compared with the ${\bar{n}}_{b}$ vs. x_v curve in Fig. 20c. Similarly, Fig. 21c shows the comparison of the dashed curve in Fig. 21a to the average size of the secondary clusters obtained from the simulations. It is obvious that for lower values of x_v up to the percolation threshold, s_e follows the size of the bulk cluster (Fig. 21b). At the percolation threshold, the secondary clusters take over and s_e very accurately follows the average size of the secondary clusters (Fig. 21c). This is perfectly in accordance with our expectations. As such, the results presented in Figs 20a–d and 21a–c clearly illustrate that with the extension presented in the present section, the theoretical framework outlined in Section 1 is also able to account for percolation phenomena in general, and even allows for describing the effects of percolation transitions.

Figure 20: — Variation of *n_b* (a) and $n_{c'}$ (b) with *x_v* as obtained from simulations for a 751 × 751 square lattice with nearest neighbour interactions. Dotted lines represent the total number of susceptibles. Panels (c) and (d): resulting variations of ${\bar{n}}_{b}$ (c) and ${\bar{n}}_{c'}$ (d) with *x_v* obtained from the data represented in (a) and (b), respectively.

Figure 21: — (a) End-value *s_e* of cumulative infection rate obtained from ODE vs. *x_v* ( dashed curve) compared with data from simulations (markers) for *p_r* = 0 and $n_{0} = 125$ . (b) End-value *s_e* represented by dashed curve in (a) (solid curve, left vertical axis) compared with ${\bar{n}}_{b}$ from simulations (dotted curve with markers, right vertical axis). (c) End-value *s_e* represented by dashed curve in (a) (solid curve, left vertical axis) compared with ${\bar{n}}_{c'}$ from simulations (dotted curve with markers, right vertical axis).

8.5 Percolation phenomena in unvaccinated populations

The occurrence of percolation phenomena is not restricted to (partially) vaccinated populations. Depending on the circumstances, manifestations of percolation are possible in unvaccinated populations as well. Especially in cases where the social bubble of the individuals is (strongly) reduced (reduced number of links per node if the network), percolation phenomena may play a crucial role for the patterns in which the infection spreads throughout the population, as well as for the extend to which the infection spreads. The role of percolation is expected to increase when the number of links per node decreases. As such we have on one end of the spectrum the Ising-like networks with nearest-neighbour contacts only, where the effects of percolation are the strongest, whereas on the other end we have the extreme limit where the social bubble of an individual (node) consists of the entire population. The latter case is in fact ‘percolation free’ (showing no typical percolation effects that is), since an infected node can reach any susceptible node and transmit its infection to it, no matter where this susceptible node is located within the population. One might even say that there is actually no population network in that case, and that therewith the necessary physical prerequisite for percolation phenomena (a network with only a limited number of links/contacts per node) is in fact missing. That this represents a special case indeed is reflected in the fact that, for instance, when p_r = 0 the entire population will get infected in the end in such a case. It is worth pointing out in this respect that the fact that the standard SIR-model, essentially representing a mean-field approach, is exact for this case (see Section 4) offers an interesting analogy to the physics of magnetically ordered systems where mean-field methods tend to become more and more accurate when the range of the interactions between the individual magnetic moments (spins) increases, and even become exact in the limit of infinite interaction ranges.

Due to the tendency to become more important when the number of the contact links of the nodes decreases, it may be obvious that percolation is particularly an issue in connection with lock-downs. The stronger the lock-down conditions become, and therewith the limitation of the social bubbles of individuals, the stronger the influence of percolation phenomena on the spread of the infection will be. For policy makers it is important to be aware of this, since we will see that percolation effects may cause the infection rates to behave in ways that can be quite confusing, thereby posing a risk of misinterpretation in connection with critical issues like, for instance, herd-immunity.

An important manifestation of percolation effects consists of an additional reduction, with increasing cumulative infection rate s, of the (average) number $〈 s_{s i} 〉$ of susceptible nodes that can be contacted by an active infection in the earliest stages of the epidemic [i.e. for (very) low s-values]. Typical examples of this phenomenon obtained from simulations are shown in Fig. 22a and d, where $〈 s_{s i} 〉$ is shown as a function of s for four social bubbles different in size. The social bubbles of a node thereby consist of all the other nodes in a $(2 N + 1) \times (2 N + 1)$ square surrounding the node (the node itself being at the centre of the square). The dashed lines represent $〈 s_{s i} 〉$ in the percolation-free case ( $N = \infty$ ) for which the standard SIR-model is exact. Figures 22a–d, respectively, correspond to N = 12, N = 8, N = 4 and N = 2. The parameters p_i and p_r gave been chosen such that $p_{r} / p_{i} = 0.75$ .

Figure 22: — $〈 s_{s i} 〉$ for different social-bubble sizes [ $(2 N + 1) \times (2 N + 1)$ square], dashed lines represent the percolation-free case $N = \infty$ which is covered by the standard SIR-model. (a) N = 12. (b) N = 8. (c) N = 4. (d) N = 2.

The effects of N on $〈 s_{s i} 〉$ in Fig. 22a–d are obvious, especially in the low-s regime: with decreasing N the reduction of $〈 s_{s i} 〉$ in the lower s-regime becomes increasingly stronger. This behaviour can be understood as follows.

Let s₀ be the initial infection rate at t = 0 so that for t > 0 we can write $s = s_{0} + s'$ ( $s_{0}, s' ≪ 1$ ). We consider both s₀, t (and consequently $s'$ ) to be so small that we can neglect the probability that more than 1 infection can be found within the social bubble of each node. We implicitly assume as well therewith that N is small enough to rule out any overlap between the social bubbles of the initial infections. At t = 0, each initial infection has ν susceptible nodes in its social bubble. Upon infecting one of its contacts that number reduces by 1 to $ν - 1$ . It is easy to see that the newly infected node has a similar number of susceptibles in its own social network (the node via which the infection was transmitted being the only non-susceptible one in that network). Writing the number of initial infections that has not passed on its infection at given $s'$ as $n'_{0} = n (s_{0} - s')$ [with n₀ = ns₀ being the number of initial (active) infections], so that the number of initial infections that has actually infected another node can then be written as $n_{0} - n'_{0} = n s'$ , the change in $〈 s_{s i} 〉$ upon the change $s'$ in s can be written, to a first approximation, as:

Δ 〈 s_{s i} 〉 = \frac{n'_{0} ν + 2 (n_{0} - n'_{0}) (ν - 1)}{n (s_{0} + s')} - ν = \frac{(s_{0} - s') ν + 2 s' (ν - 1)}{(s_{0} + s')} - ν = \frac{- 2 s'}{s_{0} + s'} .

(8.43)

Hence,

{(\frac{\partial 〈 s_{s i} 〉}{\partial s})}_{s = s_{0}} = \lim_{s' \to 0} \frac{Δ 〈 s_{s i} 〉}{s'} = - \frac{2}{s_{0}}

(8.44a)

{(\frac{\partial^{2} 〈 s_{s i} 〉}{\partial s^{2}})}_{s = s_{0}} = \lim_{s' \to 0} \frac{\partial}{\partial s'} (\frac{Δ 〈 s_{s i} 〉}{s'}) = \frac{2}{s_{0}^{2}}

(8.44b)

so that

〈 s_{i s} 〉 \approx 1 - \frac{2 s'}{s_{0}} + \frac{s'^{2}}{s_{0}^{2}} .

(8.45)

This is quite a remarkable result, which shows that the s-dependence ( $s'$ -dependence) of $〈 s_{i s} 〉$ for sufficiently small s₀ and $s'$ directly depends on the number of initial infections s₀, the derivative of $〈 s_{i s} 〉$ with respect to $s'$ for $s' = 0$ being given by $\partial 〈 s_{i s} 〉 / \partial s' = - 2 / s_{0}$ . Even more striking is the fact that according to (8.45), $〈 s_{i s} 〉$ should not depend on N (i.e. not on the size of the social bubble). To verify the correctness of (8.45) in general, Fig. 23 shows the variation of $〈 s_{i s} 〉$ with $s'$ in the low- $s'$ regime for N = 2 at eight different values of s₀, varying between $s_{0} = 10^{- 4}$ and $8 \cdot 10^{- 4}$ . The curves show a qualitative behaviour consistent with (8.45). Moreover, the inset shows the estimated values of $- \partial 〈 s_{i s} 〉 / \partial s'$ obtained from the slope (at very low $s'$ ) of the curves in the main figure (triangles), and the variation of $- \partial 〈 s_{i s} 〉 / \partial s' |_{s' = s_{0}}$ with s₀ to be expected on the basis of (8.45) ( $- \partial 〈 s_{i s} 〉 / \partial s' |_{s = s_{0}} = - 2 / s_{0}$ ), as represented by the solid curve. It is obvious that the agreement is excellent, so that we conclude that (8.45) is indeed a correct representation of $〈 s_{i s} 〉$ at low $s'$ . Hence, $- \partial 〈 s_{i s} 〉 / \partial s' |_{s' = s_{0}}$ is the same for all N. However, ν depends on N ( $ν = 4 N (N + 1)$ ), so that the relative differential variation of $〈 s_{i s} 〉$ with $s'$ near $s' = 0$ varies significantly with N:

Figure 23: — $〈 s_{s i} 〉$ as a function of $s'$ for different s₀ (main figure) and the corresponding values of $- \partial 〈 s_{i s} 〉 / \partial s'$ vs. s₀ (inset).

\frac{1}{ν} {(\frac{\partial 〈 s_{s i} 〉}{\partial s})}_{s' = 0} = - \frac{1}{2 s_{0} N (N + 1)} .

(8.46)

This variation becomes increasingly stronger with decreasing N. Hence, the observed trend in the reduction of $〈 s_{s i} 〉$ with N in the lower s-regimes in Fig. 22a–d.

The steep drops for sufficiently low N in $〈 s_{s i} 〉$ in the low- $s'$ regime of the kind shown in Fig. 22a–d [and explained on the basis of (8.45)] may have important consequences for the evolution of an epidemic. In cases of small social bubbles, and in the very early stage of an epidemic, the rate of change ( ${\dot{s}}_{i}$ ) of the active infections can be expressed as (see Section 1):

\dot{s_{i}} = p_{i} s_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r} s_{i} = p_{i} s_{i} [1 + \frac{1}{ν} (\frac{s'^{2}}{s_{0}^{2}} - \frac{2 s'}{s_{0}})] - p_{r} s_{i} .

(8.47)

An extremum in s_i will occur when ${\dot{s}}_{i} = 0$ . That is, when:

\frac{p_{i}}{ν s_{0}^{2}} s'^{2} - \frac{2 p_{i}}{ν s_{0}} s' + (p_{i} - p_{r}) = 0 .

(8.48)

Solving for $s'$ is nearly trivial and yields:

s'_{\pm} = s_{0} (1 \pm \sqrt{1 + \frac{ν (p_{r} - p_{i})}{p_{i}}})

(8.49)

which relates to real solutions for $s'$ when:

\frac{p_{r}}{p_{i}} \geq \frac{ν - 1}{ν} .

(8.50)

However, $s'_{+}$ is an improper solution since $s'_{+} > s_{0}$ , and any $s' > s_{0}$ is in conflict with the requirement that there is not more than 1 active infection per social bubble for (8.45) to apply (bear in mind here that $s' = s_{0}$ corresponds to a situation where each initial infection has infected exactly one contact in its bubble). We are thus left with $s'_{-}$ as the only solution that may apply, which requires not only (8.50) to be met, but also that $0 \leq s'_{-} \leq s_{0}$ . It is easy to show that the latter is the case when $p_{r} / p_{i} \leq 1$ . So $s'_{-}$ corresponds to an extremum in s_i when¹²:

\frac{ν - 1}{ν} \leq \frac{p_{r}}{p_{i}} \leq 1 .

(8.51)

It is rather straightforward to show that this extremum is actually a maximum. We have:

\overset{..}{s_{i}} = {\dot{s}}_{i} (p_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r}) + s_{i} \frac{\partial}{\partial t} (p_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r}) = s_{i} \frac{\partial}{\partial t} (p_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r}) .

Note that both ${\dot{s}}_{i}$ and the term in brackets here must be zero for an extremum in s_i [see (8.47)]. We thus obtain:

\overset{..}{s_{i}} = s_{i} \cdot \dot{s} \frac{\partial}{\partial s} (p_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r}) .

Substitution of (8.45) for $〈 s_{s i} 〉$ then yields for: $s^{'} = {s^{'}}_{-}$

{\ddot{s}}_{i} = \frac{2 p_{i} s_{i} \dot{s}}{ν s_{0}} (\frac{s'_{-}}{s_{0}} - 1)

(8.52)

which is negative since $s'_{-} < s_{0}$ (be aware that $\dot{s} > 0$ ). As such the extremum in s_i can be identified indeed as a maximum. This is a crucial observation indeed, since it implies that the number of active infections will reach its maximum shortly after the onset of the spread of the infection, when $s' = s'_{-}$ the increase $s'$ in the cumulative number of infections is even less than the number of the initial infections therewith (since $s'_{-} \leq s_{0}$ ). After reaching that maximum, s_i will then drop (sharply) again, so that the total number of cumulative infections will eventually come to a halt at a value of $s = s_{0} + s'$ which will be of the order of a few times s₀ but not more. One could therefore say that under those circumstances the epidemic comes to an early stop by percolation effects alone, which smother the epidemic well before it gained any noticeable strength. However, a scenario like this is possible only for values of $p_{r} / p_{i}$ within the range given by (8.51), and is therefore limited to cases of very small social bubbles (i.e. low values of ν). As such, the scenario may only apply to (very) tight lock-down conditions. For instance, on a square lattice with four nearest-neighbour contacts only (8.51) becomes $3 / 4 \leq p_{r} / p_{i} \leq 1$ , representing a fairly narrow window of $p_{r} / p_{i}$ -values. For $ν > 4$ , that window is even smaller and its lower bound even closer to 1. Hence, an active implementation of the scenario, in order to prevent an outbreak from developing into a full-blown epidemic, may not only require a strong reduction of the social-bubble size (a necessary requirement) but, depending on the infecting pathogen and its typical p_r-value involved, also necessitate bringing $p_{r} / p_{i}$ within the appropriate range (closer to 1) by additional protective measures specifically aimed at reducing p_i (note that p_r, as an intrinsic property of the involved pathogen, cannot be ‘engineered’).

Even in those cases where percolation phenomena do not prevent an epidemic from passing the initial start-up hurdles, their presence may have a (significant) moderating influence on the evolution of the epidemic. The reason for this is (again) the steep decrease, as shown in Fig. 22a–d, in the variation of $〈 s_{s i} 〉$ with s that occurs for lower values of N. The effect of such a decrease is a significant reduction, at low s-values already, of the rate at which the infection is transmitted for given p_i. As a result, the maximum for given p_i and p_r in the active-infection rate s_i is pushed towards lower values of s so that the epidemic will fade-out earlier and at a lower number of accumulative infections. This can be demonstrated as follows. With:

{\dot{s}}_{i} = p_{i} s_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r} s_{i}

(8.53)

the condition ${\dot{s}}_{i} = 0$ for a local extremum in s_i can be expressed straightforwardly as:

\frac{〈 s_{s i} 〉}{ν} = \frac{p_{r}}{p_{i}}

(8.54)

[note that ${\dot{s}}_{i}$ also vanishes when $s_{i} = 0$ but that case relates to $t \to \infty$ , i.e. to the fade-out of the epidemic (see Section 5.2)]. It is easily shown that the local extrema are in fact maxima by using:

{\ddot{s}}_{i} = \dot{s_{i}} (p_{i} \frac{〈 s_{s i} 〉}{ν} - p_{r}) - \frac{p_{i} s_{i}}{ν} \frac{\partial 〈 s_{s i} 〉}{\partial t} = \frac{p_{i} s_{i}}{ν} (\dot{s} \frac{\partial 〈 s_{s i} 〉}{\partial s}) .

Since $\dot{s} > 0$ for all $t$ , and $\partial 〈 s_{s i} 〉 / \partial s < 0$ for all $s$ , we get ${\ddot{s}}_{i} < 0$ for all t. The condition (8.54) therefore relates to a maximum and the active infection rate s_i will grow as long as $〈 s_{s i} 〉 / ν > p_{r} / p_{i}$ . However, as soon as (8.54) is met and s_i reaches its maximum therewith, the epidemic will go into remission and gradually fades out. The steeper the decrease of $〈 s_{s i} 〉$ with s, the lower the value of s at which (8.54) is met for given p_i and p_r will be. Consequently, it is easy to see that reducing the social-bubble size (i.e. reducing N or, generally, ν) will lead to a suppression of the epidemic itself. The root cause of this phenomenon is, at a deeper level, a percolation-related suppression of the average number of susceptible contacts $〈 s_{s i} 〉$ .

It may be obvious from (8.53) and (8.54) that also in the cases under consideration here, where the evolution of an epidemic is dominated by percolation phenomena, the ratio $r_{p} = p_{r} / p_{i}$ plays a crucial role in that evolution. All the cases that relate to the same value of r_p but to different magnitudes of p_r and $p_{i} = p_{r} / r_{p}$ are in fact equivalent. For given r_p, the only effect of variations in the magnitudes of p_r and p_i consists of a rescaling of the time-axis. For different r_p-values however, both qualitative and quantitative differences in the evolution of the epidemic are to be expected. To illustrate this, Fig. 24 shows the variation of the end-value s_e of the accumulative rate of infections as a function of $r_{p} = p_{r} / p_{i}$ , as well as the average cluster size $〈 S_{c} 〉$ normalized to the number of nodes in the population n. The data were obtained from simulations based on a population represented by a rectangular 2D square lattice of 1501 × 1501 nodes with nearest-neighbour contacts only (Ising-like case).

Figure 24: — End-value *s_e* of the accumulative infection rate (open circles/right axis) and normalized average cluster size $〈 S_{c} 〉 / n$ (dashed curve/left axis) vs. $p_{r} / p_{i}$ .

Not entirely unexpected, both the value of s_e and the average cluster size show a continuous decrease with increasing $p_{r} / p_{i}$ . An interesting feature thereby is that the average cluster size steeply declines at relatively low values of $p_{r} / p_{i}$ already, and then collapses at the (relatively low) value $r_{p} = p_{r} / p_{i} \approx 0.235$ . The value of s_e follows this tendency and bends down sharply with $〈 S_{c} 〉$ , until it reaches a point of inflection at basically the same value of r_p that marks the collapse of $〈 S_{c} 〉$ . It then begins a gradual fade-out towards zero, which is almost complete at $r_{p} = 0.5$ . The low value of r_p at which s_e reaches the inflection point and $〈 S_{c} 〉$ collapses is particularly noteworthy. In the percolation-free case without vaccination ( $ν, N = \infty$ and x_v = 0), the critical point at which s_e becomes zero (so that an epidemic spread of the infection is no longer possible) is given by r_p = 1 (see Sections 5.1 and 5.3). Furthermore, in case of vaccination, the percolation-free case requires a vaccination rate of $x_{v} \approx 0.75$ for the critical value of r_p to drop to $r_{p, c} \approx 0.235$ [the required vaccination rate follows from $p_{r} / p_{i} = 1 - x_{v}$ (see Section 7)]. In the Ising-like case, however, such a reduction of the critical value of r_p is observed without any vaccinations whatsoever. Hence, in cases of small social bubbles, percolation and infection removal apparently team-up and combine their moderating effects on the spread of an infection.

It may be obvious that the inflection point in the s_e vs. r_p curve and the collapse of $〈 S_{c} 〉$ at the same value of r_p are related and, in fact, symptoms of the same underlying phenomenon. This is further corroborated by the r_p dependence of the standard deviation of the cluster size. Figure 25 shows the r_p-dependence of $\sqrt 〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉$ corresponding to the data in Fig. 24. A divergence emerges precisely at the critical value of r_p marking the collapse of $〈 S_{c} 〉$ and the point of inflection in the s_e vs. r_p curve. The drawn curves in Fig. 25 thereby represent a scaling law of the type (8.16) and serve as a guide to the eye. Such observations unambiguously point towards a (second-order) phase transition taking place at $r_{p} \approx 0.235$ , since we have seen in Section 8.2 that a divergence in the standard deviation of the cluster size (i.e. a divergence in the fluctuation size) directly relates to a divergence in the correlation length (the quintessential feature of a second-order phase transition). Both the collapse of $〈 S_{c} 〉$ and the inflection point in the s_e vs. r_p curve are essential features of this phase transition as well.

Figure 25: — Main figure: normalized standard deviation $\sqrt{〈 (} S_{c} - 〈 S_{c} 〉)^{2} 〉 / n$ of the cluster size as a function of $p_{r} / p_{i}$ . Inset: close-up of the critical region.

As to the very nature of the phase transition at $r_{p} \approx 0.235$ , and the mechanism driving it, it turns out that we are dealing in fact with a percolation transition in its own right. An indication for this is obtained by plotting $\frac{〈 S_{c} 〉 / n}{s_{e}}$ against s_e for a range of different r_p-values. The advantage of the division of $〈 S_{c} 〉 / n$ by s_e is that the cluster size is related to the internal spatial arrangement of the contingent of cumulative infections (for instance, when $\frac{〈 S_{c} 〉 / n}{s_{e}} = 1$ the cumulative infections form a single bulk cluster) and that more subtle details of the r_p-dependence of $〈 S_{c} 〉$ are enhanced. Figure 26 shows the result of such a plot for the data represented in Fig. 24. We see that $\frac{〈 S_{c} 〉 / n}{s_{e}}$ remains almost zero for low values of s_e and then increases quite steeply over a narrow range of s_e-values towards a value $\frac{〈 S_{c} 〉 / n}{s_{e}} = 1$ , at which it then remains when s_e increases further. The curve seems to have an inflection point at approximately $s_{e} = 0.4$ (dotted line), which corresponds more or less to the middle of the narrow window of s_e-values in which $\frac{〈 S_{c} 〉 / n}{s_{e}}$ increases from almost zero to 1. The value $s_{e} \approx 0.4$ that marks the inflection point is not without meaning. First of all it agrees with the value of s_e at the inflection point of the s_e vs. $r_{p} = p_{r} / p_{i}$ curve in Fig. 24. It also agrees (within a very narrow margin) with the site-percolation threshold x_c for a 2D square lattice with only nearest-neighbour contacts, in the case where we begin with a completely filled lattice and then randomly remove/block a fraction of sites x (for $x > x_{c}$ , the remaining or unblocked sites no longer percolate in that case). As a logical consequence of the latter, if what we are dealing with here is truly a percolation transition, it has to be a transition in the subpopulation complementary to the subpopulation of cumulative infections, i.e. a percolation transition in the subpopulation of susceptibles. This may seem awkward at first, since at lower values of s_e the cumulative infections will come in dense-isolated clusters of neighbouring nodes surrounding the (single) initial infection (the one that provided the seed of the cluster), whereas the majority of susceptible nodes still forms a large bulk cluster. However, it can be shown that the phase transition indeed consists of a collapse of this very same bulk cluster of susceptibles.

Figure 26: — Plot of $\frac{〈 S_{c} 〉 / n}{s_{e}}$ vs. *s_e* (dotted line marks the inflection point).

For the ease of visualization, we represent the 2D square lattice of nodes by a 2D lattice of square tiles (the size of a tile being that of a lattice/unit cell of the node lattice) in such a way that each node corresponds to the centre of a single tile (see Fig. 27a).

Figure 27: — Construction of block-tiles and renormalization of the 2D square population lattice: (a) actual clusters (cumulative infections) on the original (2D square) lattice, (b) block-tiles effectively replacing the clusters on the original lattice, (c) block-tiles on the renormalized (2D square) lattice.

Now, as soon as the spread of the infection starts at t = 0, clusters of isolated (cumulative) infections begin to grow around each of the initial infections (see Fig. 27a for a pictorial impression). The number of clusters thereby equals the number of initial infections $n s_{0}$ . It is also important to be aware of the fact that the distribution of initial infections is assumed to be random, and that consequently also the distribution of the clusters over the (population) lattice will be random.

A suitable definition of the lattice position of an isolated cluster is thereby given by the cluster’s ‘centre of mass’ ${\vec{r}}_{c}$ :

{\vec{r}}_{c} = \frac{1}{n_{c}} \sum_{i} {\vec{r}}_{i}

(8.55)

since it not only matches our intuitive notion of the concept of cluster position closely, but also makes sense from a more mathematical standpoint. The ${\vec{r}}_{i}$ thereby represent the positions of the individual nodes (or tile centres) in a cluster, and n_c the number of nodes/tiles in a cluster (i.e. the size of a cluster). The index i runs over the individual nodes in the cluster. Note that the centre of mass of a cluster is independent of the representation of the population lattice: a lattice of nodes will give the same ${\vec{r}}_{c}$ as the corresponding lattice of tiles, since the position of the nodes is coincident with the centres of the corresponding tiles. In practice, ${\vec{r}}_{c}$ for a particular cluster of larger size is expected not to differ too much from the position ${\vec{r}}_{0, c}$ of the initial infection from which the cluster has grown (since the differences ${\vec{r}}_{0, c} - {\vec{r}}_{i}$ will tend to cancel out for larger cluster sizes). Once the clusters of cumulative infections have grown somewhat larger, we expect their individual sizes to differ only marginally (since all clusters grow under the same stochastic conditions) and the statistical spread in the cluster sizes low enough to consider all clusters to be of the same size as the average cluster size.

Now let us imagine that each isolated cluster of cumulative infections is being replaced by a square block of tiles (see Fig. 27b), the number of tiles in a block being (within as close a margin as possible) equal to the average number of tiles in a cluster. The centre of each square block is taken in accordance with the centre of mass ${\vec{r}}_{c}$ of the cluster it replaces. Next, we consider a new 2D square lattice, made-up of tiles the size of the aforementioned (cluster-replacing) square blocks. We will refer to these tiles as ‘block tiles’ (by analogy with a conceptually similar construct called ‘block spins’, introduced by L. Kadanoff in his approach to order–disorder transitions in Ising spin systems [44]). A block tile is considered to be corresponding to a single node type only [susceptible or (cumulative) infection].

The purpose of the lattice of block tiles is to obtain a simplified representation of the original (i.e. the real) population lattice of nodes/tiles while maintaining the essential features of the actual lattice, but with less tiles (i.e. with less degrees of freedom). The positions of the block tiles related to the cumulative infections are therefore taken to be as closely as possible to the position of the square blocks of tiles introduced to replace the clusters of cumulative infections in the original lattice. The result is a 2D square lattice of block tiles with a random distribution of tiles representing clusters of cumulative infections in the original lattice (see Fig. 27c).

By definition, the size of a block tile is given by the average cluster size $〈 S_{c} 〉$ . The number $n'$ of block tiles in the new lattice and the number n of nodes/tiles in the original lattice are related via $n' 〈 S_{c} 〉 = n$ , so that the ratio between $n'$ and n is given by

\frac{n'}{n} = \frac{1}{〈 S_{c} 〉} .

(8.56a)

The number $\sqrt{n' / n} = 1 / \sqrt{〈 S_{c} 〉}$ can be considered thereby as a scaling factor relating the typical length scales of both lattices (length of a tile edge, respectively, block tile edge). Therefore, the block-tile concept de facto corresponds to a renormalization transformation of the original 2D square lattice.

Furthermore, it is obvious that the fraction $s' = s_{0} n / n'$ of block tiles representing a cluster of cumulative infections in the renormalized lattice is (and must be) equal to the cumulative infection rate s of the population: $s' = s$ . An explicit expression for $s'$ obtained by using (8.56a) reflects this:

s' = \frac{s_{0} n}{n'} = s_{0} 〈 S_{c} 〉

(8.56b)

since $s = s_{0} 〈 S_{c} 〉$ in the case of isolated infected clusters.

Together, (8.56a) and (8.56b) are the mathematical expression of a pivotal insight. With increasing cluster size (i.e. increasing s), the total number $n'$ of (block) tiles in the renormalized lattice decreases [as expressed by (8.56a)]. The number of infected tiles remains the same however, since the number of the clusters they represent is independent of s and equal to the number of initial infections $s_{0} n$ . The decrease of the total number of tiles is therewith entirely due to a decrease of the number of susceptible tiles. Hence, the fraction $s'$ of infected (block) tiles in the renormalized lattice increases with increasing cluster size and s, whereas the fraction of susceptible tiles decreases. This is expressed by (8.56b), showing that the fraction of infected (block) tiles in the renormalized lattice is in fact equal to the fraction s of infected nodes (tiles) in the real population lattice, the fraction of susceptible tiles being $1 - s$ .

A legitimate question now is: will block tiles of a certain type percolate at some critical value of s? Since the block tiles are randomly distributed over the renormalized lattice, the answer is yes. Starting as a single bulk cluster, consisting of the entire population (except for the initial infections) when s = s₀, the susceptible tiles (randomly distributed like their infected counterparts) reach their percolation threshold $x_{c, s} \approx 0.5927$ on the original 2D square lattice with nearest neighbour contacts when the fraction of infected tiles reaches the critical value $x_{c, i} = 1 - x_{c, s} \approx 0.4073$ . The result is a percolation transition of the susceptible tiles: for $s < 1 - x_{c, s} = x_{c, i}$ , a large bulk cluster of susceptible tiles exists (which collapses when $s = x_{c, i}$ ), whereas for $s > 1 - x_{c, s} = x_{c, i}$ the contingent of susceptible tiles is split-up into clusters of smaller size (i.e. significantly smaller than the size of the lattice). However, the crossing of the aforementioned percolation threshold by the susceptible tiles also has important implications for the infected tiles, since the renormalized lattice is a bit odd. Its size is not constant when expressed in the lattice’s very own ‘natural’ units of area and length (given by a single block tile and the length of the edge of a block tile, respectively) but varies with s. This is (of course) closely intertwined with the fact that the total number $n / 〈 S_{c} 〉$ of block tiles in the renormalized lattice decreases a function of s. As a result, the lattice shrinks with increasing s when measured in its natural units of size. Since the number of infected tiles is constant, their average separation (measured in natural length units) must therefore decrease with increasing s. Therefore, when s reaches a critical value when $s = x_{c, i}$ , a majority of the infected block tiles comes in touch with other infected block tiles, thus forming (percolating) paths of infected block tiles. This is in fact how the percolation of the susceptible tiles is broken. We see therewith that the critical value $s = x_{c, i} = 1 - x_{c, s} \approx 0.4073$ marks in fact two percolation transitions, one in the sublattice of (cumulative) infections and one (in the opposite direction) in the sublattice of susceptibles. This can be clearly seen when we consider what happens at cluster level in the original lattice when s reaches $x_{c, i} = 1 - x_{c, s}$ . It is not difficult to recognize that percolation of block tiles corresponds to percolation of clusters in the original lattice. We thus can say that block-tile renormalization transforms cluster percolation in the original lattice into tile percolation in the renormalized lattice. At $s = x_{c, i} \approx 0.4073$ , the majority of infected clusters merges into a single cluster, splitting-up the remaining susceptibles into smaller clusters enclosed by cumulative infections.

This can be illustrated when we compare the average size of the cluster(s) of cumulative (removed) infections $〈 S_{c r} 〉$ to that of the cluster(s) of susceptibles $〈 S_{c s} 〉$ as a function of s_e (the end-value of s) and $1 - s_{e}$ . Figure 28 shows $〈 S_{c r} 〉$ and $〈 S_{c s} 〉$ as a function of s_e and $1 - s_{e}$ , normalized, respectively, against $1 - s_{e}$ and s_e (compare with Fig. 26). The data were obtained from similar simulations (same parameters) as the data represented in Figs 24–26. Figure 28 clearly shows the symmetry involved: when s_e approaches $x_{c, i} \approx 0.4073$ (lower horizontal scale) then $〈 S_{c r} 〉 / s_{e}$ (right scale) starts to undergo a rather steep transition, from $〈 S_{c r} 〉 / s_{e} \approx 0$ in the low s_e regime, to an almost ‘saturated’ value $〈 S_{c r} 〉 / s_{e} = 1$ for higher values of s_e (see also Fig. 26). Parallel to this, $〈 S_{c s} 〉 / (1 - s_{e})$ undergoes a similar steep transition in the opposite direction: when s_e crosses the threshold $x_{c, i} \approx 0.4073$ , its complement $1 - s_{e}$ (upper horizontal scale) crosses the percolation threshold $x_{c, s} = 1 - x_{c, i} \approx 0.5927$ of the susceptible nodes and $〈 S_{c s} 〉 / (1 - s_{e})$ (left scale) changes from $〈 S_{c s} 〉 / (1 - s_{e}) = 1$ to $〈 S_{c s} 〉 / (1 - s_{e}) \approx 0$ .

Figure 28: — Plots of $\frac{〈 S_{c s} 〉 / n}{1 - s_{e}}$ [left axis (see arrow)] and $\frac{〈 S_{c r} 〉 / n}{s_{e}}$ [right axis (see arrow)] vs. *s_e* (bottom axis) and $1 - s_{e}$ (top axis).

Furthermore, in a series of pictures, Fig. 29 shows the actual process of coalescence of the cumulative infections into a large bulk cluster (accompanied by a split-up of the bulk cluster of susceptibles into isolated smaller clusters) as it happens in reality (i.e. on the real, non-renormalized, population lattice). We see that for $p_{r} / p_{i} = 0.4$ (Fig. 29a), the epidemic is largely suppressed by the decay of active infections, and s_e does not exceed any further than a value of $s_{e} = 0.0184$ . As a result, the clusters of cumulative infections remain very small and isolated. With decreasing $r_{p} = p_{r} / p_{i}$ , the influence of infection removal diminishes, the value of s_e increases and the average size of the clusters of cumulative infections grows. Nevertheless, when $p_{r} / p_{i}$ decreases from $p_{r} / p_{i} = 0.4$ to $p_{r} / p_{i} = 0.3$ the corresponding increases in the infected-cluster size and s_e are still modest (a difference of $Δ r_{p} = - 0.1$ in r_p results in an increase of only $Δ s_{e} = 0.053$ in s_e). There is also still a bulk cluster of susceptibles in this case. However, when $p_{r} / p_{i}$ is further reduced, the effects of increased percolation take over. The size of the infected clusters and s_e increase more and more over ever smaller intervals of $p_{r} / p_{i}$ . Ever more infected clusters merge into larger clusters, while the bulk cluster of susceptibles is gradually split-up into smaller clusters, until at $p_{r} / p_{i} \approx 0.235$ (Fig. 29f), the percolation threshold has been reached and the cumulative infections form a large bulk cluster, whereas the susceptible nodes are confined to smaller (secondary) clusters. The differences between Fig. 29e and f are particularly noteworthy in this respect, since they illustrate the sharpness of the actual transition: a very minor difference in $r_{p} = p_{r} / p_{i}$ of only $Δ r_{p} = - 0.025$ makes the difference between predominantly isolated clusters of cumulative infections, accompanied by a bulk cluster of susceptibles, and the opposite situation.

Figure 29: — Status of the population after fade-out of the epidemic for different $r_{p} = p_{r} / p_{i}$ values (inherently corresponding to different *s_e* values). Black: (cumulative) infections, white: susceptibles. The gradual confluence of the clusters of infections with decreasing *r_p* (increasing *s_e*) is obvious. Case (f) represents the crossing of the percolation threshold(s). (a) $p_{r} / p_{i} = 0.40, s_{e} = 0.0184$ . (b) $p_{r} / p_{i} = 0.35, s_{e} = 0.0304$ . (c) $p_{r} / p_{i} = 0.30, s_{e} = 0.0714$ . (d) $p_{r} / p_{i} = 0.275, s_{e} = 0.1197$ . (e) $p_{r} / p_{i} = 0.25, s_{e} = 0.2566$ . (f) $p_{r} / p_{i} = 0.235, s_{e} = 0.4267$ .

The remaining question now is how, also at intermediate values of s, percolation phenomena can have such a significant effect on the production of new infections that they can compete with (and even compensate) infection removal. The answer here is in the effect of percolation on $〈 s_{s i} 〉$ or, equivalently, on $〈 s_{i s} 〉$ . This is illustrated best by considering $〈 s_{i s} 〉$ as a function of s. Figure 30 shows $〈 s_{i s} 〉$ vs. s for p_r = 0 (so that r_p = 0) as its main figure. We clearly see that $〈 s_{i s} 〉$ has a point of inflection of such a kind that $\partial 〈 s_{i s} 〉 / \partial s$ increases with s for higher values of s. Below the point of inflection $\partial 〈 s_{i s} 〉 / \partial s$ decreases with s. Numerical evaluation of $\partial 〈 s_{i s} 〉 / \partial s$ (inset in Fig. 30) shows that the inflection point occurs at $s \approx 0.4$ and corresponds therewith to the percolation transition described at the foregoing pages. Apparently, the formation of secondary clusters of susceptible nodes, and their enclosure by closed ‘fronts’ of active infections from the bulk cluster of cumulative infections, has a positive effect on $〈 s_{i s} 〉$ . This seems quite logical, since the infection now closes in on the susceptible nodes from all directions so to say. The resulting boost in $〈 s_{i s} 〉$ directly relates to a boost in the growth rate of the rate of infections given by $\dot{s} = p_{i} (1 - s) 〈 s_{i s} 〉$ (see Section 1). When p_r is large enough to keep s below $x_{p, i}$ , the epidemic will be unable to take advantage of this mechanism. However, as soon as s breaks through the percolation threshold ( $s \geq x_{c, i}$ ), the number of (cumulative) infections will get the aforementioned boost, and so will the value of s_e in the end.

Figure 30: — $〈 s_{i s} 〉$ vs. s for *p_r* = 0 (main figure) and $\partial 〈 s_{i s} 〉 / \partial s$ vs. s (inset). Dot and dashed line mark the point of inflection.

The Ising case with only nearest-neighbour contacts represents a rather extreme example of contact limitation, as applied in only the strictest of lock-downs. Percolation shapes the evolution of the epidemic on par with infection removal in this case. However, the effects of percolation seem to be important under less strict lock-down conditions too, although their magnitude decreases significantly when the number of close contacts (ν) of the population members increases (especially in the lower range of ν-values). Some essential characteristics of the results for the Ising case seem to remain however, irrespective of the value of ν.

To give a mathematical description for what (approximately) happens in non-Ising cases we introduce the area A_c of a cluster, i.e. the area within the 2D population lattice that can be attributed in a meaningful way to a particular cluster. Although a concept that is easy to grasp on an intuitive basis, a precise definition of A_c is subject to quite some arbitrariness. A possible definition that also resonates with intuition could be that A_c stands for the largest area that can be fenced-in by a closed circuit of straight lines connecting nodes of a certain type in the cluster, thereby enclosing all other nodes of that type in the cluster (see Fig. 31). It is obvious that a cluster area defined in this way generally contains nodes of both types (cumulative infections and susceptibles). We therefore introduce the filling factor $ϕ_{c}$ of a cluster, which is the fraction of the nodes/tiles contained by A_c that relate to the node type of interest. As such, the cluster size S_c is related to the cluster area via $S_{c} = ϕ_{c} A_{c}$ .

Figure 31: — Definition of cluster area *A_c* as the largest area that can be fenced-in by straight lines (dotted) connecting elements of the cluster.

We express the area A_c in the number of tiles in the original population lattice covered by it. Let $〈 A_{c} 〉$ be the average of A_c defined as [compare with (8.1a)]:

〈 A_{c} 〉 = \sum_{c} \frac{S_{c}}{n_{t}} \cdot A_{c} = \frac{1}{n_{t}} \sum_{c} \frac{S_{c}^{2}}{ϕ_{c}}

(8.57a)

which represents an average over the size S_c of the clusters normalized against the total number n_t of nodes of the type of interest in the network, with c running over all clusters of the node type of interest.

The average cluster size is given by

〈 S_{c} 〉 = \sum_{c} \frac{S_{c}}{n_{t}} \cdot S_{c} .

(8.57b)

Assuming that $ϕ_{c}$ is a constant (written simply as $ϕ$ ) for all clusters, combining (8.57a) and (8.57b) yields a relation between the average cluster area and the average cluster size:

〈 A_{c} 〉 = \frac{〈 S_{c} 〉}{ϕ} .

(8.58)

The nodes of our interest here are those related to the cumulative infections. An issue to be dealt with in this context is the distribution of the cumulative infections over the clusters that they concentrate in at low values of s. Those clusters start at an initial infection and then grow outwards. With time (moderate s) there will be a fairly high density of (cumulative) infections close to the centre of mass $\vec{r_{c}}$ of the cluster, forming a dense nucleus of infections around $\vec{r_{c}}$ . Moving further away, the density decreases however, and near the cluster boundaries only a few (cumulative) infections are present per unit of area. This is clearly shown in Fig. 32, which pictures the simulated node status in a population section of 700 × 700 nodes at $s \approx 0.24$ in the absence of infection removal (i.e. p_r = 0) for a case where the contact bubble of a particular node consists of the nodes in a 11 × 11 square centred around that node. Black tiles/dots represent the infections, the white areas the susceptibles. The individual clusters are easily recognizable, as well as their tendency to merge.

Figure 32: — Status of nodes in a 700 × 700 section of a population (black: cumulative infections, white: susceptibles).

However, the problem is how to define and identify the confluence of two clusters. That is, when can we say that two clusters have merged? As with the definition of the cluster area, there is a degree of arbitrariness also in this matter. We cope with the issue in a somewhat pragmatic way by introducing an effective area A_e:

A_{e} = θ 〈 A_{c} 〉,

(8.59)

where θ represents a parameter between 0 and 1 ( $0 \leq θ \leq 1$ ). The purpose/effect of θ is to lower the area that actually represents a cluster. Its value is such that two clusters become indistinguishable roughly when the distance between their respective centres of mass ${\vec{r}}_{c, 1}$ and ${\vec{r}}_{c, 2}$ relates to $A_{e} = θ 〈 A_{c} 〉$ as:

| {\vec{r}}_{c, 2} - {\vec{r}}_{c, 1} | = O (\sqrt{A_{e}}),

where $\sqrt{A_{e}}$ should be considered as a typical measure of the (effective) length/diameter of a cluster. Combining (8.58) and (8.59), we thus obtain for A_e:

A_{e} = (\frac{θ}{ϕ}) \cdot 〈 S_{c} 〉 .

(8.60)

It is this effective cluster area that defines the block tiles in a newly constructed 2D square lattice: a block tile consists of a square arrangement of a number of original lattice tiles as close as possible to A_e. The number n of tiles in the original lattice and the number $n'$ of block tiles in the lattice after renormalization are related via $n' A_{e} = (θ / ϕ) n' 〈 S_{c} 〉 = n$ , so that we obtain:

\frac{n'}{n} = (\frac{ϕ}{θ}) \cdot \frac{1}{〈 S_{c} 〉}

(8.61a)

as a generalization of (8.56a). A generalization of (8.56b) is also obtained straightforwardly:

s' = \frac{s_{0} n}{n'} = (\frac{ϕ}{θ}) \cdot s_{0} 〈 S_{c} 〉

(8.61b)

so that (with $s = s_{0} 〈 S_{c} 〉$ ):

s' = s \cdot \frac{θ}{ϕ} .

(8.62)

The block tiles related to the clusters of cumulative infections break the percolation of the susceptible block tiles when $1 - s' = x_{c, s} \approx 0.5927$ ( $s' = x_{c, i} \approx 0.4073$ ), which (as seen previously) corresponds to a collapse of the bulk cluster of susceptibles and the formation of a bulk cluster of cumulative infections. Note that by the very definition of a cluster, the existence of a bulk cluster also implies that its members are in a state of percolation.

To illustrate the influence of the size of the social bubble (value of ν), Fig. 33 shows both the normalized average cluster size $〈 S_{c} 〉 / n$ and s_e as a function of $p_{r} / p_{i}$ for the case where the contact environment (social bubble) of a node consists of its nearest and next-nearest neighbours (i.e. the nodes in the $(2 N + 1) \times (2 N + 1)$ square with N = 1 surrounding the node, which itself is at the centre of the square). We see that $〈 S_{c} 〉 / n$ and s_e behave as a function of the critical value of $p_{r} / p_{i} = r_{p}$ in a similar way as in the Ising case (see Fig. 24).

Figure 33: — End-value *s_e* of the accumulative infection rate (open circles/right axis) and normalized average cluster size $〈 S_{c} 〉 / n$ (dashed curve/left axis) vs. $p_{r} / p_{i}$ for a case with nearest- and next-nearest neighbour contacts.

However, compared with the Ising case, the critical value $p_{r} / p_{i} \approx 0.32$ that marks both the collapse of the bulk cluster of cumulative infections and the inflection point in the s_e vs. $p_{r} / p_{i}$ curve is higher than in the Ising case. Apparently, the effects of percolation phenomena have become less strong as a result of the changes in lattice topology and coordination number (the number of contacts per node ν is 8 in this case vs. 4 in the Ising case).

The qualitative behaviour of the curves in Fig. 33 suggests that there may be a second-order phase transition involved in this case as well. This is confirmed in Fig. 34, which shows $〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉$ vs. $p_{r} / p_{i}$ . The critical value of $p_{r} / p_{i} \approx 0.32$ observed in the data in Fig. 34 appears to be related to a divergence of $〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉 / n$ , which indeed provides us with conclusive evidence for a second-order phase transition taking place at $p_{r} / p_{i} \approx 0.32$ .

Figure 34: — Main figure: normalized standard deviation $\sqrt{〈 {(S_{c} - 〈 S_{c} 〉)}^{2} 〉} / n$ of the cluster size as a function of $p_{r} / p_{i}$ for a case with nearest- and next-nearest neighbour contacts.

Furthermore, it looks like the inflection point in the s_e vs. $p_{r} / p_{i}$ curve in Fig. 33 is due to the same mechanism as the one responsible for the inflection point of a similar kind in Fig. 24. To demonstrate this, Fig. 35 shows $〈 s_{i s} 〉$ vs. s for this case. The resemblance to Fig. 30 is obvious: like the curve in Fig. 30, the curve in the main figure of Fig. 35 also has a point of inflection (at approximately $s = 0.37$ ), and behaves similarly to the curve in Fig. 30, both below and above s = 0.37. The inflection point is again to be considered as a result of the type of percolation transition that we described in the above via block-tile renormalization of the population lattice: at $s \approx 0.37$ , the cumulative infections reach percolation while the percolation of the susceptibles is broken. As a consequence, like in the Ising case, percolation-induced boosting of the infection rate will also occur here for s > 0.37. Hence, the inflection point in the s_e vs. $p_{r} / p_{i}$ curve in Fig. 33. It is worth mentioning that apparently $θ / ϕ \approx 1$ for this case, since the value $s \approx 0.37$ differs only marginally from the critical value $s' \approx 0.4073$ that marks the percolation threshold of the block tiles representing the clusters of cumulative infections [see (8.62)].

Figure 35: — $〈 s_{i s} 〉$ vs. s for *p_r* = 0 (main figure) and $\partial 〈 s_{i s} 〉 / \partial s$ vs. s (inset) for the case with nearest- and next-nearest neighbour contacts. Dot and dashed line mark the point of inflection.

By comparison of the data in Figs 24 and 33, it is obvious (and not entirely unexpected) that the influence of percolation effects becomes smaller when the social bubbles of the members of the population increase: an increase of ν from 4 to 8 leads to a significant increase in the critical value of $r_{p} = p_{r} / p_{i}$ from $r_{p} \approx 0.235$ to $r_{p} \approx 0.32$ . It is easy to see that when the social bubbles are increased ever further, this trend will remain and take the form of an asymptotic approach of the percolation-free case ( $ν, N = \infty$ ) for which r_p = 1. In this connection, it is also obvious that for all social bubbles of finite size the critical values of r_p will relate to a phase transition of second-order involving (and partially driven by) a collapse/formation (depending on whether r_p is respectively, increased or decreased) of a bulk cluster of cumulative infections.

Being the limit for $ν, N \to \infty$ , the percolation-free case stands out against the latter. The cluster concept has no relevance in this case, since all cumulative infections are part of one single (bulk) cluster by definition (irrespective of S_c). Consequently, there is neither a percolation threshold in this case nor a divergence of a correlation length (which can be considered as infinite for all s in fact). A parallel to thermodynamic systems with cooperative long-range interactions presents itself here. Mean-field methods may give an exact description of the thermodynamic behaviour (including phase transitions) of these systems when the interaction range becomes infinite [45]. The individual entities that make-up the system (like spins, molecules, etc.) thereby interact with an effective interaction (mean field), which is described in terms of the average state of each entity in the system. As such, there is no meaningful notion of a correlation length (except that it is infinite), and fluctuations on finite length scales are averaged out without any loss of relevant information.

It is not difficult to see that the percolation-free case shows a strong resemblance to this picture, and that the standard SIR-model represents the equivalent of an exact mean-field approach in this. The standard SIR-model sets $〈 s_{s i} 〉$ and $〈 s_{s i} 〉$ equal to, respectively, the probability $p = 1 - s$ that a node is still in a susceptible state and the probability $p' = s$ that a node has already been infected: $〈 s_{s i} 〉 = (1 - s), 〈 s_{s i} 〉 = s$ . It is easy to recognize that these substitutions are the conceptual analogue of the introduction of a mean field. Like the mean-field methods, in most cases, the standard SIR-model is an approximation. However, when the social bubbles of the nodes extend to the entire population (so that the ‘range of social interactions’ becomes infinite), these identities become exact and the SIR-model gives an exact description of the evolution of the epidemic, thus accentuating the analogy between the model and mean-field methods for systems with cooperative interactions. It is evident that the similarities in concept and role between the standard SIR-model and the mean-field methods for phase transitions in physical systems with cooperative interactions are in accordance with the viewpoint that the onset/disappearance of herd immunity corresponds to a (second-order) phase transition.

8.6 The SIR-model and percolation in unvaccinated populations

On the basis of the previous section, the standard SIR-model seems (and actually is, as we will see in this section) inadequate for describing percolation phenomena of the kind that we encountered in the previous section. However, the more general SIR-model outlined in Section 1 is in fact able to account for the effects of percolation in both vaccinated (see Section 8.4) and unvaccinated populations. The latter can be illustrated best by using an approximation of $〈 s_{i s} 〉$ in the form of the truncated series expansion:

〈 s_{i s} 〉 \equiv a_{1}^{i s} s + a_{2}^{i s} s^{2} + a_{3}^{i s} s^{3},

(8.63)

where the coefficients are fitted to the $〈 s_{i s} 〉$ vs. s data represented in Fig. 35, yielding $a_{3}^{i s} = 2.365, a_{2}^{i s} = - 3.02, a_{1}^{i s} = 1.77$ . With these values taken for $a_{1}^{i s}, a_{2}^{i s}, a_{3}^{i s}$ , the approximation (8.63) reproduces the data in Fig. 35 fairly well (see Fig. 36a). The reason why an approach based on an expansion of $〈 s_{i s} 〉$ (instead of $〈 s_{s i} 〉$ ) is suited best here is 2-fold. First of all we have seen in the previous section that the behaviour of $〈 s_{i s} 〉$ as a function of s provides a very clear indicator for the influence of percolation in both a qualitative and quantitative sense (inflection point). Second, the variation of $〈 s_{i s} 〉$ with s can be approximated/described well in terms of a truncated series expansion with far less terms than the variation of $〈 s_{s i} 〉$ with s. The fit of the data presented in Fig. 35 already demonstrated that a third-order polynomial in s provides a good approximation for $〈 s_{i s} 〉$ in this respect. This allows for the (semi)-algebraic solution of the differential equations involved, as outlined in Section 3.3. In principle, the parameters $t'$ and $s'$ as they occur in Equation (3.12) can thereby be chosen in an arbitrary way. However, making this choice can be somewhat tricky. As we see from Fig. 36a, the relative difference between the actual value of $〈 s_{i s} 〉$ obtained from simulation and the best-fitting approximation based on (8.63) is quite substantial at very low values of s (near s = 0). Calculations based on (3.12) have shown that these fairly large (relative) variations in $〈 s_{i s} 〉$ considerably compromise the calculated s–t relation and its agreement with the simulated data. We must therefore choose the point $(t', s')$ at not too low a value of $t'$ [i.e. well within the regime where the fitted third-order polynomial (8.63) does provide a good approximation (with only minor relative differences) for $〈 s_{i s} 〉$ ].

Figure 36b shows simulated data for s vs. t along with a solution of the differential equation (3.11b) given by (3.12) under substitution of the values for $a_{1}, a_{2}, a_{3}$ obtained from the fit to the $〈 s_{i s} 〉$ vs. s data presented in Fig. 36a. Time is measured in units of ‘cycle times’, each of which corresponds to a full ‘cycle’ of n simulated contacts,¹³ so that in one single cycle time each member of the population makes, on the average, one single contact. The value of $t'$ chosen is $t' = 7.5$ (in cycle times). The agreement between the solution of the modified SIR-model (with fitted parameters) and the simulated data is splendid. This example emphasizes therewith not only the ability of the modified SIR-model to account for percolation effects rooted in lattice correlations, but also its necessity as a replacement for the standard SIR-model in those cases where a strong influence of percolation phenomena is to be expected. The inadequacy of the standard SIR-model is illustrated thereby by the dashed curve in Fig. 36b, which shows s vs. t calculated on the basis of the standard SIR-model for the same p_i as used in the simulation. The discrepancy with the simulated data and the modified SIR-model cannot be missed and reveals, in fact, a complete failure of the standard SIR-model for this case.

8.7 Combined effects of vaccination and social-network restrictions

We now focus on situations where vaccination and percolation effects due to limitations of the social-bubble size combine, so that their effects may add-up or even strengthen one another. Figure 37 shows the values of s_e for various vaccination rates x_v as a function of $r_{p} = p_{r} / p_{i}$ for a simulated epidemic on a 2D square lattice with nearest and next-nearest neighbour contacts (N = 1) and vaccine efficiency ϵ = 1 (full immunity when vaccinated). All curves show a continuous transition as a function of $p_{r} / p_{i}$ , from a regime of high s_e values to a regime of low and even negligible s_e values. The transitions are quite sharp (especially for low values of x_v) and as such they display the phase-transition behaviour that we have seen in the previous sections. The data for x_v = 0 have already been presented in Fig. 33, together with the variation of the average size of the clusters of cumulative infections indicative of a percolation transition. The transition points r_c (i.e. the critical $r_{p} = p_{r} / p_{i}$ ) are identified as the inflection points of the curves. Note that also the maximum values of s_e (at $p_{r} / p_{i} = 0$ ) show an expected decrease with x_v [which appears to be (almost) linear].

Figure 37: — Variation of *s_e* for *x_v* = 0 (○), $x_{v} = 0.0625$ ( $▹$ ), $x_{v} = 0.125$ ( $⋄$ ), $x_{v} = 0.1875$ ( $△$ ), $x_{v} = 0.25$ (▽), $x_{v} = 0.3125$ ( $◃$ ), $x_{v} = 0.375$ ( $□$ ), $x_{v} = 0.4375$ (+) and $x_{v} = 0.5$ (×).

Although way below the critical value $p_{r} / p_{i} = 1$ for removal-related herd-immunity already for x_v = 0 (see Sections 5 and 6), the values of r_c become even significantly lower than that with increasing x_v, and even approach zero when x_v approaches $x_{v} = 0.593 \equiv x_{p}$ , which corresponds to the percolation threshold $x_{c, s} = 1 - x_{p} = 0.407$ for the susceptible nodes at the onset of the epidemic in case of nearest and next-nearest neighbour contacts [43]. We clearly see vaccination and the effects of size reduction of the social bubbles team-up in bringing r_c down and suppressing the spread of an infection. This ‘bundling of forces’ can be understood as follows. Vaccination acts in a 2-fold manner by reducing the number of new infections possible per unit of time and by blocking/reducing paths along which the infection can propagate through the population. The latter mechanism is a true percolation effect and, as such, enhanced by reductions of the social-network size.

Figure 38 shows the variation of r_c as a function of x_v as obtained from the data in Fig. 37. The variation appears to be (almost) perfectly linear and with r₀ representing the value of r_c for $x_{v} = 0$ it can be described by the empirical relation:

r_{c} = r_{0} (\frac{x_{p} - x_{v}}{x_{p}}) = r_{0} (1 - \frac{x_{v}}{x_{p}}) .

(8.64)

This result implies the following criterion for (vaccine-acquired) herd-immunity:

\frac{r_{p}}{1 - x_{v} / x_{p}} = \frac{1}{1 - x_{v} / x_{p}} \cdot \frac{p_{r}}{p_{i}} \geq r_{0} .

(8.65)

At least from a phenomenological point of view, we may consider this relation as a generalization (for percolation effects) of the results on herd-immunity presented in Sections 5–7 [see formula (5.1), (6.24) and, with ϵ = 1, (7.7)]. It seems that a herd-immunity threshold generally exists and takes the mathematical form of a critical value r_c of the ratio between p_r and p_i, if we think of herd-immunity as a situation where a bulk cluster of cumulative infections is no longer possible and only a minor part of the population will get infected [both in case of strong herd-immunity under a regime of social normality and in case of weak herd-immunity under a regime of social measures (see Section 6)]. When $p_{r} / p_{i} > r_{c}$ there can be no large-scale propagation of infections throughout the population. The herd-immunity threshold for x_v = 0 is given by (written as) r₀. In general, $0 \leq r_{0} \leq 1$ . The exact value of r₀ results from the specific percolation phenomena involved, and is therefore typical of the specific (finite) size and structure of the social bubbles of the population members. As a result of vaccination, the herd-immunity threshold decreases in accordance with (8.64). Comparison of (7.7) and (8.65) suggests that the ratio ${\bar{x}}_{v} = x_{v} / x_{p}$ thereby acts as an effective vaccination rate [conceptually comparable to $s_{v} = ϵ x_{v}$ in (7.7)] and may therefore be considered as just that. As such (at least when ϵ = 1) the result (7.7) may be regarded as merely a special case for $r_{0} = 1$ and x_p = 1 (so that ${\bar{x}}_{v} = x_{v}$ ) of a more general result given by (8.65).

Instead of a rigorous mathematical treatment of the observed (linear) functional relationship between r_c and x_v, a simple qualitative analysis that makes the specific form of (8.64) at least plausible can be given as follows. The curves in Fig. 22 show that, even when x_v = 0, a reduction of the size of the social bubbles the nodes may lead to a very strong decrease of $〈 s_{s i} 〉$ with s at (very) low s-values. Upon increasing s for (very) low N and starting from $s \approx 0$ , a kind of plateau in the $〈 s_{s i} 〉$ vs. s curves is reached already at s-values only marginally higher than s = 0. Let $s'$ be a typical value for s in this respect. For $s > s'$ , the change in the active-infection rate per unit of time can be written as:

{\dot{s}}_{i} = p_{i} s_{i} ⟨ s_{s i} ⟩ - p_{r} {\overset{}{s}}_{i} = p_{i} s_{i} \cdot \sum_{n = 0}^{\infty} a'_{n} {(s - s')}^{n} - p_{r} {\overset{}{s}}_{i} .

For x_v = 0, the coefficient $a'_{0}$ is identified as the parameter r₀, so that for $s = s'$ and x_v = 0:

{\dot{s}}_{i} / s_{i} = p_{i} r_{0} - p_{r} .

In general, $a'_{0}$ will depend on x_v, and we write:

a'_{0} (x_{v}) = r_{0} (1 - E (x_{v}) \cdot x_{v}) .

The terms $E (x_{v}) \cdot x_{v}$ represent an effective vaccination rate (compare with $s_{v} = ϵ s_{v}$ in Section 7), where the function $E (x_{v})$ accounts for an additional decrease (due to percolation effects) of the average number of susceptibles (linked to a single active infection) that may become part of a bulk cluster of cumulative infections. In the absence of percolation effects $E (x_{v}) = 1$ so that the effective vaccination rate is equal to the actual vaccination rate x_v, whereas in cases where percolation effects are present $E (x_{v}) > 1$ , and the effective vaccination rate will be larger than the actual vaccination rate in those cases. Obtaining the exact mathematical form of the function $E (x_{v})$ is a difficult problem. However, x_v = x_p marks the threshold beyond which there will no longer be a bulk cluster of cumulative infections. Hence, $(1 - E (x_{p}) \cdot x_{p}) = 0$ , so that we obtain $E (x_{p}) = 1 / x_{p}$ . Consequently, when we consider $E (x_{v})$ to be a differentiable function of x_v for $0 < x_{v} < x_{p}$ , we can express $E (x_{v})$ as a series expansion in $x_{v} - x_{p}$ of the form:

E (x_{v}) = \frac{1}{x_{p}} + \sum_{n = 1}^{\infty} E_{n} \cdot {(x_{v} - x_{p})}^{n}

in order to account for x_v-dependence of $E (x_{v})$ explicitly. However, Fig. 38 suggests that, apparently, the contribution to $E (x_{v})$ due to terms of the order $n \geq 1$ is very modest and even negligible for all $x_{v}$ , and that only the 0th-order term $1 / x_{p}$ of the series expansion prevails. This then leads us straight to the empirical relation (8.64).

8.8 Herd-immunity and percolation

It may be clear from the previous sections in this section that percolation phenomena are a contributing factor to the achievement of herd-immunity. Even in unvaccinated populations, the critical value of $r_{p} = p_{r} / p_{i}$ (beyond which the cumulative infection rate will stick at a very low value) is significantly lowered when the social-bubble size is reduced. This is clearly illustrated by the data presented in Figs 24 and 33 in Section 8.5. These show, for both the cases $ν = 4$ and $ν = 8$ , a critical value of r_p much lower than the value r_p = 1 expected in the percolation-free case ( $ν \to \infty$ ), even when x_v = 0. It is emphasized that this reduction is caused by percolation effects alone, that is, by the increased reduction of the number of susceptibles in the social bubble of an active infection for given s arising from the reduction of the size of the social bubbles ν and the increased influence of lattice correlations that goes with such a reduction, as well as by cluster-percolation effects discussed in Section 8.5.

Furthermore, the reduction of the critical r_p-value becomes even (much) stronger in vaccinated populations, as shown by the data presented in Figs 37 and 38, and percolation effects are responsible for this observation. The increased influence of percolation effects in these cases may be inferred from Section 8.3, where it is shown that an epidemic becomes actually impossible when the percolation threshold of population lattice is passed, even without infection removal. When $p_{r} \neq 0$ , the effects of infection removal and percolation apparently team-up to reduce the critical value of r_p significantly more than the reduction resulting from each of these mechanisms separately [i.e. when only one of these mechanisms would apply while the other does not, like in the case of infection removal in a percolation-free case with $ν \to \infty$ or the case of percolation effects in the absence of infection removal (p_r = 0)].

Finally, it seems that the onset of herd-immunity is in fact related to a second-order phase transition, in which the average cluster size drops to (almost) zero, whereas the fluctuation size, reflected by $\sqrt{〈 (} S_{c} - 〈 S_{c} 〉)^{2} 〉$ , and the correlation length ξ diverge. It seems therefore that the herd-immunity transition is accompanied by a genuine percolation transition, and that percolation effects are therewith an integral part of the road towards herd-immunity. They are at least a ‘side-effect’, but in many cases, especially when the social bubbles are small, also a contributing cause. The entwining of the herd-immunity transition with a percolation transition to the extend that one may even argue that the herd-immunity transition is in fact a percolation transition epitomizes the essential role played by percolation phenomena in the achievement of herd-immunity, even in those cases where the primary mechanism driving the transition is infection removal. It may be clear that percolation effects contribute to the achievement of all types of herd-immunity that can be identified on the basis of the classification scheme presented in Section 6.2.

9. Epilogue

9.1 Summary, conclusions and suggestions for further research

The foregoing sections not only confirm previously known results and phenomena while putting them on more solid mathematical grounds (like the criterion for epidemic spread of an infection), but they also present a number of new results and viewpoints.

First of all, the inadequacy of the standard SIR-model, especially under certain conditions (strict lock-downs and the strong limitations of social contacts inherent to them), is revealed. Under very strict lock-down conditions, the failure of the standard SIR-model becomes quite dramatic, as demonstrated by the results of simulations presented in Section 4 and their comparison to predictions by the standard SIR-model for the same sets of parameters as those used in the statistical simulations. As such, the need becomes clear for a new paradigm that allows for either an extension or a correction of the standard SIR-model to address these issues. In Section 1, such a change of approach is actually presented by considering the population as a network, and through a particular focus on the parameters $〈 s_{s i} 〉$ and $〈 s_{i s} 〉$ and their expression as series expansions in the cumulative infection rate s (which itself acts as the analogue of a state parameter in thermodynamic systems). In contrast to the standard SIR-model, this new approach enables an appropriate reproduction of the data obtained from simulations in both a qualitative and quantitative way, even when strict social measures are in place that significantly reduce the number of contacts of each member of the population. However, modelling the spread of infectious diseases through a set of appropriately constructed differential equations is still surrounded by fundamental difficulties. The problem in this respect mainly lies with the influence of the network details on the actual spread of the infection. The complexity of network phenomena is notoriously difficult to describe in terms of an exact algebraic approach. The approach presented in this article bypasses this issue by the use of series expansions. However, the coefficients in these series expansions cannot be calculated via an ab initio scheme but their values have to be derived from already existing data (either from simulations or field data) in such a way that they provide a proper agreement between the solutions of the differential equations and the (actual) data already available from other sources (see Section 4). Nevertheless, even the insight that the standard SIR-model has serious shortcomings caused by the network structures of populations, and the awareness of the serious fundamental issues that these network structures bring about in relation to the modelling of infection spread in general, has its own merits. It comes therefore as a bit of a surprise that it can be shown (as in Section 5) that the criterion for an epidemic to develop from a limited number of initial active infections is actually independent of the structure of the population network (which, at first glance, seems to defy intuition).

Understanding the vital role of the network structure on the quantitative aspects of the spread of an infection also leads us to valuable insights regarding the unexpected and undesired effects of changing (scaling down) the regime of social restrictions, even when (or, better stated: especially when) the number of cumulative infections seems to stabilize or saturate (see Section 5.3). The fade-out of an epidemic is contextual, in the sense that the social-network structure has a direct influence on the total number of cumulative infections reached during a wave of infections that takes place under a specific regime of social restrictions. A change towards a regime of less social restrictions (larger social bubbles) when the cumulative number of infections seems to stabilize while there are still some residual active infections present may in some cases directly lead to a restart of the epidemic, and to a renewed increase of the number of active and cumulative infections (of potentially dramatic proportions). The relevance of these insights to policy makers is evident. Viewpoints as these are therefore among the key results presented in this article, as they also provide an illustration of the epistemic difficulties that exist in relation to the concept of herd-immunity and its understanding, and of how an inappropriate conception of herd-immunity may lead to the wrong social policies under the circumstances given.

One of the most urgent scientific issues with regard to herd-immunity is its definition: herd-immunity is still somewhat ill defined. To meet the need for a definition that is not only of practical use but also rooted in the fundamental mechanisms that govern the spread of infections, a classification (based on the results in Sections 4 and 5) is presented in Section 6.2. A distinction is made between weak and strong herd-immunity of either first or second degree. Weak herd-immunity corresponds to a situation as described in the above, where there is a saturation of the cumulative infection rate under a regime of social restrictions, but with the prospect of a new wave of infections once the restrictions are (partially) lifted. In contrast, strong herd-immunity relates to a stable situation where even a return to a situation of social normality will not lead to a new spark in the number of active infections and a new wave of infections. The distinction between first and second degree applies to both weak and strong herd-immunity. In a case of first degree herd-immunity (weak or strong), the number of active infections is over its peak and in decline: the epidemic is (under the circumstances given) in an inevitable state of fading-out. The complementary case where an epidemic (again under the circumstances given) has actually reached its end [so that the number of active infections is (close to) zero and the cumulative infection rate has reached saturation] is referred to a state of second-degree herd-immunity. It may be clear that the best (i.e. safest) policy, at least from an epidemiological point of view, should aim at achieving strong herd-immunity of (preferably) second degree.

The herd-immunity classification scheme was primarily conceived with a picture in mind of waves of infections that gradually come to a halt (mainly due to infection removal), thus resulting in a state of weak or strong herd-immunity. A less troublesome way to obtain a state of herd-immunity is vaccination. By vaccinating a sufficiently large proportion of the population with a vaccine of sufficiently high effectiveness ϵ, states of weak or even strong herd-immunity can be obtained. In a vaccine-induced state of weak herd-immunity, the vaccination campaign has to be complemented with social measures to prevent further spreading of an infection. In vaccine-induced cases of strong herd-immunity, the population is safe against an epidemic even without any social measures in place. It is evident that a vaccination campaign should aim at the latter. Section 7 deals extensively with the effects of vaccination. A criterion was obtained for a critical vaccination rate that represents the minimal vaccination rate for which immunity for the population at large (without social measures) against the outbreak of an epidemic is guaranteed (and therewith a state of strong herd-immunity is obtained). The line of thought to obtain such a criterion was similar to the one followed in Section 5.1 and, as a consequence, the critical vaccination rate thus obtained is independent of the structure of the social network. The effectiveness of the vaccine is an important parameter here and it explicitly enters the expression derived for the critical vaccination rate. In relation to this, it appeared useful to introduce an effective vaccination rate $s_{v c} = ϵ s_{v}$ . A crucial observation (see Fig. 14) is that for each $p_{i} / p_{r}$ a minimum effectiveness and a minimum effective vaccination rate exist below which a state of strong herd-immunity cannot be obtained, so that the pathogen involved has to be considered as endemic: unless vaccines of higher efficiency can be developed, vaccination is unable to provide (strong) herd-immunity and the pathogen will spread through the entire population unless appropriate social measures aimed at reducing $p_{i} / p_{r}$ are taken. The possibility of such a scenario may also become of relevance when new variants of a pathogen emerge: a vaccine that provides (strong) herd-immunity against one variant may not do so against a newer variant, the result being a new wave of infections. In all of this there is an additional caveat to be considered as well. Immunity, whether obtained via past infections or by vaccination, is not necessarily indefinite. Waning immunity is a well-known phenomenon in connection with many infectious diseases (both in relation to infection-acquired and vaccine-acquired immunity). As a consequence, and depending on the extend to which the immunity of the population members decreases, a (collective) state of herd-immunity may, once achieved, still not be ever-lasting (as the graphical illustration on the basis of Fig. 14a and b shows at the end of Section 7).

A link exists between the criterion for an epidemic to evolve from a modest number of initial infections (and correspondingly also the criteria for herd-immunity) and the value of the (basic) reproduction number R (R₀). However, the relation between the two is less straightforward than often assumed (see Sections 2, 5.4 and 6.3). Starting from its definition, the reproduction number R can be expressed as $R = Q \cdot p_{i} / p_{r}$ , where Q accounts for the depletion of the reservoir of susceptibles [the basic reproduction number follows as $R_{0} = Q_{0} \cdot p_{i} / p_{r}$ , where Q₀ is the value of Q at the beginning of an outbreak ( $t_{0} = 0$ )]. The criterion $p_{i} / p_{r} > 1$ for an epidemic to evolve is therefore not equivalent to R > 1 or $R_{0} > 0$ (which as often assumed however). Only when Q = 1 ( $Q_{0} = 1$ ) both criteria are equivalent. The issue can only be resolved by changing the definition of the reproduction number (by simply putting $R, R_{0} \equiv p_{i} / p_{r}$ ), or by relating the criterion $p_{i} / p_{r} > 1$ to the reproduction number by writing it as $R / Q > 1$ ( $R_{0} / Q_{0} > 1$ ). The often cited criterion $R, R_{0} > 1$ is therefore to be considered only as a crude approximation obtained under neglect of the depletion of the reservoir of susceptibles during the course of an outbreak (see Section 5.4).

The role of the network structure is already a central theme in Sections 1–7. After all, since the population is considered as a lattice or network, and since it is not difficult to imagine that the structure of such a population network will have at least some influence on the evolution of an epidemic, the effects of the details of the network have to be explicitly considered and accounted for, in order to obtain a realistic analysis. The results in the aforementioned sections clearly demonstrate this. However, there is barely another phenomenon where network effects manifest themselves in such an all-important way as in the percolation transition, the relevance of which to vaccination scenarios is almost self-evident by its very nature. Section 8 deals extensively with the phenomenon therefore. Via cluster-identification algorithms applied to data from simulations, a clear link is shown between vaccination-induced percolation transitions and the propagation of an infection through a population. It is shown that the fluctuations in the size of the final clusters of cumulative infections diverge when the percolation threshold is reached. It is also shown that such a divergence in the size-fluctuation can be related in general to a divergence in the correlation length (see Section 8.2). A divergence in the correlation length is a defining signature of a second-order phase transition in physical/thermodynamic systems. As such, the vaccine-induced herd-immunity threshold can be identified as the critical point of such a second-order phase transition. With some extensions, the framework outlined in Section 1 is able to cope with the observed percolation phenomena, including the percolation transition itself. That is, the variation as a function of the vaccination rate of the end-value of the number of cumulative infections, as well as its collapse at the percolation threshold can be adequately described. However, the variation in (average) cluster size and the divergence of the correlation length are beyond the scope of the presented framework. It should be realized however that these are difficult topics that cannot be caught easily (if at all) in a tractable set of formulas and require a numerical approach almost by nature. In Section 8.5, it is shown that the occurrence of percolation phenomena is not restricted to vaccinated populations alone. Including even percolation transitions, they also occur in unvaccinated populations, where they manifest themselves through a collapse of the end-value of the cumulative infection rate at a critical value of $r_{c} = p_{r} / p_{i}$ accompanied by a divergence in the size fluctuation of the clusters of cumulative infections, which is indicative of a second-order phase transition also in these cases. Cluster percolation plays a decisive role here (see Figs 26 and 28 and the related discussion in Section 8.5). The modified SIR-model of Section 1 gives an adequate description of the evolution of an epidemic in the case of unvaccinated populations as well, even when there is a percolation/herd-immunity transition.

The idea that the herd-immunity threshold actually represents a second-order phase transition is not entirely new, and was originally put forward by Grassberger [6] on the basis of a search for critical exponents that were in agreement with simulated data on the spread of an infection on a 2D square lattice in two limiting cases of the SIR-model. The present paper goes deeper and more directly into the physical aspects of the phenomenon however, by demonstrating the divergent behaviour of the standard deviation (basically representing the average fluctuation size) of the size of the clusters of accumulated infections, and by showing its relation to a divergence in the correlation length, thus corroborating Grassberger’s findings in a very illustrative way, as well as in a more realistic context [for instance, by considering a (random) distribution of initial infections instead of a single initial infection at the centre of the population as assumed by Grassberger].

The results in this article were predominantly obtained under the assumption that both infection removal and vaccination provide full immunity. Only in Section 7 on vaccination, the possibility of vaccines with an effectiveness ϵ less than 1 (i.e. less than 100%) was considered and also part of the analysis. In practice, both infection removal and vaccination may not result in full immunity of each population member involved however, especially when new variants of a pathogen with (slightly) different properties emerge to which the immune system does not have a fully effective response (like in the case of Covid-19 for instance). It is therefore a legitimate question whether the results and conclusions presented in this article would be different when infection removal and/or vaccination would only result in ‘partial’ immunity, that is when the population members that were vaccinated or the ones that have overcome an infection are not fully immune but may instead be susceptible to re-infection, albeit with a lesser transmission probability than the one that applies to the unvaccinated and also still uninfected (thus fully susceptible) population members. The analyses in the previous sections offer enough insights to allow for some expectations to be mentioned regarding this issue. For this purpose, it is important to notice that although the transmission probability that applies in relation to vaccinated population members and removed infections may be nonzero, a certain ‘stopping power’ is provided by a reduction of the transmission probability: a route of infection along members of the population may still pass a particular vaccinated node or removed infection, but with much greater difficulty (i.e. much less a probability) than when the node in question was still fully susceptible. Percolation effects are therefore still a very real possibility, and the growth rate of the number of active infections is (substantially) reduced when the vaccination rate and/or the fraction of removed infections are/is high. The effects of partial immunity are therefore expected to be quantitative rather than qualitative. To corroborate these expectations, an extension of the framework presented in this article to account also for cases of partial immunity would make an excellent suggestion for further research therewith. The lines along which such an extension may be conceived seem clear and could take the form of the introduction of a second transmission parameter $p'_{i}$ next to p_i (such that $p'_{i} < p_{i}$ ) which specifically applies to contacts between an active infection and a vaccinated population member or a removed infection. The spirit of such an approach was actually advanced already in Section 7 when we dealt with cases where $ϵ < 1$ [Equation (7.2) ff]. It is rather straightforward to take this approach one step further and to apply it to cases of partial immunity in general (cases of waning immunity included). It should be mentioned however, that those results presented in this article that were obtained on the basis of the assumption of full vaccine- or removal-acquired immunity represent fair and meaningful approximations for those cases where a substantial (near full) immunity can be obtained.

All together we may conclude that the results in this article provide quite some new insights in the spread of infectious diseases and the evolution of an epidemic. As stated earlier, it is at these insights that this article aims particularly, thus leading to new methods to forecast the evolution of ongoing epidemics. In fact, the article more than once points out that the goal of making such forecasts is quite ambitious and is met with serious fundamental issues. However, the author believes that the incorporation of the methods and viewpoints outlined in the previous sections could lead to valuable improvements of the models and methods presently in use for modelling the spread of infectious diseases. This applies to the more extended models based on the SIR-model (such as the SIRS, SEIR or SEIS models), as well as to more complicated compartmental models dealing with populations that already have some form of heterogeneity at the start of an outbreak. The subject of the article is also quite timeless indeed. At the very moment that this text is written, the Covid-19 pandemic seems to be in remission, but it is too early to tell what the near future will bring. In addition, it would be dangerously naive to assume that Covid-19 will be the last of the great pandemics, and that the SARS-CoV-2 family of viruses will be the last pathogens with the potential of causing pandemic outbreaks. Finally, it should also be realized that the viewpoints in this article equally apply to outbreaks of plant diseases in, for instance, densely packed mono-cultures and may therefore be of relevance in an agricultural context as well.

Glossary of most common symbols

s _s: fraction/rate of susceptible nodes/population members
s _i: active-infection rate
s _r: removed-infection rate
s: cumulative rate of infections
s ₀: initial value of active infection rate
$s_{0, b}$: initial value of active infection rate in bulk the cluster
$s_{0, c'}$: initial value of active infection rate in the secondary clusters
s _e: final value (asymptotic) of cumulative infection rate
w _i: transmission probability
p _i: transmission rate/parameter/constant
p _r: constant of removal
n _xy: number of xy pairs ( $x, y = s, i, r$ )
ν: number of contacts to a single node/population member
$〈 s_{y x} 〉$: average number of nodes of type y surrounding a node of type x ( $y, x = s, i, r$ )
f _cn: contact-frequency: number of contacts made per node per unit time
f _cl: contact-frequency: number of contacts made per link per unit time
f _cp: contact-frequency: number if contacts made per unit time throughout entire population
n: number of nodes/members in a population
n ₀: number of initial infections
n _e: final value of the number of cumulative infections
n _s: number of susceptibles
n _i: number of active infections
n _r: number of removed infections
${\bar{n}}_{b}$: relative (total) size of bulk cluster(s)
${\bar{n}}_{s, c'}$: relative total size of secondary clusters
$a_{n}^{s i}$: expansion coefficient for nth term in series expansion of $〈 s_{s i} 〉$ ( $n = 0, 1, 2….$ )
$a_{n}^{i s}$: expansion coefficient for nth term in series expansion of $〈 s_{i s} 〉$ ( $n = 0, 1, 2….$ )
R: reproduction number
R ₀: basic reproduction number
$s_{1, 2, 3}$: roots of third-order polynomial approximation of $〈 s_{s i} 〉$
$s_{\pm}$: roots of second-order polynomial approximation of $〈 s_{s i} 〉$ , or third order of $〈 s_{i s} 〉$
t: time
N: integer, scales the size of squares representing/enclosing a social bubble (size: $2 N + 1 \times 2 N + 1$ )
ξ ₀: relative reduction of social-bubble size at t = 0
ξ _e: relative reduction of social-bubble size for $t \to \infty$
ϵ: effectiveness of vaccine
$ξ_{v}, x_{v}$: vaccination rate, relative reduction of social-bubble size as a result of vaccination
x _c: percolation threshold
ξ: correlation length
$S_{c}, S_{c'}$: cluster size (number of nodes/members)
$S_{0}, S_{b}$: size of the bulk cluster (number of nodes/members)
${\bar{S}}_{c}, {\bar{S}}_{c'}$: relative cluster size (number of nodes/members)
${\bar{S}}_{0}$: relative size of the bulk cluster (number of nodes/members)
${\bar{S}}_{c'}$: expectation value of relative total size of secondary clusters
A _c: cluster area

Appendix 1

Solving the differential equations

(a) Third-order polynomial approximation of $〈 s_{s i} 〉$

Truncation of the terms of order >3 in the series expansions representing $〈 s_{s i} 〉$ in (1.42a,b) has the advantage that the resulting set of ODE’s (3.11a,b) can be solved easily via partially analytical methods. With $\dot{s_{i}} = \dot{s} (\partial s_{i} / \partial s)$ , we can rewrite (3.11a) as:

\dot{s} (\frac{\partial s_{i}}{\partial s}) = \dot{s} - p_{r} s_{i}

which, upon writing (3.11b) as $\dot{s} = s_{i} P_{i} (s)$ , can be reworked into:

(\frac{\partial s_{i}}{\partial s}) = 1 - \frac{p_{r}}{P_{i} (s)} .

(A1.1)

Substitution of the term in brackets in (3.11a) for $P_{i} (s)$ subsequently yields:

(\frac{\partial s_{i}}{\partial s}) = 1 - \frac{ν p_{r} / p_{i}}{ν + a_{1}^{s i} s + a_{2}^{s i} s^{2} + a_{3}^{s i} s^{3}}

which we rewrite as:

(\frac{\partial s_{i}}{\partial s}) = 1 - \frac{ν p_{r} / a_{3}^{s i} p_{i}}{(s - s_{1}) (s - s_{2}) (s - s_{3})}

(A1.2)

with $s_{1, 2, 3}$ representing the roots (real and complex) of the third-order polynomial $a_{3}^{s i} s^{3} + a_{2}^{s i} s^{2} + a_{1}^{s i} s + ν = 0$ , which can be calculated exactly via the somewhat tedious algebraic scheme of Cardano’s method (see, for instance, Ref. [46]). This scheme is easily implemented in a computational procedure however. We rewrite the fraction on the right-hand side of (A1.2) via decomposition by parts:

(\frac{\partial s_{i}}{\partial s}) = 1 - (\frac{ν p_{r}}{a_{3}^{s i} p_{i}}) [\frac{a}{(s - s_{1})} + \frac{b}{(s - s_{2})} + \frac{c}{(s - s_{3})}],

(A1.3)

where the a, b and c are readily obtained as

a = \frac{s_{3} - s_{2}}{s_{2} s_{3} (s_{3} - s_{2}) + s_{1} s_{3} (s_{1} - s_{3}) + s_{1} s_{2} (s_{2} - s_{1})} .

(A1.4a)

b = \frac{s_{1} - s_{3}}{s_{2} s_{3} (s_{3} - s_{2}) + s_{1} s_{3} (s_{1} - s_{3}) + s_{1} s_{2} (s_{2} - s_{1})} .

(A1.4b)

c = \frac{s_{2} - s_{1}}{s_{2} s_{3} (s_{3} - s_{2}) + s_{1} s_{3} (s_{1} - s_{3}) + s_{1} s_{2} (s_{2} - s_{1})} .

(A1.4c)

Integration of (A1.3) is straightforward:

\begin{matrix} s_{i} = \int 1 - (\frac{ν p_{r}}{a_{3}^{s i} p_{i}}) [\frac{a}{(s - s_{1})} + \frac{b}{(s - s_{2})} + \frac{c}{(s - s_{3})}] d s \\ = s - \frac{ν p_{r}}{a_{3}^{s i} p_{i}} \cdot log ({(s - s_{1})}^{a} {(s - s_{2})}^{b} {(s - s_{3})}^{c}) + C \end{matrix},

(A1.5)

where the complex logarithm function is implicated and C represents the constant of integration. The latter follows from the (initial) condition that $s_{i} = s = s_{0}$ when t = 0:

C = \frac{ν p_{r}}{a_{3}^{s i} p_{i}} \cdot log ({(s_{0} - s_{1})}^{a} {(s_{0} - s_{2})}^{b} {(s_{0} - s_{3})}^{c})

(A1.6)

so that (A1.5) can be rewritten as:

s_{i} = s - \frac{ν p_{r}}{a_{3}^{s i} p_{i}} \cdot log (\frac{{(s_{0} - s_{1})}^{a} {(s_{0} - s_{2})}^{b} {(s_{0} - s_{3})}^{c}}{{(s - s_{1})}^{a} {(s - s_{2})}^{b} {(s - s_{3})}^{c}}) .

(A1.7)

Via substitution of this result into Equation (3.11b), the following nonlinear ordinary differential equation for s as a function of t is obtained at last:

\dot{s} \approx p_{i} (1 + \frac{a_{1}^{s i}}{ν} s + \frac{a_{2}^{s i}}{ν} s^{2} + \frac{a_{3}^{s i}}{ν} s^{3}) [s - \frac{ν p_{r}}{a_{3}^{s i} p_{i}} \cdot log (\frac{{(s_{0} - s_{1})}^{a} {(s_{0} - s_{2})}^{b} {(s_{0} - s_{3})}^{c}}{{(s - s_{1})}^{a} {(s - s_{2})}^{b} {(s - s_{3})}^{c}})],

(A1.8)

which can only be solved numerically, thus finalizing the solution of the system of ODE’s given by (3.11a) and (3.11b).

(b) Third-order polynomial approximation of $〈 s_{i s} 〉$

When approximating $〈 s_{i s} 〉$ via a series expansion up to third order in s, we can obtain the solution of the system of ODE’s that (1.43a) and (1.43b) therewith become along somewhat similar lines. We have:

\dot{s} = p_{i} (1 - s) \frac{〈 s_{i s} 〉}{ν} = p_{i} (1 - s) \cdot \frac{(a_{1}^{i s} s + a_{2}^{i s} s^{2} + a_{3}^{i s} s^{3})}{ν}

(A1.9a)

or, equivalently:

\frac{\dot{s}}{s (1 - s) (s - s_{+}) (s - s_{-})} = \frac{a_{3}^{i s} p_{i}}{ν}

(A1.9b)

where the $s_{\pm}$ represent the roots of $a_{3}^{i s} s^{2} + a_{2}^{i s} s + a_{1}^{i s} = 0$ . Via separation by parts and some algebra we can write:

\frac{1}{s (1 - s) (s - s_{+}) (s - s_{-})} = \frac{a}{s} + \frac{c}{1 - s} + \frac{c}{s - s_{+}} - \frac{d}{s - s_{-}},

(A1.10)

where

\begin{matrix} a = \frac{1}{s_{+} s_{-}} b = \frac{1}{s_{+} (1 - s_{+}) (1 - s_{-})} \\ c = \frac{1}{s_{+} (1 - s_{+}) (s_{+} - s_{-})} d = \frac{1}{s_{-} (1 - s_{-}) (s_{+} - s_{-})} \end{matrix} .

(A1.11)

Remember that $\dot{s} d t = d s$ since s is a ‘state variable’. The solutions of the differential equations (A1.9a,b) are now easily obtained by substitution of (A1.10) into:

\int \frac{d s}{s (1 - s) (s - s_{+}) (s - s_{-})} = \frac{a_{3}^{i s} p_{i} t}{ν} + C

(A1.12)

yielding, apart from a constant of integration:

\int \frac{d s}{s (1 - s) (s - s_{+}) (s - s_{-})} = log {\frac{s^{a}}{{(1 - s)}^{b}} \cdot \frac{{(s - s_{+})}^{c}}{{(s - s_{-})}^{d}}},

(A1.13)

where, as in Section 3.1, the complex logarithm function is applied.

Via substitution of (A1.11) for $a, b, c, d$ into (A1.13) we get, after some algebra:

\begin{array}{l} log {s^{\frac{1}{s_{+} s_{-}}} \cdot {((s - s_{+}) {(1 - s)}^{- s_{+}^{2}})}^{\frac{1}{s_{+} (1 - s_{+}) (s_{+} - s_{-})}} \cdot {((s - s_{-}) {(1 - s)}^{- s_{-}^{2}})}^{\frac{1}{s_{-} (1 - s_{-}) (s_{-} - s_{+})}}} \\ = \frac{a_{3}^{i s} p_{i} t}{ν} + C \end{array} .

(A1.14)

The constant of integration C is determined by the value $s'$ of s at some arbitrary time $t = t' > 0$ :

\begin{matrix} C = (log {s'^{\frac{1}{s_{+} s_{-}}} \cdot {((s' - s_{+}) {(1 - s')}^{- s_{+}^{2}})}^{\frac{1}{s_{+} (1 - s_{+}) (s_{+} - s_{-})}} \\ \cdot {((s' - s_{-}) {(1 - s')}^{- s_{-}^{2}})}^{\frac{1}{s_{-} (1 - s_{-}) (s_{-} - s_{+})}}} - \frac{a_{3}^{i s} p_{i} t'}{ν}) \end{matrix} .

(A1.15)

Combining (A1.14) and (A1.15), we then obtain:

\begin{matrix} log {{(\frac{s}{s'})}^{\frac{1}{s_{+} s_{-}}} \cdot {(\frac{(s - s_{+}) {(1 - s)}^{- s_{+}^{2}}}{(s' - s_{+}) {(1 - s')}^{- s_{+}^{2}}})}^{\frac{1}{s_{+} (1 - s_{+}) (s_{+} - s_{-})}} \cdot {(\frac{(s - s_{-}) {(1 - s)}^{- s_{-}^{2}}}{(s' - s_{-}) {(1 - s')}^{- s_{-}^{2}}})}^{\frac{1}{s_{-} (1 - s_{-}) (s_{-} - s_{+})}}} \\ = \frac{a_{3}^{i s} p_{i} (t - t')}{ν} \end{matrix} .

(3.12)

Note that the argument of the logarithm is always real, for when one of the roots $s_{\pm}$ is complex, the other one is its complex conjugate.

Appendix 2

An eigenvalue equation relating the s_x and the $〈 s_{y x} 〉$

We suppose three types of nodes a, b and c. Using (1.14) and (1.15):

\begin{matrix} n_{x y} = n s_{x} 〈 s_{y x} 〉 = n s_{y} 〈 s_{y x} 〉 \\ n_{x x} = \frac{1}{2} n s_{x} 〈 s_{x x} 〉 \end{matrix}

we can rewrite the relation between the numbers of aa, ab and ac pairs, respectively, represented by n_aa, n_ab and n_ac:

2 n_{a a} + n_{a b} + n_{a c} = ν s_{a}

as follows:

s_{a} 〈 s_{a a} 〉 + s_{b} 〈 s_{a b} 〉 + s_{c} 〈 s_{a c} 〉 = ν s_{a} .

(A2.1)

Upon systematic permutation of a, b, c in (A2.1), we also obtain:

s_{a} 〈 s_{b a} 〉 + s_{b} 〈 s_{b b} 〉 + s_{c} 〈 s_{b c} 〉 = ν s_{b},

(A2.2)

s_{a} 〈 s_{c a} 〉 + s_{b} 〈 s_{c b} 〉 + s_{c} 〈 s_{c c} 〉 = ν s_{c} .

(A2.3)

We can write (A2.1), (A2.2) and (A2.3) in matrix form as:

(\begin{matrix} 〈 s_{a a} 〉 & 〈 s_{a b} 〉 & 〈 s_{a c} 〉 \\ 〈 s_{b a} 〉 & 〈 s_{b b} 〉 & 〈 s_{b c} 〉 \\ 〈 s_{c a} 〉 & 〈 s_{c b} 〉 & 〈 s_{c c} 〉 \end{matrix}) (\begin{matrix} s_{a} \\ s_{b} \\ s_{c} \end{matrix}) = ν (\begin{matrix} s_{a} \\ s_{b} \\ s_{c} \end{matrix}),

(A2.4)

which is essentially an eigenvalue equation with the number of links per node ν as the eigenvalue. When the number of nodes of a certain type, say c, is zero (i.e. s_c = 0 so that $〈 s_{x c} 〉 = 〈 s_{c x} 〉 = 0$ for $x = a, b, c$ ) we get the simplified eigenvalue equation:

(\begin{matrix} 〈 s_{a a} 〉 & 〈 s_{a b} 〉 \\ 〈 s_{b a} 〉 & 〈 s_{b b} 〉 \end{matrix}) (\begin{matrix} s_{a} \\ s_{b} \end{matrix}) = ν (\begin{matrix} s_{a} \\ s_{b} \end{matrix}) .

(A2.5)

Appendix 3

Derivation of the coefficients $a_{n} ({\bar{n}}_{b})$

Substitution of (8.28) into (8.27) yields for $x_{v} \neq 1$ , and with i = n and $j = n + m$ :

1 + \sum_{n = 1}^{\infty} \sum_{m = - \infty}^{\infty} α_{m, n} {\bar{n}}_{b}^{n + m} = 1 + \sum_{i = 1}^{\infty} \sum_{j = - \infty}^{\infty} α_{j - i, i} {\bar{n}}_{b}^{j} = 0 .

(A3.1)

Collecting terms with equal powers of ${\bar{n}}_{b}$ we get, for $j = n + m = 0$ :

1 + \sum_{i = 1}^{\infty} α_{- i, i} = 0

(A3.2a)

and for $j = n + m \neq 0$ :

\sum_{i = 1}^{\infty} α_{j - i, i} = 0 .

(A3.2b)

Following the approach in Section 3.1, by assuming that we can neglect terms of third and higher order in the series expansion of ${〈 s_{s i} 〉}_{b}$ , only $a_{1} ({\bar{n}}_{b})$ and $a_{2} ({\bar{n}}_{b})$ will be relevant. For n = 1 and n = 2 (8.28) then yields, respectively,

a_{1} = \frac{α_{- 1, 1}}{{\bar{n}}_{b}} + \sum_{m = 0}^{\infty} α_{m, 1} {\bar{n}}_{b}^{m}

(A3.3a)

a_{2} = \frac{α_{- 1, 2}}{{\bar{n}}_{b}} + \frac{α_{- 2, 2}}{{\bar{n}}_{b}^{2}} + \sum_{m = 0}^{\infty} α_{m, 2} {\bar{n}}_{b}^{m} .

(A3.3b)

Note that $α_{m, n} = 0$ for $m < - 1$ when $n = 1$ , and $α_{m, n} = 0$ for $m < - 2$ when n = 2 since otherwise ${〈 s_{s i} 〉}_{b}$ would diverge for ${\bar{n}}_{b} \to 0$ . For the second-order polynomial approximations of ${〈 s_{s i} 〉}_{b}$ considered here, the following constraint must apply when $s_{b} = {\bar{n}}_{b}$ [see (8.25)]:

{〈 s_{s i} 〉}_{b} \approx a_{0}^{(b)} + a_{1}^{(b)} {\bar{n}}_{b} + a_{2}^{(b)} {\bar{n}}_{b}^{2} \equiv 0

(A3.4)

so that in combination with (8.26) we get:

1 + a_{1} {\bar{n}}_{b} + a_{2} {\bar{n}}_{b}^{2} \equiv 0 .

(A3.5a)

That is,

a_{2} = - \frac{1 + a_{1} {\bar{n}}_{b}}{{\bar{n}}_{b}^{2}} .

(A3.5b)

Substitution of (A3.3a) then yields:

\begin{matrix} a_{2} = - \frac{1 + α_{- 1, 1} + {\bar{n}}_{b} \sum_{j = 0}^{\infty} α_{j, 1} {\bar{n}}_{b}^{j}}{{\bar{n}}_{b}^{2}} \\ = - \frac{(1 + α_{- 1, 1})}{{\bar{n}}_{b}^{2}} - \frac{α_{0, 1}}{{\bar{n}}_{b}} - \sum_{j = 0}^{\infty} α_{j + 1, 1} {\bar{n}}_{b}^{j} \end{matrix}

(A3.6)

from which, by comparison with (A3.3b), $α_{- 1, 2}$ and $α_{- 2, 2}$ can be identified as:

α_{- 1, 2} = - α_{0, 1} α_{- 2, 2} = - (1 + α_{- 1, 1})

(A3.7a)

a result in agreement with (A3.2a,b). Also by comparison of (A3.6) and (A3.3b), the $α_{j, 2}$ for $j \geq 0$ follows as:

α_{j, 2} = - α_{j + 1, 1} .

(A3.7b)

By combining (8.26), (A3.3a,b) and (A3.6), the Laurent series in ${\bar{n}}_{b}$ for $a_{1}^{(b)}$ and $a_{1}^{(b)}$ can thus be written as:

a_{1}^{(b)} = ν (1 - x_{v}) [\frac{α_{- 1, 1}}{{\bar{n}}_{b}} + \sum_{j = 0}^{\infty} α_{j, 1} {\bar{n}}_{b}^{j}]

(8.28a)

a_{2}^{(b)} = ν (x_{v} - 1) [\frac{α_{0, 1}}{{\bar{n}}_{b}} + \frac{(1 + α_{- 1, 1})}{{\bar{n}}_{b}^{2}} + \sum_{j = 0}^{\infty} α_{j + 1, 1} {\bar{n}}_{b}^{j}] .

(8.28b)

Appendix 4

Derivation of the coefficients $a_{n}^{c'} ({\bar{S}}_{c'})$

Analogous to (8.24) we write ${〈 s_{s i} 〉}_{c'}$ as:

{〈 s_{s i} 〉}_{c'} = ν (1 - x_{v}) + \sum_{n = 1}^{\infty} a_{n}^{(c')} s_{c'}^{n} .

(A4.1)

When we approximate (8.33) as $s_{c'} = {\bar{S}}_{c'} = c s_{0, c'} 〈 {\bar{S}}_{c'} 〉$ , the following condition must then be met:

{〈 s_{s i} 〉}_{c'} = ν (1 - x_{v}) + \sum_{n = 1}^{\infty} a_{n}^{(c')} {\bar{S}}_{c'}^{n} = 0 .

(A4.2)

Introducing the function $a_{n}^{'} = a_{n}^{'} ({\bar{S}}_{c'})$ as the analogue of the previously introduced function $a_{n} = a_{n} ({\bar{n}}_{b})$ :

a_{n}^{(c')} = a_{n}^{'} \cdot ν (1 - x_{v})

(A4.3)

we reexpress (A4.2) as:

{〈 s_{s i} 〉}_{c'} = ν (1 - x_{v}) {1 + \sum_{n = 1}^{\infty} a_{n}^{'} {\bar{S}}_{c'}^{n}} = 0 .

(A4.4)

The Laurent series of $a_{n}^{'}$ in ${\bar{S}}_{c'}$ is written as:

a_{n}^{'} ({\bar{S}}_{c'}) = \sum_{m = - \infty}^{\infty} α_{m, n}^{'} {\bar{S}}_{c'}^{m} .

(A4.5)

Substitution of (A4.5) into (A4.4) then yields, with i = n and $j = n + m$ :

1 + \sum_{n = 1}^{\infty} \sum_{m = - \infty}^{\infty} α_{m, n}^{'} {\bar{S}}_{c'}^{n + m} = 1 + \sum_{i = 1}^{\infty} \sum_{j = - \infty}^{\infty} α_{j - i, i}^{'} {\bar{S}}_{c'}^{j} = 0 .

(A4.6)

Collecting terms with equal powers of ${\bar{S}}_{c'}$ we get, for $j = n + m = 0$ :

1 + \sum_{i = 1}^{\infty} α_{- i, i}^{'} = 0

(A4.7a)

and for $j = n + m \neq 0$ :

\sum_{i = 0}^{\infty} α_{j - i, i}^{'} = 0 .

(A4.7b)

Neglecting terms of third and higher order in ${\bar{S}}_{c'}$ in the series expansion of ${〈 s_{s i} 〉}_{c'}$ , only $a_{1}^{'} ({\bar{S}}_{c'})$ and $a_{2}^{'} ({\bar{S}}_{c'})$ remain relevant. For n = 1 and n = 2, we get from (A4.5):

a_{1}^{'} = \frac{α_{- 1, 1}^{'}}{{\bar{S}}_{c'}} + \sum_{j = 0}^{\infty} α_{j, 1}^{'} {\bar{S}}_{c'}^{j}

(A4.8a)

a_{2}^{'} = \frac{α_{- 1, 2}^{'}}{{\bar{S}}_{c'}} + \frac{α_{- 2, 2}^{'}}{{\bar{S}}_{c'}^{2}} + \sum_{j = 0}^{\infty} α_{j, 2}^{'} {\bar{S}}_{c'}^{j} .

(A4.8b)

Analogous to (A3.4), for the second-order polynomial approximations of ${〈 s_{s i} 〉}_{c'}$ , the following constraint must apply with $s_{c'} = {\bar{S}}_{c'}$ [see (A4.2)]:

{〈 s_{s i} 〉}_{c'} \approx a_{0}^{(c')} + a_{1}^{(c')} {\bar{S}}_{c'} + a_{2}^{(c')} {\bar{S}}_{c'}^{2} \equiv 0

(A4.9)

so that in combination with (A4.3) we get, for $x_{v} \neq 1$ :

1 + a_{1}^{'} {\bar{S}}_{c'} + a_{2}^{'} {\bar{S}}_{c'}^{2} \equiv 0 .

(A4.10a)

That is,

a_{2}^{'} = - \frac{1 + a_{1}^{'} {\bar{S}}_{c'}}{{\bar{S}}_{c'}^{2}} .

(A4.10b)

Substitution of (A4.8a) then yields:

\begin{matrix} a_{2}^{'} = - \frac{1 + α_{- 1, 1}^{'} + {\bar{S}}_{c'} \cdot \sum_{j = 0}^{\infty} α_{j, 1}^{'} {\bar{S}}_{c'}^{j}}{{\bar{S}}_{c'}^{2}} \\ = - \frac{(1 + α_{- 1, 1}^{'})}{{\bar{S}}_{c'}^{2}} - \frac{α_{0, 1}^{'}}{{\bar{S}}_{c'}} - \sum_{j = 0}^{\infty} α_{j + 1, 1}^{'} {\bar{S}}_{c'}^{j} \end{matrix}

(A4.11)

from which $α_{- 1, 2}^{'}$ and $α_{- 2, 2}^{'}$ can be identified as:

α_{- 2, 2}^{'} = - (1 + α_{- 1, 1}^{'}) α_{- 1, 2}^{'} = - α_{0, 1}^{'},

(A4.12a)

respectively, in agreement with (A4.7a) and (A4.7b). For the $α_{m, 2}^{'}$ when $m \geq 0$ , we get from comparison of (A4.11) and (A4.8b):

α_{m, 2}^{'} = - α_{m + 1, 1}^{'} .

(A4.12b)

By combining (A4.4), (A4.8a) and (A4.11), the Laurent series in ${\bar{S}}_{c'}$ for $a_{1}^{(c')}$ and $a_{2}^{(c')}$ can thus be written as:

a_{1}^{(c')} = ν (1 - x_{v}) [\frac{α_{- 1, 1}^{'}}{{\bar{S}}_{c'}} + \sum_{j = 0}^{\infty} α_{j, 1}^{'} {\bar{S}}_{c'}^{j}]

(A4.13a)

a_{2}^{(c')} = ν (x_{v} - 1) [\frac{α_{0, 1}^{'}}{{\bar{S}}_{c'}} + \frac{(1 + α_{- 1, 1}^{'})}{{\bar{S}}_{c'}^{2}} + \sum_{j = 0}^{\infty} α_{j + 1, 1}^{'} {\bar{S}}_{c'}^{j}] .

(A4.13b)

Appendix 5

Derivation of an approximative expression for ${\bar{S}}_{c'} = {\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉)$

We reasonably assume ${\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉)$ to be a continuous function on the relevant part $0 \leq s_{0, c'} \leq n_{s, c'} \land 0 \leq 〈 {\bar{S}}_{c'} 〉 \leq 1$ of its domain, and that the Taylor series of ${\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉)$ in $〈 {\bar{S}}_{c'} 〉$ and $s_{0, c'}$ exists on this domain part, so that:

{\bar{S}}_{c'} (n_{0, c'}, 〈 {\bar{S}}_{c'} 〉) = \sum_{n = 0}^{\infty} \sum_{m = 0}^{n} c_{n, m} s_{0, c'}^{n - m} {〈 {\bar{S}}_{c'} 〉}^{m} .

(8.32)

Since ${\bar{S}}_{c'} = 0$ for $s_{0, c'} = 0$ :

\sum_{n = 0}^{\infty} c_{n, n} {〈 {\bar{S}}_{c'} 〉}^{n} = 0 .

(A5.1a)

Furthermore, since ${\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉)$ is a continuous function on the relevant part of its domain, and since ${\bar{S}}_{c'} = 0$ for $〈 {\bar{S}}_{c'} 〉 = 0$ , it is not difficult to see that for all $s_{0, c'}$ :

\lim_{〈 {\bar{S}}_{c'} 〉 \to 0} {\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉) = 0 .

Hence,

\sum_{n = 0}^{\infty} c_{n, 0} s_{0, c'}^{n} = 0 .

(A5.1b)

Based on (A5.1a) and (A5.1b), the series expansion in (8.32) can be reduced to:

{\bar{S}}_{c'} (n_{0, c'}, 〈 {\bar{S}}_{c'} 〉) = \sum_{n = 2}^{\infty} \sum_{m = 1}^{n - 1} c_{n, m} s_{0, c'}^{n - m} {〈 {\bar{S}}_{c'} 〉}^{m}

(A5.2)

leaving (with $i = n - m$ and j = m) a summation over terms $c_{i + j, j} s_{0, c'}^{i} {〈 {\bar{S}}_{c'} 〉}^{j}$ with $i, j > 0$ [as required by (A5.1a) and (A5.1b)]. We assume that a sufficient part of the secondary clusters, and also the number of initial infections $n_{0, c'} = n_{s} s_{0, c'}$ are small enough to neglect all terms with i > 1 and j > 2. We can therefore put:

{\bar{S}}_{c'} \equiv c_{2, 1} s_{0, c'} 〈 {\bar{S}}_{c'} 〉 + c_{3, 2} s_{0, c'} {〈 {\bar{S}}_{c'} 〉}^{2} = c_{2, 1} s_{0, c'} 〈 {\bar{S}}_{c'} 〉 (1 + (c_{3, 2} / c_{2, 1}) 〈 {\bar{S}}_{c'} 〉) .

(8.33)

Footnotes

A variation on the famous sentence by Richard Hamming: “The purpose of scientific computing is insight, not numbers.”

Note that these identities reflect that in the SIR-model all susceptibles are considered to be equally vulnerable and to equally respond to an infection: for quantifying the transmission rate and the removal rate only a single parameter (p_i, respectively, p_r) is used, instead of a multitude (distribution) of p_i- and p_r-values. Extensions of the SIR-model that account for variations in susceptibility among the (susceptible) contacts of a node or variations in infection response (and the corresponding heterogeneity of the population thus implied) exist (see for instance [23]), but are beyond the scope of this article.

That is, the number of contacts/connections of a node.

⁴

This phenomenon will be addressed extensively in Section 6.

⁵

The second derivative $\partial^{2} g (s) / \partial s^{2}$ is given by: $\frac{\partial^{2} g (s)}{\partial s^{2}} = \frac{α (s_{-} - s_{+})}{η^{2}} \cdot \frac{e^{s / η} (α e^{s / η} + 1)}{{(α e^{s / η} - 1)}^{3}} \approx \frac{α (s_{-} - s_{+})}{η^{2}} \cdot \frac{e^{s / η} (s_{+} e^{s / η} + s_{-})}{{(s_{+} e^{s / η} - s_{-})}^{3}},$ where the approximation on the right applies when $s_{0} ≪ 1$ or, more in particular, when $s_{0} ≪ | s_{+} |, | s_{-} |$ (which is basically the regime we focus on). Provided that $s_{+} \neq s_{-}$ , the numerator of the second term of the approximation on the right is positive or negative definite when $s_{+}, s_{-} < 0$ or $s_{+}, s_{-} > 0$ . When $s_{+}$ and $s_{-}$ have different sign [i.e. when (5.13a) applies], the numerator may change sign only for some s < 0 (since $η \leq 0$ ), whereas the denominator does not change sign for any $s \in R$ . Therefore, if $\partial^{2} g (s) / \partial s^{2}$ changes sign in the cases of our interest, it can only be when $s_{+}$ and $s_{-}$ have different sign, and only for some s < 0.

⁶

Be aware that when $a_{1}^{s i} = - ν$ and $a_{n}^{s i} = 0$ for n > 1, Equations (1.42a,b), respectively, reduce to Equations (1.6) and (1.7) of the standard SIR-model.

⁷

The equation for $〈 t 〉$ in this case is: $〈 t 〉 = \int_{0}^{\infty} t {\dot{s}}_{i} (t) d t = p_{r} \int_{0}^{\infty} t e^{- p_{r} t} d t = 1 / p_{r}$ .

⁸

Note that the effects of waning immunity are ignored therewith. However, for many practical cases this is not an unreasonable working hypothesis (see also the remarks on this matter in Section 0).

⁹

Note that

n_{e} / n_{s}

relates to the final cumulative infection-rate

s_{e} = n_{e} / n

and the vaccination rate x_v via

n_{e} / n_{s} = s_{e} / s_{s} = s_{e} / (1 - x_{v})

¹⁰

(i.e. the number of initial infections in a cluster).

¹¹

Systematic numerical evaluation is possible.

¹²

Note that $s'_{-} < 0$ for $p_{r} / p_{i} > 1$ , consistent with the results in Section 5 demonstrating the impossibility of an epidemic when $p_{r} / p_{i} > 1$ .

¹³

n being the total number of nodes/individuals in the population lattice.

Data availability

The FORTRAN code of the program epidesim used for the Monte-Carlo simulations presented in this article is available (together with a short outline on how to use the code and the program itself) in Zenodo and can be accessed at https://dx.doi.org/10.5281/zenodo.8002418.

Author contributions

J. H. V. J. Brabers (Conceptualization [lead], Methodology [lead])

Conflict of interest statement. None declared.

References

1. Avila M, Saïd N, Ojcius DM.. The book reopened on infectious diseases. Microbes Infect 2008;10:942–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Kermack WO, McKendrick AG.. Contribution to the mathematical theory of epidemics. Proc R Soc A 1927;115:700–21. [DOI] [PubMed] [Google Scholar]
3. Ising E. Beiträge zur theorie des ferromagnetismus. Z Physik 1925;31:253–8. [Google Scholar]
4. Brauer F. Mathematical epidemiology: past, present, and future. Infect Dis Model 2017;2:113–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Stauffer D, Aharony A, Filk T.. Perkolationstheorie (transl.). VCH Verlagsgesellschaft Weinheim, 1995. ISBN 3-527-29334-5. [Google Scholar]
6. Grassberger P. On the critical behaviour of the general epidemic process and dynamical percolation. Math Biosci 1983;63:157–72. [Google Scholar]
7. Moore C, Newman MEJ.. Epidemics and percolation in small-world networks. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics 2000;61:5678–82. [DOI] [PubMed] [Google Scholar]
8. Newman MEJ. Spread of epidemic diseases on networks. Phys Rev E Stat Nonlin Soft Matter Phys 2002;66:016128. [DOI] [PubMed] [Google Scholar]
9. Davis S, Trapman P, Leirs H. et al. The abundance threshold for plague as a critical percolation phenomenon. Nature 2008;454:634–7. [DOI] [PubMed] [Google Scholar]
10. Binder K, Heerman DW.. Monte Carlo Simulation in Statistical Physics. Berlin, Heidelberg: Springer-Verlag, 1988. ISBN 0-540-19107-0. [Google Scholar]
11. Potter GM. Contagious Abortion of Cattle. Kansas State Agricultural College Circular, No. 69, 1918. [Google Scholar]
12. Mcdermott A. Herd immunity is an important, and often misunderstood, public health phenomenon. Proc Natl Acad Sci USA 2021;118:e2107692118. [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Randolph HE, Barreiro LB.. Herd Immunity: understanding COVID-19. Immunity 2020;52:737–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
14. Read JM, Keeling M.. Disease evolution on networks: the role of contact structure. Proc R Soc Lond B 2003;270:699–708. [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Keeling MJ, Eames KTD.. Networks and epidemic models. J R Soc Interface 2005;2:295–307. [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Roy M, Pascual M.. On representing network heterogeneities in the incidence rate of simple epidemic models. Ecol Complex 2006;3:80–90. [Google Scholar]
17. Bansal S, Grenfell BT, Meyers LA.. When individual behaviour matters: homogeneous network models in epidemiology. J R Soc Interface 2007;4:879–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Gou W, Jin Z.. How heterogeneous susceptibility and recovery rates affect the spread of epidemics on networks. Infect Dis Model 2017;2:353–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Stein C, Nassereldine H, Sorensen RJD. et al. ; Covid-19 Forecasting Team. Past SARS-CoV-2 infection protection against re-infection: a systematic review and meta-analysis. Lancet 2023;401:833–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Huang K. Statistical Mechanics, 2nd edn, Chapter 17, p. 422ff.New York: Wiley & Sons Inc., 1987. ISBN: 0-471-85913-3. [Google Scholar]
21. Pastor-Satorras R, Vespignani A.. Epidemic spreading in scale-free networks. Phys Rev Lett 2001;86:3200–3. [DOI] [PubMed] [Google Scholar]
22. Boguñá M, Pastor-Satorras R, Vespignani A.. Absence of epidemic threshold in scale-free networks with degree correlations. Phys Rev Lett 2003;90:028701. [DOI] [PubMed] [Google Scholar]
23. Neipel J, Bauermann J, Bo S. et al. Power-law population heterogeneity governs epidemic waves. PLoS ONE 2020;15:e0239678. [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Broido AD, Clauset A.. Scale-free networks are rare. Nat Commun 2019;10:1017. [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Voitalov I, van der Hoorn P, van der Hofstad R. et al. Scale-free networks well done. Phys Rev Res 2019;1:033034. [Google Scholar]
26. Heesterbeek JAP. $R_{0}$ . Ph.D. Thesis, Leiden University, The Netherlands, 1992.
27. Ross R. The Prevention of Malaria, London: John Murray (Publ.), 1910. [Google Scholar]
28. Allen LA. A primer on stochastic models: formulation, numerical simulation and analysis. Infect Dis Model 2017;2:128–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
29. Angelov AG, Slavtchova-Bojhova M.. Bayesian estimation of the offspring mean on branching processes: application to infectious disease data. Comput Math Appl 2012;64:229–35. [Google Scholar]
30. Balescu R. Equilibrium and Nonequilibrium Statistical Mechanics (reprint edition). Malabar, FL: Krieger Publishing Company, 1991, 379. ISBN 0-89464-524-2. [Google Scholar]
31. Mohn P. Magnetism in the Solid State, Chapter 6, Berlin, Heidelberg: Springer-Verlag, 2003. ISBN 3-540-431183-7. [Google Scholar]
32.Webdata from Statistics Norway/Statistisk sentralbyrå [Norwegian (public) bureau of statistics].
33. Verhulst P-F. Notice sur la loi que la population poursuit dans son accroissement. In: Correspondance Mathématique et Physique, Vol. 10. Société Belge de Librairie Hauman et Cie Bruxelles, 1838, 113–21. [Google Scholar]
34. Mossong J, Hens N, Jit M. et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med 2008;5:e74.doi: 10.1371/journal.pmed.0050074 [DOI] [PMC free article] [PubMed] [Google Scholar]
35. Brooks-Pollock E, Read JM, McLean AR. et al. Mapping social distancing measures to the reproduction number for COVID-19. Philos Trans R Soc Lond B Biol Sci 2021;376:20200276.doi: 10.1098/rstb.2020.0276 [DOI] [PMC free article] [PubMed] [Google Scholar]
36. Hill RA, Dunbar RIM.. Social network size in humans. Hum Nat 2003;14:53–72. doi: 10.1007/s12110-003-1016-y. [DOI] [PubMed] [Google Scholar]
37. Huang K. Statistical Mechanics, 2nd edn, Chapter 14, p. 341ff.New York: Wiley & Sons Inc., 1987. ISBN 0-471-85913-3. [Google Scholar]
38. Goldenfeld N. Lectures on Phase Transitions and the Renormalisation Group 111. (paperback edition). Addison Wesley Publishing Company, 1992. ISBN 0-201-55409-7. [Google Scholar]
39. Chikazumi S, Charap SH.. The Physics of Magnetism. Malabar, FL: Krieger Publishing Company, 1978, p. 64ff. ISBN 0882756621. [Google Scholar]
40. Heffernan JM, Smith RJ, Wahl LM.. Perspectives on the basic reproductive ratio. J R Soc Interface 2005;2:281–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
41. Heesterbeek JAP, Dietz K.. The concept of R₀ in epidemic theory. Stat Neerl 1996;50:89–110. [Google Scholar]
42. Grimmett G. Percolation, 2nd edn. Berlin, Heidelberg: Springer Verlag, 1999. ISBN 3-540-64902-6. [Google Scholar]
43. Malarz K, Galam S.. Square-lattice site percolation at increasing ranges of neighbour bonds. Phys Rev E Stat Nonlin Soft Matter Phys 2005;71:016125. [DOI] [PubMed] [Google Scholar]
44. Kadanoff LP. Scaling laws for Ising models near Tc. Phys Phys Fiz 1966;2:263–72. [Google Scholar]
45. Mori T. Analysis of the exactness of mean-field theory in long-range interacting systems. Phys Rev E Stat Nonlin Soft Matter Phys 2010;82:060103. [DOI] [PubMed] [Google Scholar]
46. Bronstein IN, Semendjajew KA, Musiol G. et al. Taschenbuch der Mathematik. Verlag Harri Deutsch Frankfurt am Main, 2006, 40–2. ISBN 978-3-8171-2006-2 (6. Auflage). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

[bpad010-B1] 1. Avila M, Saïd N, Ojcius DM.. The book reopened on infectious diseases. Microbes Infect 2008;10:942–7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B2] 2. Kermack WO, McKendrick AG.. Contribution to the mathematical theory of epidemics. Proc R Soc A 1927;115:700–21. [DOI] [PubMed] [Google Scholar]

[bpad010-B3] 3. Ising E. Beiträge zur theorie des ferromagnetismus. Z Physik 1925;31:253–8. [Google Scholar]

[bpad010-B4] 4. Brauer F. Mathematical epidemiology: past, present, and future. Infect Dis Model 2017;2:113–27. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B5] 5. Stauffer D, Aharony A, Filk T.. Perkolationstheorie (transl.). VCH Verlagsgesellschaft Weinheim, 1995. ISBN 3-527-29334-5. [Google Scholar]

[bpad010-B6] 6. Grassberger P. On the critical behaviour of the general epidemic process and dynamical percolation. Math Biosci 1983;63:157–72. [Google Scholar]

[bpad010-B7] 7. Moore C, Newman MEJ.. Epidemics and percolation in small-world networks. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics 2000;61:5678–82. [DOI] [PubMed] [Google Scholar]

[bpad010-B8] 8. Newman MEJ. Spread of epidemic diseases on networks. Phys Rev E Stat Nonlin Soft Matter Phys 2002;66:016128. [DOI] [PubMed] [Google Scholar]

[bpad010-B9] 9. Davis S, Trapman P, Leirs H. et al. The abundance threshold for plague as a critical percolation phenomenon. Nature 2008;454:634–7. [DOI] [PubMed] [Google Scholar]

[bpad010-B10] 10. Binder K, Heerman DW.. Monte Carlo Simulation in Statistical Physics. Berlin, Heidelberg: Springer-Verlag, 1988. ISBN 0-540-19107-0. [Google Scholar]

[bpad010-B11] 11. Potter GM. Contagious Abortion of Cattle. Kansas State Agricultural College Circular, No. 69, 1918. [Google Scholar]

[bpad010-B12] 12. Mcdermott A. Herd immunity is an important, and often misunderstood, public health phenomenon. Proc Natl Acad Sci USA 2021;118:e2107692118. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B13] 13. Randolph HE, Barreiro LB.. Herd Immunity: understanding COVID-19. Immunity 2020;52:737–41. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B14] 14. Read JM, Keeling M.. Disease evolution on networks: the role of contact structure. Proc R Soc Lond B 2003;270:699–708. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B15] 15. Keeling MJ, Eames KTD.. Networks and epidemic models. J R Soc Interface 2005;2:295–307. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B16] 16. Roy M, Pascual M.. On representing network heterogeneities in the incidence rate of simple epidemic models. Ecol Complex 2006;3:80–90. [Google Scholar]

[bpad010-B17] 17. Bansal S, Grenfell BT, Meyers LA.. When individual behaviour matters: homogeneous network models in epidemiology. J R Soc Interface 2007;4:879–91. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B18] 18. Gou W, Jin Z.. How heterogeneous susceptibility and recovery rates affect the spread of epidemics on networks. Infect Dis Model 2017;2:353–67. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B19] 19. Stein C, Nassereldine H, Sorensen RJD. et al. ; Covid-19 Forecasting Team. Past SARS-CoV-2 infection protection against re-infection: a systematic review and meta-analysis. Lancet 2023;401:833–42. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B20] 20. Huang K. Statistical Mechanics, 2nd edn, Chapter 17, p. 422ff.New York: Wiley & Sons Inc., 1987. ISBN: 0-471-85913-3. [Google Scholar]

[bpad010-B21] 21. Pastor-Satorras R, Vespignani A.. Epidemic spreading in scale-free networks. Phys Rev Lett 2001;86:3200–3. [DOI] [PubMed] [Google Scholar]

[bpad010-B22] 22. Boguñá M, Pastor-Satorras R, Vespignani A.. Absence of epidemic threshold in scale-free networks with degree correlations. Phys Rev Lett 2003;90:028701. [DOI] [PubMed] [Google Scholar]

[bpad010-B23] 23. Neipel J, Bauermann J, Bo S. et al. Power-law population heterogeneity governs epidemic waves. PLoS ONE 2020;15:e0239678. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B24] 24. Broido AD, Clauset A.. Scale-free networks are rare. Nat Commun 2019;10:1017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B25] 25. Voitalov I, van der Hoorn P, van der Hofstad R. et al. Scale-free networks well done. Phys Rev Res 2019;1:033034. [Google Scholar]

[bpad010-B26] 26. Heesterbeek JAP. $R_{0}$ . Ph.D. Thesis, Leiden University, The Netherlands, 1992.

[bpad010-B27] 27. Ross R. The Prevention of Malaria, London: John Murray (Publ.), 1910. [Google Scholar]

[bpad010-B28] 28. Allen LA. A primer on stochastic models: formulation, numerical simulation and analysis. Infect Dis Model 2017;2:128–42. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B29] 29. Angelov AG, Slavtchova-Bojhova M.. Bayesian estimation of the offspring mean on branching processes: application to infectious disease data. Comput Math Appl 2012;64:229–35. [Google Scholar]

[bpad010-B30] 30. Balescu R. Equilibrium and Nonequilibrium Statistical Mechanics (reprint edition). Malabar, FL: Krieger Publishing Company, 1991, 379. ISBN 0-89464-524-2. [Google Scholar]

[bpad010-B31] 31. Mohn P. Magnetism in the Solid State, Chapter 6, Berlin, Heidelberg: Springer-Verlag, 2003. ISBN 3-540-431183-7. [Google Scholar]

[bpad010-B32] 32.Webdata from Statistics Norway/Statistisk sentralbyrå [Norwegian (public) bureau of statistics].

[bpad010-B33] 33. Verhulst P-F. Notice sur la loi que la population poursuit dans son accroissement. In: Correspondance Mathématique et Physique, Vol. 10. Société Belge de Librairie Hauman et Cie Bruxelles, 1838, 113–21. [Google Scholar]

[bpad010-B34] 34. Mossong J, Hens N, Jit M. et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med 2008;5:e74.doi: 10.1371/journal.pmed.0050074 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B35] 35. Brooks-Pollock E, Read JM, McLean AR. et al. Mapping social distancing measures to the reproduction number for COVID-19. Philos Trans R Soc Lond B Biol Sci 2021;376:20200276.doi: 10.1098/rstb.2020.0276 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B36] 36. Hill RA, Dunbar RIM.. Social network size in humans. Hum Nat 2003;14:53–72. doi: 10.1007/s12110-003-1016-y. [DOI] [PubMed] [Google Scholar]

[bpad010-B37] 37. Huang K. Statistical Mechanics, 2nd edn, Chapter 14, p. 341ff.New York: Wiley & Sons Inc., 1987. ISBN 0-471-85913-3. [Google Scholar]

[bpad010-B38] 38. Goldenfeld N. Lectures on Phase Transitions and the Renormalisation Group 111. (paperback edition). Addison Wesley Publishing Company, 1992. ISBN 0-201-55409-7. [Google Scholar]

[bpad010-B39] 39. Chikazumi S, Charap SH.. The Physics of Magnetism. Malabar, FL: Krieger Publishing Company, 1978, p. 64ff. ISBN 0882756621. [Google Scholar]

[bpad010-B40] 40. Heffernan JM, Smith RJ, Wahl LM.. Perspectives on the basic reproductive ratio. J R Soc Interface 2005;2:281–93. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bpad010-B41] 41. Heesterbeek JAP, Dietz K.. The concept of R₀ in epidemic theory. Stat Neerl 1996;50:89–110. [Google Scholar]

[bpad010-B42] 42. Grimmett G. Percolation, 2nd edn. Berlin, Heidelberg: Springer Verlag, 1999. ISBN 3-540-64902-6. [Google Scholar]

[bpad010-B43] 43. Malarz K, Galam S.. Square-lattice site percolation at increasing ranges of neighbour bonds. Phys Rev E Stat Nonlin Soft Matter Phys 2005;71:016125. [DOI] [PubMed] [Google Scholar]

[bpad010-B44] 44. Kadanoff LP. Scaling laws for Ising models near Tc. Phys Phys Fiz 1966;2:263–72. [Google Scholar]

[bpad010-B45] 45. Mori T. Analysis of the exactness of mean-field theory in long-range interacting systems. Phys Rev E Stat Nonlin Soft Matter Phys 2010;82:060103. [DOI] [PubMed] [Google Scholar]

[bpad010-B46] 46. Bronstein IN, Semendjajew KA, Musiol G. et al. Taschenbuch der Mathematik. Verlag Harri Deutsch Frankfurt am Main, 2006, 40–2. ISBN 978-3-8171-2006-2 (6. Auflage). [Google Scholar]

PERMALINK

The spread of infectious diseases from a physics perspective

J H V J Brabers

Roles

Abstract

0. Introduction

1. SIR-model with network correlations and local fluctuations

Figure 1:

2. Reproduction numbers

3. Solutions of the differential equations for approximative cases

3.1 Second-order polynomial approximation of 〈ssi〉

3.2 Third-order polynomial approximation of 〈ssi〉

3.3 Third-order polynomial approximation of 〈sis〉

4. Simulations

Figure 2:

Figure 3:

Figure 4:

Figure 5:

Figure 6:

Figure 7:

Figure 8:

Figure 9:

Figure 10:

Figure 11:

5. Properties of the model and its solutions: conditions for an epidemic, effects of infection removal

5.1 Criterion for an epidemic to develop from a limited number of infections

5.2 The maximum number of active infections reached during an epidemic

5.3 The effects of infection removal and the herd-immunity pitfall

Figure 12:

Figure 13:

5.4 Reconsidering the meaning of reproduction numbers

6. Defining and understanding herd-immunity

6.1 Fundamentals

6.2 Different types of herd-immunity: a classification

6.3 Herd-immunity and reproduction numbers

7. Vaccination

Figure 14:

8. Percolation

8.1 The percolation transition and its relevance in the context of vaccination

Figure 15:

Figure 16:

Figure 17:

8.2 The correlation length

8.3 The vaccination-induced herd-immunity threshold as the critical point of a second-order phase transition

Figure 18:

Figure 19:

8.4 Vaccination-induced percolation transitions in the framework of the SIR-model

Figure 20:

Figure 21:

8.5 Percolation phenomena in unvaccinated populations

Figure 22:

Figure 23:

Figure 24:

Figure 25:

Figure 26:

Figure 27:

Figure 28:

Figure 29:

Figure 30:

Figure 31:

Figure 32:

Figure 33:

Figure 34:

Figure 35:

8.6 The SIR-model and percolation in unvaccinated populations

Figure 36:

8.7 Combined effects of vaccination and social-network restrictions

Figure 37:

Figure 38:

8.8 Herd-immunity and percolation

9. Epilogue

9.1 Summary, conclusions and suggestions for further research

Glossary of most common symbols

Appendix 1

Solving the differential equations

(a) Third-order polynomial approximation of 〈ssi〉

(b) Third-order polynomial approximation of 〈sis〉

Appendix 2

An eigenvalue equation relating the sx and the 〈syx〉

Appendix 3

3.1 Second-order polynomial approximation of $〈 s_{s i} 〉$

3.2 Third-order polynomial approximation of $〈 s_{s i} 〉$

3.3 Third-order polynomial approximation of $〈 s_{i s} 〉$

(a) Third-order polynomial approximation of $〈 s_{s i} 〉$

(b) Third-order polynomial approximation of $〈 s_{i s} 〉$

An eigenvalue equation relating the s_x and the $〈 s_{y x} 〉$

Derivation of the coefficients $a_{n} ({\bar{n}}_{b})$

Derivation of the coefficients $a_{n}^{c'} ({\bar{S}}_{c'})$

Derivation of an approximative expression for ${\bar{S}}_{c'} = {\bar{S}}_{c'} (s_{0, c'}, 〈 {\bar{S}}_{c'} 〉)$