Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2021 Feb 19;16(2):e0246056. doi: 10.1371/journal.pone.0246056

Epidemiological model for the inhomogeneous spatial spreading of COVID-19 and other diseases

Yoav Tsori 1,3,*, Rony Granek 2,3
Editor: Emanuele Giorgi4
PMCID: PMC7894958  PMID: 33606684

Abstract

We suggest a novel mathematical framework for the in-homogeneous spatial spreading of an infectious disease in human population, with particular attention to COVID-19. Common epidemiological models, e.g., the well-known susceptible-exposed-infectious-recovered (SEIR) model, implicitly assume uniform (random) encounters between the infectious and susceptible sub-populations, resulting in homogeneous spatial distributions. However, in human population, especially under different levels of mobility restrictions, this assumption is likely to fail. Splitting the geographic region under study into areal nodes, and assuming infection kinetics within nodes and between nearest-neighbor nodes, we arrive into a continuous, “reaction-diffusion”, spatial model. To account for COVID-19, the model includes five different sub-populations, in which the infectious sub-population is split into pre-symptomatic and symptomatic. Our model accounts for the spreading evolution of infectious population domains from initial epicenters, leading to different regimes of sub-exponential (e.g., power-law) growth. Importantly, we also account for the variable geographic density of the population, that can strongly enhance or suppress infection spreading. For instance, we show how weakly infected regions surrounding a densely populated area can cause rapid migration of the infection towards the populated area. Predicted infection “heat-maps” show remarkable similarity to publicly available heat-maps, e.g., from South Carolina. We further demonstrate how localized lockdown/quarantine conditions can slow down the spreading of disease from epicenters. Application of our model in different countries can provide a useful predictive tool for the authorities, in particular, for planning strong lockdown measures in localized areas—such as those underway in a few countries.

Introduction

The COVID-19 pandemic is now spread over most of the globe. Its vast consequences are associated with severe public health issues, i.e. overwhelmed health system, high death toll, and a huge economic crisis worldwide [15]. In order to optimize decisions in both aspects, health and economy, authorities need information and predictions about the spatial distribution of the disease [6], thereby allowing selective quarantine or lockdown measures [7, 8].

Infectious disease spreading models are largely based on the assumption of perfect and continuous “mixing”, similar to the one used to describe the kinetics of spatially-uniform chemical reactions. In particular, the well-known susceptible-exposed-infectious-recovered (SEIR) model, builds on this homogeneous-mixing assumption. Some extensions of SEIR-like models that account for spatial variability employed mainly diffusion processes for the different sub-populations [912] (where the term sub-population refers to people under a certain stage of the disease). Yet, clearly, while such processes can effectively describe wildlife motion in some systems, they fail to describe the (non-random) human behavior [13]. To mimic human behavior more realistically, recent extensions employed diffusion processes of the sub-populations that are limited to contact networks [14]. However, one of the most important artifacts for the application of diffusion process for human population is its unrealistic tendency to spread all populations to uniformity (be it in real space or on contact networks). Moreover, these models do not involve naturally a spatial dependence of infection spreading parameters, which are required to model geographically local quarantine. Thus, to implement such dependence using homogeneous models requires a division of the geographic region into multiple number of patches [14, 15].

Early in the COVID-19 pandemic spread, different modeling groups used homogeneous models to predict the epidemic evolution in China and in other countries [1623]; their predictions urged the World Health Organization (WHO) to issue a global warning. Wu et al. were the first to model the COVID-19 spreading [16]. They applied the SEIR model using data from the very early (exponential) stage of the outbreak, to predict epidemic spread mainly in Wuhan and mainland China. Extensions for this first attempt were quick to follow. Ivorra and Ramos applied the “Be-CoDiS” mathematical model—a multi-sub-population extension of the SEIR model—to COVID-19 [19, 20]. A fit of the parameters of the model to a longer period of evolution, up to the time where the outbreak nearly peaked (maximum number of new daily infected people), yielded remarkably accurate predictions for the stages that followed. More recently, He et al. [21] and Giordano et al. [22] provided further improvements and analysis on the original application of the SEIR model to COVID-19 [16].

As mentioned, conventional epidemiological models assume spatially uniform (statistical) frequency of encounters between infectious and susceptible people, which is associated with uniform spatial densities of these sub-populations at all times [16, 24, 25]. As such, these models do not require any spatial variable. However, the assumption of “infinitely fast mixing” might fail even in normal life conditions, let alone under (the often used) various travel and gathering restrictions, or moderate quarantine conditions [7]. Moreover, the basic reproductive number, R0, may vary from one area to the other, for example, if people behave differently across areas. A further complication may arise when human behavior evolves in time during an epidemic, implying that R0 is also time-dependent. As a consequence, accurate predictions of such models rely on repeated readjustments of the infection rate constant as the epidemic progresses.

Specifically, these models give a broad regime of exponential growth of the cumulative number of infected people, whereas the actual growth is sub-exponential, which can be effectively described by a power-law (i.e. Atν where A and ν are numerical constants) [2630]. Early data from several countries indeed demonstrated such a wide temporal regime of power-law growth, which occurred much prior to the peak of the epidemic [26, 3032]. In accord with similar ideas [26], we propose that such a temporal behavior can result from the lateral spreading of infected domains, similar to the spreading of wildfire, which gives further motivation to the present work.

In this paper, we develop a novel theoretical framework to model the spread of infections, thereby allowing researchers working in close contact with authorities to apply the model in their own countries. To demonstrate our theoretical framework, we chose to improve on homogeneous SEIR-like models, in particular in the context of the COVID-19 pandemic, in three major aspects. The first is derivation of a spatial spreading (diffusion-like) operator, that generates propagation of the front of an infected population domain into a susceptible one [3335]. Importantly, this operator is associated only with infection spreading and does not describe motion of the different populations as previously suggested [9, 12]; as argued above, the use of a diffusion process is inadequate for human population. The second, and strongly linked to the first, is the ability to account for the geographical population density variation and study its effect on the spreading [36]. The combination of these two aspects leads to the spreading of the disease from areas of low density to areas of high density. The third aspect is the account of geographic variation in quarantine levels if such are employed. In addition, to apply our approach for COVID-19, we split the infectious sub-population into a transient “infectious-presymptomatic” group and an “infectious-symptomatic” group (the SEPIR model mentioned below).

Our results provide infection “heat-maps”, reminiscent of those appearing in publicly available resources (e.g., for South Carolina [37] and Nashville, Tennessee [38]); see also the heat-map snapshots in the SI, Figs SI-6 and SI-7 in S1 File. These heat-maps demonstrate unique features of the disease spreading depending on the spatial variation of population density, location of the initial epicenters, and local quarantine levels. They show that the accumulated number of infected people deviates significantly from the associated homogeneous model.

In-homogeneous SEPIR model

Our model, which builds on the homogeneous SEIR model, includes five sub-populations associated with different stages of the disease: susceptible-exposed-presymptomatic-infectious-recovered (SEPIR). This basic SEPIR model is extended to account for the spatially varying density of people n(x)—where x is a 2-dimensional (2D) vector in the plane, whose components will be denoted by x and y—between different geographical areas of the region under study, which is assumed in the present study to be isolated from other regions. Our model also aims to predict the effect of different quarantine levels imposed in different areas within the region of study, which is modeled via spatial dependence of infection rates.

We define the variables h, b, w, f, and r as follows:

  • h(x, t): 2D (areal) density of susceptible (healthy but not immune) people

  • b(x, t): density of exposed people that do not yet infect others and are not yet symptomatic (i.e. within the incubation period [39])

  • w(x, t) density of pre-symptomatic people that can already infect others but are not yet symptomatic (i.e. still within the incubation period)

  • f(x, t): density of symptomatic (infectious) people.

  • r(x, t): density of people that have recovered from illness, thus assumed (here) to be immune from a second infection

We require that the total local density of people is equal to prescribed values n(x) at different positions x (obtained, e.g., from public databases) and is independent of time:

h(x,t)+b(x,t)+w(x,t)+f(x,t)+r(x,t)=n(x), (1)

implying the same for the total population number, N = ∫area d2 x n(x). In what follows, capital letters will denote the corresponding global quantities, e.g., H(t) = ∫area d2 x h(x, t), B(t) = ∫area d2 x b(x, t), etc. We note in passing that these variables correspond to the variables of the well-known (homogeneous) SEIR model [24, 40, 41] as follows: HS (susceptible), BE (exposed), W + FI (infectious), RR (recovered). Thus, unlike in the SEIR model, in our model the infectious sub-population (I) is split into two sub-populations, pre-symptomatic (W) and symptomatic (F).

In order to develop the spatial epidemic spread model, we consider first a 2D discrete space (square or triangular lattice), in which the nodes are defined as areal units of linear size (henceforth “grid-size”) δ. The present model assumes some traveling (or mobility) restrictions, and the value of δ is chosen such that node-node infections can occur only between those nodes that are nearest-neighbors, while within each node homogeneous infections take place. For example, if people avoid traveling distances over 30 km, but still travel a lot within 30 km, one has to choose δ ≃ 30 km. Likewise, under stronger restrictions (i.e. a “lockdown”), traveling may be restricted to 1 km, which implies δ ≃ 1 km.

We define by hi(t) the number of susceptible people at node i at time t. Similarly bi(t), wi(t), fi(t), and ri(t) describe the numbers of the different sub-populations at each node. Infection can occur at a rate constant k1(i) when infectious and susceptible people from the same node i meet each other, and at a rate constant k2(j, i) = k2(i, j) when meetings occur between an infectious person from node i and a susceptible person from a nearest-neighbor (NN) node j (i.e. the infection rate, for an infectious person at node i and a susceptible one at node j, is identical to the rate when the two nodes are interchanged). The total number of people at node i is denoted by ni. Accordingly, the set of (non-linear) master equations for the distribution of these sub-populations is

hit=-k1(i)hini(wi+fi)-hinijik2(j,i)(wj+fj)bit=k1(i)hini(wi+fi)+hinijik2(j,i)(wj+fj)-γ0biwit=γ0bi-γ1wifit=γ1wi-γ2firit=γ2fi. (2)

where ji stands for node j that is NN to i. In Eq (2), γ0, γ1, and γ2 are the rate coefficients for the transition of the b-population to w, of w to f, and of f to r, respectively.

We now transform the master Eq (2) to the continuum using the Kramers-Moyal expansion [42], xi. Using the symmetry for infection rates between NN nodes i and j, k2(j, i) = k2(i, j), and defining a local density of a sub-population y as y(x, t) ≡ yi(t)/δ2 (e.g., b(x, t) ≡ bi(t)/δ2), we obtain

ht=-k(x)hn(w+f)-hn[Dk(x)(w+f)]bt=k(x)hn(w+f)+hn[Dk(x)(w+f)]-γ0bwt=γ0b-γ1wft=γ1w-γ2frt=γ2f. (3)

where the last line in Eq (3) can be replaced—using the conservation law Eq (1)—by r = n − (h + b + w + f). In Eq (3), k(x) = k1(x) + zk2(x) defines an effective local rate coefficient of infection growth, and Dk(x) = (z/4)k2(x)δ2 is an effective diffusion coefficient of the infection spreading, henceforth termed epidemic diffusion coefficient; z is the number of nearest-neighbors to a node (coordination number), z = 4, 6 for square and triangular lattices, respectively. The diffusive-like term, in the first two lines of Eq (3), governs the diffusion of the epidemic, not the people. Its presence, on top the (familiar, homogeneous) infection growth rate (first term in these two lines), can describe the lateral growth of infected sub-population domains (henceforth “infected domains”), as the front—i.e. the boundary between infected and susceptible domains—propagates into susceptible domains [34, 35]. In the special case of homogeneous distribution of all populations and homogeneous rate constants, k can be identified as the SEIR parameter ratio R0/τI, where R0 is the basic reproductive rate and τI is the mean infectious period. It is easy to verify, by summing all lines in Eq (3), that ∂n(x,t)/∂t = 0, as required. Thus, any initial in-homogeneous population density distribution n(x) is not altered by our infection spreading model.

For simulation purposes we rescale the local densities by the mean total population density (in the whole region under study), n0, such that y˜(x,t)y(x,t)/n0. In particular, n˜(x)=n(x)/n0 presents the relative local population density. In addition, distance is scaled by δ, i.e. x˜=x/δ, such that becomes dimensionless. This leads to the following scaled equations

b˜t=k(x˜)h˜n˜(w˜+f˜)+h˜n˜[D˜k(x˜)(w˜+f˜)]-γ0b˜h˜t=-k(x˜)h˜n˜(w˜+f˜)-h˜n˜[D˜k(x˜)(w˜+f˜)]w˜t=γ0b˜-γ1w˜f˜t=γ1w˜-γ2f˜r˜t=γ2f˜. (4)

where Dk˜=Dk/δ2=k2.

The parameters to be used for COVID-19 pandemic should be obtained from the up-to-date literature. Parameters that are associated with the physiological response to the disease are fairly well known, however, the basic reproductive rate, R0, varies strongly between different countries due to differences in social behavior [43, 44]. In the absence of any quarantine conditions or safety measures (e.g., use of masks), it ranges predominantly between 2 and 4 whereas initial estimates from China were R0 = 2–3. In this work we chose R0 = 2.5 as a typical value. It has been reported that τI = 16.6 days [45] yielding an infection rate coefficient k = R0/τI that is about 0.15days−1. The mean time for the appearance of symptoms (from the moment of infection), τS, is known to be about 5 days [45, 46] (ranging between 2-11 days) which sets up γ0-1+γ1-1=τS=5 days. In addition, there is evidence that people are infecting already about 2-3 days before showing symptoms [47], hence we set γ1-1=3 days. It follows that the transition rate from exposed (infected but non-infecting) to presymptomatic-infecting, is γ0-12 days, which is quite a reasonable estimate given that viral load needs to rise before a person sheds enough virus to be considered infecting.

The rate coefficients γ1 and γ2, describing the transitions from presymptomatic-infecting to symptomatic-infecting, and from symptomatic-infecting to recovered, respectively, must obey γ1-1+γ2-1=τI=16.6 days (i.e. the whole infection period), implying γ2-1=13.6 days. The dimensionless effective diffusion coefficient Dk˜=k2 is the most difficult model parameter to estimate and is sensitive to the choice of nodes. Likely Dk˜k, since k = k1 + zk2 and we may also assume k2k1. For numerical purposes in the present study we use, from now on, square lattice node geometry, i.e. z = 4, and choose k1 = k2 = 0.03 days−1, yielding k = 0.15 days−1 (as required) and Dk˜=0.03 days−1. Sensitivity analysis for a wide parameter range is performed in the S1 File; see Figs SI-3, SI-4, and SI-5 in S1 File. As can be expected, the speed of epidemic spread is highly sensitive to the model parameters. Yet, the spatial epidemic spreading patterns that are obtained are quite similar to one another in the range of chosen parameters, e.g., a larger R0 pattern appears similar—though at shorter times—to a smaller R0 pattern.

In the proceeding section we solve this spatially dependent multiple population model, at different initial conditions (using the above parameters unless otherwise stated). For a few initial conditions, we use a specific inhomogeneous density populations n(x) to examine its effect. Obviously, to obtain realistic predictions one requires: (i) detailed local population density data (i.e. density maps), and (ii) data for the initial local densities of the above five different populations (i.e. “heat-maps”), both given to the grid size resolution δ (requiring cooperation with authorities). The present work is therefore limited to present the strength of the model and its ability to give insight on the way the infection spreads under different levels and spatial variation of quarantine or safety measures. For brevity, henceforth, we drop the ‘∼’ sign from the notations of (density) normalized spatially dependent variables, i.e. f˜f, and a similar transformation with the other variables. Recall that capital letters denote global quantities, i.e. spatial integrals of the lowercase spatially dependent quantities, e.g., F = ∫ f(x)d2 x, representing now the fraction of the specific population—out of the total population—as now f(x) implies f˜(x).

Results

The initial conditions of an epidemic are unknown unless in exceptional cases, yet they have major consequences on the number of infected people. In all examples below, we use identical initial conditions for the global quantities as follows: W = R = F = 0 and B = 10−3 (i.e., one out of 1000 people is in the incubation period). In our non-uniform model, we are able to analyze the effect of the different, non-uniform, initial conditions, yet identical global initial conditions. Comparing the evolution in time of the global quantities, between the locally different—yet globally identical—initial conditions, we will assess both the spreading patterns and the overall effect of the epidemic. For clarity, Table 1 summarizes the scenarios considered.

Table 1. Summary of population distribution n, b population at time t = 0, and quarantine measures employed in all figures.

For more details see the figure captions.

Fig. Population density n Exposed population density b(t = 0) Quarantine
1 Uniform Uniform No
2 Uniform Two centers No
3 Gaussian Two centers near “city” No
4 Uniform Scattered centers No
5 Gaussian Numerous centers inside “city” No
6 Gaussian Scattered centers outside “city” No
7 Nonuniform Scattered centers outside “city” No
8 Nonuniform Scattered centers outside “city” Yes, belt
9 Uniform Numerous infections near center No
10 Uniform Numerous infections near center Yes, belt
11 Uniform Numerous infections near center Yes, area

Study of model behavior

In order to better understand the basic features of the model, we commence with oversimplified cases. First, we ignore any spatial dependence of the initial (t = 0) sub-populations, and that of the (overall) local population density n(x). In such cases, when the initial conditions are uniform in space, Eq (4) ensures that all sub-population densities are kept uniform at all times. In this limit, our model converges to a homogeneous SEPIR model (i.e. no diffusive-like terms in Eq (4)). As mentioned above, it is similar to the SEIR model with the addition of a pre-symptomatic sub-population. Although this limit cannot be achieved realistically, it is studied here for comparison with the conventional, homogeneous, models. Obviously, in this case of uniform distributions, the global (spatial integral) quantities do not provide additional information.

Fig 1 shows the model predictions for the evolution of the global variables against time t when all local variables: n, b, h, w, f, and r, are spatially uniform, and for the above stated initial conditions (W = R = F = 0 and B = 10−3). Fig 1(a) shows the time evolution of the model variables H, B, W, F, and R. At a time t ≃ 92 days the epidemic attains its peak, i.e. the infectious population (W + F) attains its maximum and later declines (dashed line). At the epidemic peak, we have H ≃ 0.37, i.e. the fraction of immune population is 1 − H ≃ 0.63. Thus, within our (five sub-populations) SEPIR model, and the chosen parameters, “herd immunity” is reached at 1 − H = 0.63 fraction of the susceptible population being infected. The fraction of recovered (“removed”) people, R (green curve), is increasing monotonously. In this particular example, at very long times R attains a value of R ≈ 0.89 and correspondingly H ≈ 0.11.

Fig 1. Solution of the epidemic model for the case of spatially uniform population densities n, b, h, w, f, and r, against the time t (in units of days).

Fig 1

(a) Curves depicting the global sub-population fractions (capital letters), amounting here to simple multiplication of the local densities by the area. The initial conditions are B = 10−3 and W = F = R = 0. (b) A log-log plot of the cumulative infected population, 1 − H, vs time (in days). The dashed and dash-dotted lines are fits at t = 2 and t = 55, respectively. In this and in all other figures Dk = 0.03, k = 0.15 days−1, γ0 = 1/2 days−1, γ1 = 1/3 days−1, γ2 = 1/13.6 days−1.

A common quantity used to follow the epidemic is the accumulated number of people that have been infected until time t, 1 − H. In Fig 1(b) we present 1 − H vs time on a log-log scale. We observe a separation of the growth into two (albeit very short) power-law regimes, 1 − HAtν (where A and ν are numerical constants) with the early evolution exponent ν ≃ 0.26 being much smaller than the late evolution exponent ν ≃ 4; a smaller power-law exponent ν signifies a slower growth. However, given the short duration in time of these power-law regimes, it should be recognized that a power-law description is not useful for the homogeneous SEPIR model; the above analysis is shown mainly for comparison with the in-homogeneous examples below, where power-law regimes are much wider.

As a second example (Fig 2), consider a rather different initial spreading of the b-population, still with n uniform in space. In Fig 2, and in all following figures, the epidemic heat-maps are on shown on the left-hand-side, and the corresponding global quantities are depicted on the right-hand-side. In Fig 2, the initial conditions are two relatively large infection centers—see panel (a). The parameters are set such that the (initial) value of B is taken to be the same as in Fig 1. All other populations are initially vanishing: w = f = r = 0. Although n is uniform at all times, all specific populations are nonuniform at t > 0 (even though h, w, f, and r are set uniformly to zero at t = 0). The exposed sub-population centers grow and develop into ring-like structures of the symptomatic sub-population f, later start to overlap, and subsequently merge into one large ring that continuously spreads outwards—panels (b)-(f). The core of the rings is seen to evolve quickly to contain mostly recovered population (since rnf). The evolution in time of global populations is shown in panels (g) and (h). In comparison to the case of uniform initial b (Fig 1), here the peak of the epidemic occurs at much longer times (t ≃ 324) and is much smaller in magnitude. The early-time power-law regime quickly crosses over (at t = 10) to a long power-law behavior with exponent ν ≃ 2. The latter exponent may be explained by the outward propagation of the ‘f’-rings (or those of w + f, not depicted in Fig 2), at constant velocity. If the front of infectious sub-populations moves at constant velocity [34, 35], as dictated by Eq (4), it implies that the domain area, corresponding to the cumulative number of infected people, grow as ∼ t2.

Fig 2. Time evolution of an epidemic starting from two infection centers (in this and in all other figures, t is the time in days, and x and y are the spatial Cartesian coordinates).

Fig 2

(a) Initial conditions of b. B—the global value of b—is the same as in Fig 1. n is uniform and all other populations are initially zero: w = f = r = 0. Panels (b)-(f) depict the spreading pattern of the symptomatic population ‘f’ as time progresses. The two circular domains grow and merge into one oval-like domain. Panel (g) shows the global sub-populations F, H, B, W, and R vs time t (in days). Panel (h) shows the cumulative fraction of infected population 1 − H vs time t in (in days) on a log-log scale. Compare to the cases of uniform (Fig 1). Dashed and dash-dotted curves in panel are linear fits at t = 3 and t = 150, respectively.

We now turn to study a situation in which the given population density is not uniform in space, and our model is highly suitable to handle such cases. To mimic a large density variation around a densely populated area, as for a city surrounded by suburban areas, the population density is assumed to follow a centered Gaussian function with a non-zero baseline, n(x)=10ae-(x2+y2)/2+a, where the standard deviation (width), , of the density is 10, and the baseline a is set such that the spatial average of n is 0.2; see Fig SI-1 in S1 File for illustration. The density in the “city center”, x ≃ 0, is therefore 10 times higher than in its “suburbs”, and its main core spans over a radius of 10.

Fig 3 depicts an epidemic that initiates from two infection centers of exposed (b) sub-population, situated on the two sides of this model “city”, as in Fig 2. Interestingly, there is a remarkable strong influx of the epidemic towards the densely populated area, as evident from panels (d) and (e): the two growing infection centers merge into one large central spot with significantly more symptomatic people. This is very different from the evolution seen in Fig 2. The “late” power-law exponent governing the cumulative fraction of infected population is ν ≃ 3.2, i.e. in between the values obtained for the evolution depicted in Fig 2 (homogeneous n only) and Fig 1 (no spatial dependence).

Fig 3.

Fig 3

Time evolution of a epidemic starting from two infection centers near a heavily populated region, see (a) (top-left panel); t is the time given in days, and x and y are the spatial Cartesian coordinates. n is non-uniform and given by n(r)=10ae-r2/2+a, with = 10 and a taken such that the spatial average of n is 0.2; see Fig SI-1 in S1 File for illustration. The global value of b is the same as in previous figures, B = 10−3, and all other populations are initially zero everywhere: w = f = r = 0. Panels (b)-(f) show the spread of the symptomatic population f as time progresses. The symptomatic population quickly spreads into the denser region in the center and its density there increases dramatically. The global sub-population fractions, and the cumulative fraction of infected population 1 − H, are shown in panels (g) and (h), respectively.

More complex scenarios

Let us now look at epidemic evolution that initiates from different nonuniform initial conditions, in either uniformly or non-uniformly populated areas. Consider, first, several infection centers of the exposed sub-population b, randomly scattered within a uniformly populated area, Fig 4(a) (t = 0). In real-life, such initial infection centers obviously result from in-flux of infections via numerous processes, e.g., when infecting people visit the city for a short period, or when infecting and susceptible people meet in different locations outside the city (creating infection events) and then return home, and so on. The global value of b is the same as in Fig 1, i.e. B = 10−3, and all other populations vanish at t = 0 as before. Panels (b)-(f) show the time evolution of the symptomatic population f. As can be seen, each infection point initially grows locally and develops into a “ring” of symptomatic f centered around the original center (similar to Fig 2). These rings further grow and coalesce into a complex boundary pattern which keeps evolving. Panels (g)-(h) show the integrated values of F, H, B, W, and R and 1 − H. In comparison to Fig 1 for the case of uniform b, here the epidemic evolves much more slowly, peaks at a longer time and, more importantly, the value of F at the peak (likewise that of W+ F, almost indistinguishable from F) is significantly smaller. The “herd immunity” value deduced from the value of 1 − H when W + F peaks (almost indistinguishable from F), at t ≃ 123, is 1 − H = 0.35, which is quite smaller than the value obtained from the homogeneous case, Fig 1 (1 − H = 0.63), showing the non-universality of this value and its high sensitivity to the initial spreading conditions. The early evolution power-law exponent is similar to the uniform b case, but the long-time exponent ν ≃ 2.6 is significantly smaller, leading to a significantly longer time for the epidemic to decay. In Fig SI-5 in S1 File we use identical initial conditions as in Fig 4 but with a value of R0 that is twice larger—R0 = 5. The snapshots in Fig SI-5 in S1 File are taken at shorter times, which yields patterns almost indistinguishable from those shown in Fig 4, and shows the universality of the epidemic spreading patterns formed by the SEPIR model.

Fig 4. Time evolution of an epidemic with randomly scattered infection centers; t is the time given in days, and x and y are the spatial Cartesian coordinates.

Fig 4

(a) At t = 0 (top-left panel), there are small infection centers of the exposed sub-population b scattered randomly in space. The global value of b (B) is the same as in Fig 1, i.e. B = 10−3. n is uniform and all other populations are set initially to zero: w = f = r = 0. Panels (b)-(f) show the spread of the symptomatic population f as time progresses. Panel (g) shows the global sub-populations F, H, B, W, and R vs time t (in days), and panel (h) shows the cumulative fraction of infected population 1 − H vs time in on a log-log scale. Dashed and dash-dot curves are linear fits at t = 2 and t = 73, respectively. In Fig SI-2 in S1 File we plot heat-maps of 1 − h—the corresponding cumulative infections.

We now go back to study situations in which the given population density is non-uniform in space, and consider again, as in Fig 3, a central heavily populated area (“city”) whose density declines away from its center as a Gaussian with a finite width, see explanation related to Fig 3. Here, however, we consider more realistic initial conditions, where exposed (b) sub-population centers are scattered in a number of places. We distinguish here between two different cases: (i) The infection (b sub-population) initiates from multiple centers within the city (Fig 5). (ii) The infection evolves from several, randomly distributed, centers in the city periphery (Fig 6).

Fig 5.

Fig 5

Time evolution of an epidemic starting from multiple infection centers inside a heavily populated region (“city”, panel (a)); t is the time given in days, and x and y are the spatial Cartesian coordinates. The population density of the city n is nonuniform and given by n(x)=10ae-x2/2+a, with x = (x,y), = 10, and a taken such that the spatial average of n is 0.2; see Fig SI-1 in S1 File for illustration. The integrated initial value of b (B = 10−3) is the same as in all previous figures. All other populations are initially zero: w = f = r = 0. Panels (b)-(f) show the spread of the symptomatic sub-population f as time progresses. The global sub-populations and the cumulative infected population 1 − H are shown in panels (g) and (h), respectively.

Fig 6. Same as in Fig 5 but with the initial (t = 0) infection centers located outside of the “city”.

Fig 6

Consider first an infection initiating from around the city center (case (i)), as several small b-centers randomly distributed within the city core. Initially, the epidemic consumes non-negligible portion the susceptible (h) sub-population within the city core, associated with a substantial growth of f, see Fig 5(b) and 5(c). After this early evolution (t ≳ 90) the infection slowly spreads outward by formation of ring-like patterns, bearing some similarity to the above studies. Conversely, when the infection initiates from the city outskirts (case (ii), Fig 6), the pattern is more heterogeneous, and local infection centers grow effectively independently of each other. Here, as the epidemic reaches the core of the city, the relatively high density of susceptible population (h) allows the f-population to keep rising. Hence F(t), seen in case (ii) (Fig 6(g)), grows for a somewhat longer time and reaches a higher peak, as compared to case (i) (Fig 5(g)). The apparent “herd immunity” value (1 − H at the peak of F or W + F) is also higher in case (ii) (1 − H ≃ 0.45) than in case (i) (1 − H ≃ 0.27), and both values are lower than the one obtained in the homogeneous case (Fig 1), demonstrating again the sensitivity of this value to both initial conditions and spatial density variation.

Quarantine strategies

So far we looked at the unperturbed spread of an epidemic. Yet, authorities often use numerous active tools to confine the disease or slow down its spread. Usually people are instructed to stay home for a considerable period, the so-called “quarantine”, “lockdown” or “stay-at-home” order. In addition, roads connecting between “hotspots” and uninfected regions are sometimes blocked. Here we consider two types of quarantine strategies. We define local “area lockdown” as strong restrictions on activity within a certain area. Examples where “area lockdown” have been employed are: Wuhan (China), Anxin county (China), dozens of residential compounds in Beijing (China), Bnei-Brak (Israel), New York (USA)—“stay-at-home” order [48, 49], Florida State—“stay-at-home” order [50, 51]. We also propose another strategy that inflicts less burden on the society, which we term “belt quarantine”. Here movement restrictions are imposed between a certain region and its surroundings, but not inside the region itself. Partial belt quarantine has been employed for instance in Canada, as the whole state reopened from lockdown, using checkpoints and roadblocks [5254]. In both, there are fewer and less frequent encounters between people implying locally reduced values of Dk and k.

We commence by examining the effectiveness of belt quarantine that is imposed in order to protect an uninfected heavily populated area (“city”) from its infected surroundings. For comparison, we also consider the case without quarantine, see Fig 7. To mimic belt quarantine, see Fig 8, we impose significantly reduced values of Dk and k within a belt surrounding a city. Specifically, between radii 10 and 12, the values of Dk and k are reduced to 20% of their background values (that are identical to those used for Fig (7)). The density profile (in both figures), describing the city and its surroundings, is not a Gaussian (as in the previous examples), but uniform within the city, and uniform but ten-folds lower outside the city. The initial (t = 0) scattered centers that surround the city, of exposed sub-population (b), are identical in both figures (panel (a) in both). The resulting epidemic spreading patterns shown in Figs 7(b)–7(f)) and 8(b)–8(f), suggest that with the protective belt quarantine the infection takes considerably longer time to invade the city.

Fig 7.

Fig 7

Time evolution of an epidemic starting from multiple random infection centers near a heavily populated region—mimicking a city—see (a) (top-left panel); t is the time given in days, and x and y are the spatial Cartesian coordinates. n is non-uniform, and is given by n = 10a within a circle of radius 10, and n = a outside of that circle, with a taken such that the spatial average of n is 0.2. The initial (t = 0) integrated value of b is the same as in previous figures, B = 10−3. All other populations are initially zero: w = f = r = 0. The global sub-population fractions are shown in panels (g)-(h).

Fig 8. Belt quarantine.

Fig 8

Time evolution of an epidemic starting from multiple random infection centers (see panel (a)), near a city identical to that of Fig 7: n is nonuniform, and is given by n = 10a within a circle of radius 10, and n = a outside of that circle, with a taken such that the spatial average of n is 0.2; t is the time given in days, and x and y are the spatial Cartesian coordinates. The “city” is under a protective circumferential “belt”, formed by two concentric circles (radii 10 and 12), within which Dk and k are reduced to 20% of their values elsewhere. The initial (t = 0) integrated value of b is the same as in previous figures, B = 10−3. All other populations are initially zero: w = f = r = 0. Panels (b)-(f): The infection is seen to spread quickly within the external area, but penetrates very slowly into the protected region. The global sub-population fractions are shown in panels (g)-(h); F shows a very wide plateau followed by a higher peak.

Comparing the resulting evolution in time of the global sub-populations, Fig 7(g) and 7(h) vs Fig 8(g) and 8(h), we observe that the belt quarantine slows down significantly the growth of symptomatic population F. Under quarantine (Fig 8(g)), F first develops a very wide plateau corresponding to the epidemic spreading only in the surroundings, outside the quarantined city. Later, at t ≃ 230 days, F further grows and builds a major (second) peak (t ≃ 268), corresponding to the epidemic spreading in the quarantined zone, Fig 8(f). While the height of the peak without quarantine is similar to the one with quarantine, the first occurs at about 90 days earlier (t ≃ 177 days). As a result, the value of H at the major epidemic peak is, in fact, smaller in the presence of quarantine, as most of the (whole) region has been infected before penetration to the quarantined city occurred. This suggests that a belt quarantine, protecting a highly populated area from its surrounding, only delays the epidemic spread and, in case the majority of population lives in the protected zone, is unable to flatten the epidemic curve significantly.

Finally, we turn to investigate a neighborhood, which is considered as the epidemic epicenter, within a large, uniformly populated, urban area. The population density n is thus uniform in the whole region of study. The neighborhood area is assumed to be a circle of radius 11, and within it there is initially a high fraction of exposed population b as multiple, randomly scattered, centers. We consider again belt quarantine which is imposed on the neighborhood in order to contain the epidemic within it. In addition, we examine the effect of a more severe measure: lockdown on the whole neighborhood, which we term “area lockdown”. Fig 9 depicts the spreading patterns in the absence of quarantine, which is shown for comparison. Fig 10 represents the effect of belt quarantine between radii 10 and 11. Fig 11 depicts the consequences of area lockdown within the whole (highly infected) neighborhood. In both belt quarantine and area lockdown the values of Dk and k are reduced to 20% of their background values.

Fig 9.

Fig 9

(a)-(f): Time evolution of a epidemic starting from multiple random infection centers with uniform n, see (a); t is the time given in days, and x and y are the spatial Cartesian coordinates. The global value of b is the same as in previous figures, B = 10−3. All other populations are initially zero: w = f = r = 0. Both Dk and k are uniform. The symptomatic population f spreads and forms ring-like structures that expands in time. Panels (g) and (h) show (respectively) the different global populations and the cumulative infected population, 1 − H, vs time t.

Fig 10. Belt quarantine.

Fig 10

Same as in Fig 9 but now with a protective “belt” quarantine in the region between the two concentric circles (radii 10 and 11). Within the belt, the values of Dk and k are reduced to 20% of their values in the rest of the region. Initially the epidemic is confined to the quarantined region, but at long times it leaks out through the belt and contaminates the exterior. Note the relatively isotropic spreading patterns in the exterior, despite the initial non-isotropic b depicted in (a).

Fig 11. Area lockdown.

Fig 11

Same as Fig 9 but now the quarantine is throughout the whole area within a circle of radius 11. Within this quarantined area, the values of Dk and k are reduced to 20% of their values in the rest of the region. The “contamination” of the exterior is slower than in Fig 10. Also note the relatively anisotropic spreading patterns seen at long times in the exterior, as compared to Figs (9) and (10).

The comparison of the epidemic spreading patterns between the above three cases, shows that with belt quarantine (Fig 10) the escape of infection from the neighborhood to its surroundings takes longer time relative to the no quarantine situation (Fig 9). Note that area lockdown (Fig 11) is even more efficient than belt quarantine in prolonging this escape time. Furthermore, it appears that in the case of belt quarantine the late-time spreading pattern are nearly isotropic, while in the case of area lockdown the escape pattern is highly non-isotropic. This occurs since prior to the escape, in the case of belt quarantine the infected population homogenizes rather quickly within the neighborhood, while for area lockdown the slow spreading within the neighborhood prevents this homogenization.

The overall effect of the different quarantine measures on the global populations is shown in panels (g) and (h) of the respective figures. Belt quarantine produces two peaks of symptomatic population (F), one (small) that corresponds to the spreading within the neighborhood, and the other (larger) corresponding to the external spreading after escape from the neighborhood occurred. The major rise in F is delayed mostly under area lockdown, due to the relative long epidemic escape time from the neighborhood. Another pronounced effect of area lockdown appears in the slow increase of cumulative fraction of infected population, 1 − H: at the longest simulation time we obtain 1 − H ≃ 89% without quarantine, 1 − H ≃ 88% with belt quarantine, and 1 − H ≃ 77% with area lockdown. Thus, it is clear that area lockdown should be the preferred choice for containing the epidemic; belt quarantine, unless almost hermetic (“leakage-proof”), has a minor contribution. Moreover, none of the two quarantine strategies considered is able to reduce significantly the level of the (major) peak in F (even though its timing is delayed), which is considered important for the ability of health systems to cope with the epidemic. This occurs due the escape of the epidemic from the quarantined region, leading to its free spreading. This may be prevented by moving the quarantine to newly infected regions, which has not been addressed here. If such a careful orchestration is performed, localized quarantine strategies can have a strong impact.

Conclusions

A general mathematical framework is presented for the spatial spreading of an infectious disease. We take into account nearest-neighbor node infection kinetics, and show that it leads to a diffusion-like term in the dynamical equations, thereby providing a unified framework for heterogeneous spread of the epidemic. Nodes are defined by the assumption that within each node the frequency of contacts is still uniform, thus following a homogeneous model description. This allows to estimate the lateral (linear) size of a node δ (i.e. area equal to  δ2), for example by examining the mean distance traveled by people—e.g., 30 km under moderate restrictions or normal life, or 1 km under strong lockdown—and makes our model flexible enough to describe infectious disease spreading on a scale ranging from a neighborhood to a whole country.

We focus here on an epidemiological spreading model for COVID-19 with inherent spatial dependency of five populations, the in-homogeneous SEPIR model. We show that the complex pattern formation is sensitive to the initial conditions, i.e., to the spatial location of the exposed population, which has important consequences for the total number of infected people. Our general mathematical framework allows to include many more populations than those appearing in our SEPIR model, as suggested by other studies [19, 20, 22]. In particular, the asymptomatic population forms about 16%-40% of the total infectious population [55], and its infectious properties are quite different from those of the symptomatic population [56]. As knowledge is accumulating, it would be worthwhile to include into the model the asymptomatics, either as a single population, or as two separate populations, “asymptomatic-normal” and “asymptomatic-super-spreaders”.

The homogeneous (i.e. prefect mixing) models cannot provide epidemiological heat-maps, such as those occasionally appearing on-line (e.g., for South Carolina [37]), and prediction of such heat-maps is the main purpose of our work. Moreover, failures of the homogeneous models might be due to their inability to account for the spatial spreading of the epidemic, which yields an overestimation of the epidemic growth rate. Hence, observed deviations from the homogeneous models, i.e. as effective power-law regimes [26, 30], can be rationalized without assuming time variation of the infection rates, as is customary done in practice. We show this in the present work by comparing the results of the homogeneous SEPIR model (Fig 1) to the results of our inhomogeneous SEPIR model (Figs 26). It is gratifying that some of the power-law exponents found in our studied examples are close to observed exponents in different counties; in particular the exponent ν ≃ 2 has been deduced for China and Iran [30].

The evolution of the homogeneous SEPIR model suggests “herd immunity” at about H ≃ 0.37 (corresponding to W + F reaching its maximum, Fig 1)—i.e. fraction of immune population 1 − H ≃ 0.63—indeed very close to the known SIR result 1 − H = 1 − (1/R0) = 0.6 for R0 = 2.5. Importantly, the “herd immunity” values that could have been (wrongly) inferred from the inhomogeneous evolution, i.e. assuming the homogeneous SEPIR model to still hold, are much lower. For example, from Fig 4 one obtains 1 − H ≃ 0.35. Moreover, different initial conditions are seen to lead to different apparent herd immunity values. Thus, interpreting observed epidemic curves based on homogeneous models may lead to wrong conclusions regarding the population reaching herd immunity. Further support for this conclusion has been reached by examining epidemic curves of European countries, where it was noted that spatial heterogeneity can lower the apparent herd immunity value [57], supporting our model conclusions.

Our model can naturally describe the flux of infection from a suburban area into a densely populated city, or in the opposite direction. Interestingly, we find that relative “curve flattening” of the infected-symptomatic population (F) can naturally occur due to either non-uniform population density, non-uniform distribution of initial infectious populations, or the combination of both. This may have important implications when comparing the epidemic evolution in different regions or states, where one needs to distinguish between the effects of quarantine measures and population conditions. For example, when comparing Sweden (essentially no quarantine measures) and Israel (severe quarantine measures), conclusions might be hampered.

Accurate predictions of COVID-19 heat-maps will require complete data sets for: (i) the geographic (i.e. position dependent) overall population density n(x), and (ii) the “initial heat-maps” for the different five population types defined in our model. In many countries n(x) can be obtained from public resources, e.g., see [58]. Unfortunately, the initial heat-maps, i.e. initial conditions for the five population densities, are currently publicly available only for a small number of countries (apparently due to privacy regulations), and moreover, usually limited only to the time-cumulative density of the infected population, i.e. 1 − h(x) [37]. Cooperation with health authorities is needed to obtain complete data sets. Our predicted evolution of COVID-19 heat-maps (see Fig SI-2 in S1 File for the cumulative infected population, corresponding to the heat-maps shown in Fig 4) show strong resemblance to those available on public resources [37]; future work will be devoted to quantitative comparison. Incomplete testing data is not going to severely hamper our predictions so long as it is uniform in space (e.g., only 10% of the COVID-19 positives are detected everywhere), and in general we may expect this to be so within a certain country where a uniform testing policy is adopted. Obviously, absolute predicted numbers will not be obtained without the use of an adjusting factor between tested and predicted numbers. It should be noted that when normal life conditions are restored, our model has to be modified to include far-distance traveling; this can be readily done using long distance infections, which we defer to future work.

Importantly, the possibility to mimic in our model spatially-varying and evolving quarantine or lockdown conditions, by using both spatially-dependent (as done in this work) and time-dependent values of Dk and k, will allow a quantitative predictive tool for the effectiveness of quarantine measures. For instance, in Israel, a plan has been proposed (“the traffic lights plan” [59]) to impose differential lockdown measures on cities with strong outbreak (“red cities”), and similar plans have been issued for the UK [60]. Our model allows to simulate the impact of these measures and make comparison with the evolution without intervention. We hope authorities will use this tool, in addition to established venues [8, 61], to simulate different lockdown policies for choosing the best exit strategy [62].

Supporting information

S1 File

(PDF)

Acknowledgments

We are grateful to Ariel Kushmaro for insightful discussions.

Data Availability

All figures are available from the medRxiv database (https://doi.org/10.1101/2020.07.08.20148767).

Funding Statement

YT acknowledges support by the Israel Science Foundation Grant No. 274/19 (www.isf.org.il). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

Decision Letter 0

Emanuele Giorgi

2 Sep 2020

PONE-D-20-21655

Epidemiological model for the inhomogeneous spatial spreading of COVID-19 and other diseases

PLOS ONE

Dear Dr. Tsori,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Oct 17 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Emanuele Giorgi

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.

We suggest you thoroughly copyedit your manuscript for language usage, spelling, and grammar. If you do not know anyone who can help you do this, you may wish to consider employing a professional scientific editing service.  

Whilst you may use any professional scientific editing service of your choice, PLOS has partnered with both American Journal Experts (AJE) and Editage to provide discounted services to PLOS authors. Both organizations have experience helping authors meet PLOS guidelines and can provide language editing, translation, manuscript formatting, and figure formatting to ensure your manuscript meets our submission guidelines. To take advantage of our partnership with AJE, visit the AJE website (http://learn.aje.com/plos/) for a 15% discount off AJE services. To take advantage of our partnership with Editage, visit the Editage website (www.editage.com) and enter referral code PLOSEDIT for a 15% discount off Editage services.  If the PLOS editorial team finds any language issues in text that either AJE or Editage has edited, the service provider will re-edit the text for free.

Upon resubmission, please provide the following:

  • The name of the colleague or the details of the professional service that edited your manuscript

  • A copy of your manuscript showing your changes by either highlighting them or using track changes (uploaded as a *supporting information* file)

  • A clean copy of the edited manuscript (uploaded as the new *manuscript* file)

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: I Don't Know

Reviewer #2: I Don't Know

Reviewer #3: I Don't Know

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Please see attached comments.

Responses to answers to questions justified:

Is the manuscript technically sound, and do the data support the conclusions? PARTLY

I answered PARTLY because I think the model is technically sound, but the conclusion demonstration why this framework is better for COVID-19 is lacking

Has the statistical analysis been performed appropriately and rigorously? I DONT KNOW

I find the derivation of the model itself rigorous. I answered I DONT KNOW because I would like justification on why the assumption that individuals cannot infect others beyond their nearest neighbor is appropriate.

Reviewer #2: Review Comments

The authors have presented an interesting mathematical model for COVID-19 that accounts for spatial heterogeneity in population density and population mixing. I believe this type of model can be very useful and an improves upon the SEIR model by representing the spatial dynamics of transmission. However, the manuscript is written in a very technical manner and is not accessible to the general readership of PLOS ONE. In order for this work to be published in this journal, I suggest it be rewritten to be more explicitly relevant to the current COVID-19 pandemic. I have provided specific comments below, but I broadly recommend that the authors better describe how their model and findings improve our understanding of COVID-19 transmission dynamics and relevant interventions. How does the model improve upon other existing models that account for spatial heterogeneity in population density and mixing? How do the specific simulations presented here relate to the real world spread of COVID-19? How plausible is it that the data needed for this model will be available? What are other possible uses of this model in the context of the COVID-19 pandemic?

Introduction

- Lines 32-38: These sentences are unclear and have a lot of terminology that has not been defined. Could you rewrite them defining terms such as power-law growth and “front propagation”. As it stands, I’m unsure what the hypothesis is and how it relates to the spatial mixing discussed above.

- Lines 49 – 53: These sentences don’t say much about the results and seem to be saying that the model is indeed different from SEIR, which is true by construction. If the authors want to include a sentence describing their results, it should be more specific and describe what is learned from their model in the larger context of transmission dynamics and interventions.

- It would be useful to add a section to the introduction about other models that have attempted to account for spatial heterogeneity, quarantine, and heterogeneous mixing. This will allow the authors to make more explicit how their model is novel and/or adds to the existing literature.

Methods:

- Line 115: This section should be updated to include more recent literature estimating these parameters across various populations and time periods. There have been a myriad of studies that estimate these parameters and the authors should provide justification for their parameter value choices.

- Line 124-125: These parameter values are very uncertain, so the authors should use sensitivity analyses or multiple simulations to explore how sensitive the results are to the chosen values.

- Line 138: I wouldn’t think that the “heat map” data the authors are describing is readily available in many contexts. They should discuss the availability of such data here and in the conclusion, and provide citations and examples of these data, if available.

- The authors should make it clearer in the introduction that they are demonstrating the use of a new model structure but are not relating it to any real population. That said, why are the authors not focusing on a specific region or population affected by COVID-19?

Results

- Generally, the results should be written more for a general audience. For each simulation, the authors should make it clear why the examples were chosen and how they relate to real-world transmission situations. What do the results of each simulation tell us about COVID-19 transmission? For example, what would cause each of these simulation scenarios to happen in the real world? Have there been examples of such transmission scenarios the authors can point to? What are the implications of the initial conditions of the simulations for how we interpret the results?

- All heatmap figures are hard to understand due to a lack of legend and labeling. The axes (currently X and Y) of each panel need to be labeled and the color scheme needs to be defined and labeled.

- If the authors want to do simulations in a city, why not use real population density data from a relevant city experiencing a COVID-19 outbreak?

- I think the city analyses are useful and more relevant to real life, but the decisions about how to initiate infections should be explained more in terms of real-life situations. How would these initial conditions occur in the real world?

- Are “belt quarantine” and “area lockdown” real strategies used to fight the spread of COVID-19? Please provide examples and citations for these strategies. If they are not common strategies, can the authors include simulations of more commonly used quarantine strategies?

Conclusion

- This section needs more discussion of existing literature and models that account for spatial heterogeneity and estimate the impacts of quarantine on transmission. How do the results from this model/analysis compare with existing literature? How does this paper add to our understanding of quarantine interventions and transmission dynamics?

- As mentioned above, the authors need to add more in-depth discussion of the data needed for this model to be operationalized. I’m not familiar with what they call “heat maps” and am not sure how available they will be. Please add some information about what exactly these data are and where they are available.

- It would also be useful for the authors to more explicitly discuss the potential uses of their model in the context of the COVID-19 pandemic.

Minor comment: The authors should edit the writing to make sure the grammar is correct throughout the paper.

Reviewer #3: PONE-D-20-21655

Page 2 Lines 8-10: "Infectious disease spreading models are largely based on the assumption of perfect and continuous \\stirring", similar to the one used to describe the kinetics of spatially-uniform chemical reactions. In particular, the well-known susceptible-exposed-infectious-recovered (SEIR) model, builds on this assumption."

This is true in some cases, but is very limited. The term SEIR just refers to the compartments which are used, and many extensions are available in both the deterministic and stochastic literature (the authors note a few). It would be clearer to call this the homogenous-mixing or baseline SEIR model.

It seems that more information about how the proposed techniques differ from other deterministic approaches which appear superficially similar. A few examples:

https://aip.scitation.org/doi/full/10.1063/1.5116807

https://link.springer.com/article/10.1007/s11587-013-0151-y

https://arxiv.org/abs/2005.03499

> Page 2 line 31 "often require refitting the infection rate constant as the epidemic progresses"

I don't believe this should neccessarily be listed as a negative - in real epidemics/pandemics, there is usually very little reason to believe that epidemic intensity (which involves infectiousness as well as contact rates) is constant over space or time. The authors later note this and express the idea more clearly, but I think this could be clarified.

I think the manuscript would be greatly enhanced by a more detailed explanation of the degree of model fit, and any appropriate sensitivity to the driving parameters. In addition, the discussion should address the following issues:

1. Application of this technique to areas of sparse spread/inflated zero counts

2. Limitations concerning the use of incomplete testing data for validation

3. People travel great distances, which would seem to limit the utility of this model. Can it be applied to large areas where non-contiguous spreading events occur, or would that require generalization?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: PONE_Rev_Model for the inhomogeneous spatial spreading of COVID19.docx

Decision Letter 1

Emanuele Giorgi

2 Dec 2020

PONE-D-20-21655R1

Epidemiological model for the inhomogeneous spatial spreading of COVID-19 and other diseases

PLOS ONE

Dear Dr. Tsori,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jan 16 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Emanuele Giorgi

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: (No Response)

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: I Don't Know

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have put forth a great effort to address the comments of the reviewers, and should be commended. In particular, I appreciate the addition of the supplement as well as the clear outlining of quarantine strategies and model scenarios. I have some remaining minor comments, regarding clarity:

Introduction:

Substantial edits were made to the introduction in order to make the work more accessible to the general readership. However, some improvements could still be made to improve readability. In particular:

- Line 12 "Some extensions of SEIR-like models that account for spatial variability employed diffusion processes for the different sub-populations". It is unclear to me here what specifically is diffusing. I also don't think the reference to wildlife is particularly helpful. One suggestion-- these sentences might be replaced with sentences such as: "Some extensions of SEIR-like models account for spatial variability by employing diffusion processes on contact networks, in which an infection that appears at one node of a network spreads throughout the system through inter-node transmissions. However, this diffusive movement of disease through a population fails to capture human disease dynamics as human movement is not uniform".

- Line 21: is it possible to replace “such” with “homogenous”? I think it would be helpful to have this term introduced early on since it is highlighted throughout.

Results/Abstract:

- I appreciate the additional comparison of the infection heat maps to data from South Carolina. However, while reference to South Carolina appears in the abstract, no mention of it is in the text (with the exception of its reference in lines 70, 435, 475, 478). When referenced, it would be helpful if it would be clear that the "publicly available resources" or "heat maps appearing online" is from South Carolina.

- Relatedly, the citation for [35] appears to be annotated. I also was unable to visit the link and received an error. I understand there are challenges with changing links and the authors should not be held to this. If, however, it is possible to screenshot some instances of the video heat map to include in the supplement, this could alleviate this issue.

Discussion:

- The authors now use "sub-populations" to refer to the compartments of the SEPIR model. However, in many places (e.g., paragraph starting at line 422; paragraph starting at line 468) “populations” are still used rather than “sub-populations”

- Line 457: “supporting” our model conclusions, rather than “confirming” our model conclusions

- Line 492: define “red cities” instead of using jargon (e.g. "cities labeled 'red' for the highest risk tier")

Reviewer #2: The authors have carefully and successfully responded to my initial comments, and I believe the manuscript is much stronger. I only have a few minor remaining comments:

Line 51: Can you reword the hypothesis? Specifically, what are you referring to with “spatial growth of infected geographical domains”? Can you use more general language to make this clearer?

Line 69: What sorts of publicly available resources are you referring to here? It would be good to be more specific about the data products you are emulating and how those data can be used. A few specific examples such as the South Carolina Health department data would be useful in the text.

Line 11: I like this added text to provide context for the model but I think it could still use more detail. For example, as it is written, it isn’t clear why diffusion processes “fail to describe human behavior”. Could you describe why this is inappropriate more fully?

I appreciate the added sensitivity analyses for parameter values. Could you add a description of the results to the main text? What do you mean by “qualitative similarities” in the results? It would be good to describe specifically how the results change with different specified parameter values.

I like the statement that “the main purpose of the paper is to present our novel model to allow researchers working in close contact with authorities to apply it in their own countries” and believe it should be placed at the end of the Introduction section to make it clear what the scope of the paper is.

Table 1 is very useful to guide readers. Could the authors relabel the second and third columns using words? E.g., “Population density, n” and “initial infections” or something similar.

The figures and legends are much clearer now, but I am still unsure why the displayed time steps are different for each figure? I understand there are space constraints but maybe the authors could justify the choice of displayed time steps. I would also make sure that “x” and “y” are clearly defined in all figure legends.

There remain some grammar and spelling mistakes in the text. It would be good to carefully review.

Reviewer #3: The authors have made a number of important improvements, and clarified some of the issues which were brought up. There are some remaining issues to be addressed, however.

There are still some problematic claims about the existence of spatially heterogeneous techniques (e.g., "Moreover, these models do not include spatially-dependent infection spreading parameters, which are required to model geographically local quarantine.")

1. Metapopulation models routinely allow spatially-dependent infection parameters.

https://www.sciencedirect.com/science/article/pii/S0022519396900429

Part of the objection here is to the strength of the language used describing the limitations of existing models (particularly since the authors don't really engage the stochastic modelling literature much). Using a rich spatially heterogeneous parameterization of a compartmental epidemic model is not terribly difficult to do from a forward-simulation perspective, but often becomes computationally impractical when attempts at fitting a model to actual data are made (especially where formal inference is desired). It is certainly true that the vast majority of work published recently on COVID makes problematic assumptions like homogeneous mixing, but that's largely not a theoretical problem, but a practical one. This distinction should be clearer.

2. In the previous review, I asked that the discussion address "application of this technique to areas of sparse spread/inflated zero counts" - I apologize if I was insufficiently clear.

In response, the authors discuss initial conditions, but the question was intended to be more about the application. In real epidemics, regions of low and varying population density often give rise to chaotic, rather than smooth, disease outcomes - virtually identical communities with the same level of ongoing spread observe vastly (on a proportional scale) different outcomes due to things like cluster outbreaks (long term care facilities, in the case of COVID) and super-spreader events. Unless I'm mistaken, the proposed deterministic technique could run into trouble when fit to actual data in such regions, particularly where the grid produces regions of sparse population. To be clear, I'm not suggesting that this limitation be addressed by the current work, but I think that it should probably be addressed in the discussion.

3. Myself and several reviewers noted the issue of long-distance transmission, and the authors added the language: "It should be noted that when normal life conditions are restored, our model has to be modified to include far-distance traveling; this can be readily done using long distance infections, which we defer to future work."

I don't think this sufficiently addresses the concern. Even in areas with strict lockdowns, people routinely travel greater distances than 1km, whether essential workers or performing essential tasks like grocery shopping. Coupled with the smooth nature of the deterministic framework, this seems like a very important limitation worthy of additional discussion.

4. When addressing the lack of uniform testing, the authors added the language: "Incomplete testing data is not going to severely hamper our predictions so long as it is uniform in space (e.g., only 10% of the COVID-19 positives are detected everywhere), and in general we may expect this to be so within a certain country where a uniform testing policy is adopted. Obviously, absolute predicted numbers will not be obtained without the use of an adjusting factor between tested and predicted numbers."

There are a couple of things which come to mind that may be worth addressing here as well:

a. For COVID specifically, testing was non-uniform over time essentially everywhere, due to the availability of tests. In many places spatial non-uniformity was/is observed as well.

b. It's not clear to me that the lack of complete testing is a non-issue even if it is uniform - it seems that the dynamics on which the model is based perform very differently for different infectious fractions. It may be the case that the proposed model is sufficiently flexible to be fit to reported cases despite this limitation, but I'm not seeing that explicitly demonstrated in the manuscript. If I've missed something this would be helpful to clarify in the discussion section.

5. I also agree with the other reviewers that "population" and "sub-population" are confusing terms to use for disease states, since they conflict with the common practice of introducing strata into metapopulation style models (for example, age and geographic strata).

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 2

Emanuele Giorgi

13 Jan 2021

Epidemiological model for the inhomogeneous spatial spreading of COVID-19 and other diseases

PONE-D-20-21655R2

Dear Dr. Tsori,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Emanuele Giorgi

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Emanuele Giorgi

1 Feb 2021

PONE-D-20-21655R2

Epidemiological model for the inhomogeneous spatial spreading of COVID-19 and other diseases 

Dear Dr. Tsori:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Emanuele Giorgi

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File

    (PDF)

    Attachment

    Submitted filename: PONE_Rev_Model for the inhomogeneous spatial spreading of COVID19.docx

    Attachment

    Submitted filename: response_to_reviewers.docx

    Attachment

    Submitted filename: response_to_reviewers_2.docx

    Data Availability Statement

    All figures are available from the medRxiv database (https://doi.org/10.1101/2020.07.08.20148767).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES