Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2021 Aug 9;186:115710. doi: 10.1016/j.eswa.2021.115710

COVID-19 and other viruses: Holding back its spreading by massive testing

José L Sainz-Pardo 1,, José Valero 1
PMCID: PMC8351357  PMID: 34393387

Abstract

The experience of Singapore and South Korea makes it clear that under certain circumstances massive testing is an effective way for containing the advance of the COVID-19.

In this paper, we propose a modified SEIR model which takes into account tracing and massive testing, proving theoretically that more tracing and testing implies a reduction of the total number of infected people in the long run. We apply this model to the spreading of the first wave of the disease in Spain, obtaining numerical results.

After that, we introduce a heuristic approach in order to minimize the COVID-19 spreading by planning effective test distributions among the populations of a region over a period of time. As an application, the impact of distributing tests among the counties of New York according to this method is computed in terms of the number of saved infected individuals.

MSC: 90-08, 34D05, 92D30

Keywords: COVID-19, Coronavirus disease, Optimal testing, SIR model, SEIR model, Epidemic

1. Introduction

The emergence of the coronavirus disease 2019 (COVID-19) and its mutations motivated actions on the inhabitants of several countries like isolation, social distance and others. The purpose of these restrictions is to slow down the spreading of the pandemic in order not to collapse health systems. Carrying out massive testing is a complementary way to control the pandemic. Throughout this article we will analyse the impact of testing on the number of infections and we will explain how we have developed an expert system to obtain test distributions which minimize the number of infections within a region and for a temporal horizon.

The classical SIR model and its variations are a powerful mathematical tool for predicting the evolution of epidemics around the world. The amount of literature studying the properties of solutions of such models is huge, see e.g. Brauer and Castillo-Chávez, 2012, Ji and Jiang, 2017, Jiao et al., 2020, Kuniya and Nakata, 2012, Li and Muldowney, 1995, Zhang and Teng, 2007, Zhao et al., 2017 and the references therein among many others. In the present, much effort is taken in order to estimate the evolution of the COVID-19 pandemic, see e.g. Annas et al., 2020, Arcede et al., 2020, Britton et al., 2005, Chen, Lu et al., 2020, Chen, Rui et al., 2020, Gomes et al., 2020, Ndaïrou et al., 2020, Roda et al., 2020, Sauter and Pacheco, 2020 and Xu et al. (2020).

The coefficients which are involved in the system of differential equations of a SIR model are constants in the simplest situation. However, it is more realistic in order to estimate the evolution of an epidemic to consider them as functions of time, as given in some of the above references. Moreover, when we need to take into account the impact of governmental actions like confinements, quarantines, restrictions on mobility and travelling and so on, the coefficients, especially the rate of transmission, can change abruptly. Thus, it is better to define them piecewise by choosing appropriate time intervals depending on the moments in which the governmental restrictions are more severe or, on the contrary, become more relaxed. This approach has been considered in several papers estimating the evolution of the COVID-19 pandemic in several countries, see Falco et al., 2020, Kwuimy et al., 2020, Lin et al., 2020, Mushayabasa et al., 2020, Niazi et al., 2020 and Tang, Bragazzi et al. (2020). Also, the parameters can depend on time due to seasonality (see He et al. (2020)).

Apart from tough measures leading to a lockdown, tracing, massive testing and isolation are very effective tools which help to contain the spread of the illness even without restrictive social distancing, as we can see in the examples of Singapore, South Korea and some Italian towns (see Romagnani et al. (2020)). The main drawback of massive testing is the economic burden. However, as pointed out in Eichenbaum et al. (2020) in the long run it pays off, as these policies can dramatically reduce the economic costs of the epidemic.

Thus, it is quite interesting to modify the SIR or SEIR model in order to measure their influence in the evolution of the pandemic and also to determine the best way to carry out massive testing. In Berger et al. (2020) the SEIR model is modified by taking into account testing and quarantine measures. It is shown that increasing testing the governments could relax the quarantine conditions (implementing a targeted and more efficient quarantine), so that the economic and social costs would be smaller while maintaining constant the human costs. In Wang (2020) the author makes a qualitative analysis of a SIR model in which the infected individuals are divided into two groups: those undetected and those detected by means of some tests, which have a lower rate of transmission of the disease due to quarantine measures. It is shown that testing reduces the number of infections in the long term, avoiding in this way herd immunity. In Niazi et al. (2020) a model taking into account random massive tests is given. The impact of testing is analysed, obtaining the optimal values for the number of tests to be carried out each day in two possible testing policies to control the epidemic. In these papers, detection by tracing and random testing are put together. In Tang, Wang et al. (2020) a model in which contact tracing and quarantine are considered is used in order to estimate the evolution of the epidemic in Wuhan. In Ubaru et al. (2020) the problem of optimal testing for individuals under limited testing and tracing resources is studied by using a dynamic-graph based SEIR epidemiological model.

In this paper we modify the SEIR model in such a way that the impact of tracing and random testing can be measured separately. For this aim we assume that infected people who are detected are posed in quarantine and then they cannot infect other people any more; hence, only undetected infected individuals are able to spread the epidemic (of course, this is an ideal situation, being more realistic to consider that the rate of infection of those detected is lower, as given in Li et al. (2020)). In our model we differentiate two kind of detected infected individuals: people which are detected because either they are symptomatic or are direct contacts of infected individuals (and then detected by tracing) and people which are detected by means of massive random testing. These two methods of detection are complementary, as by tracing one detects people with symptoms and their contacts, whereas by testing one detects asymptomatic individuals that otherwise would remain undetected and would continue infecting other susceptible individuals.

In Section 3, we make a qualitative analysis of the model in the case where the coefficients are constants. We show first that, as in the standard SEIR model, all the solutions converge as times increases to a fixed point. The number of susceptible individuals in the long term determines the size of the epidemic. We prove that the limit number of susceptible individuals is an increasing function of the parameters of tracing and testing. That is, the more tracing and testing, the less number of infected people in the long term.

In Section 4, we estimate first the parameters of the model during the first wave of the COVID-19 epidemic in Spain if no massive testing is used. We consider the case where the parameters are functions of time and are defined piecewise by taking into account the moments where confinement restrictions were established or relaxed in Spain between March and June of 2020. After that we study the evolution of the epidemic when a constant number of tests is carried out each day, calculating the number of infected individuals which are saved in the long term for several values for the parameters which characterize tracing and testing.

However, a constant distribution of tests per day is far to be optimal. Moreover, it could be better to distribute the tests among different populations in a non-proportional way. Thus, the main purpose of this work is to develop an expert system that provides an effective distribution of tests among the populations of a region over a period of time. So, in Section 5 a heuristic method based on the proposed modified SEIR model is introduced in order to optimize the distribution of tests. In this section it is also explained how to estimate the parameters of the model by using the Differential Evolution technique. This technique was firstly introduced in Storn (1996) as an evolutionary method for optimizing nonlinear functions. Also, it has been employed for estimating parameters of infectious diseases models such as SIR, SIS, SEIR, SEIS and others Kotyrba et al. (2015). For example, it has been applied to estimate the SIR parameters of the COVID-19 pandemic in Italy (see Iorio and Li (2020)). Several improvements, versions and applications of this technique like the ones in Wang et al. (2015) and Yi et al. (2016) or the version employed in Iorio and Li (2006), which will be the one used in this paper, can be found in the literature.

Finally, in Section 6 we study the effectiveness of the proposed heuristic approach by an extensive computational analysis of the spreading of the COVID-19 pandemic in the New York counties during the months of April, May and June of 2020. The results of our distribution are also compared with a distribution which is homogeneous in time and proportional to the size of each population.

2. The model

The classical SEIR model is the following

dSdt=βNSI,dEdt=βNSIσE,dIdt=σEγI,dRdt=γI, (1)

where N is the size of the population, St is the number of the susceptible individuals to the disease, E(t) is the number of exposed people assuming that in the incubation period they do not infect anyone, It is the number of currently infected individuals which are able to infect other people, R(t) is the number of individuals that have been infected and then removed from the possibility of being infected again or of spreading infection (which includes dead, recovered people and those in quarantine or with immunity to the disease). The constant β is the average number of contacts per person per time, γ is the rate of removal (1/γ is the average time after which an infected individual is removed), and 1/σ is the average time of incubation of the disease. All these parameters are non-negative.

For our purposes we need to modify system (1) in several ways.

First, in a real situation the coefficients of the model are not constants but functions of time. Moreover, these functions should not be continuous in general, because in an epidemic outburst the governments impose restrictive measures to the population leading to a sudden change of the rate of transmission.

Second, as our intention is to analyse the efficiency of a testing method, the variable It will consist of all currently infected individuals (not only of those able to infect) and a new variable Dt, the number of currently infected people which are detected, will be introduced. We will assume the ideal situation in which any detected individual is placed in quarantine, so this person is not able to infect anyone from that moment. Thus, the number of people with the capacity to infect others is ItDt. Also, we do not take into account that there could be people which are immune, so the variable Rt will contain only dead and recovered individuals but neither those in quarantine nor immune ones.

Third, we aim to estimate dead and recovered people among the detected ones separately, so Rt is split into three variables:

  • F1t: number of dead individuals among the detected ones;

  • R1t: number of recovered individuals among the detected ones;

  • Lt: number of removed individuals among the undetected ones.

As a first step, we consider the situation where mainly people with symptoms and their direct contacts are detected, but there is no plan for massive testing. The rate of detection is given by the variable ρt (0<ρt<1) and, therefore, Dt=ρ(t)I(t).

With these new variables at hand system (1) becomes:

dSdt=βtNSt(1ρt)It,
dEdt=βtNSt(1ρt)ItσEt,
dIdt=σE(t)ρtγ1(t)+γ2(t)+(1ρt)γ¯tIt,
dF1dt=γ1(t)ρtI(t),
dR1dt=γ2tρtI(t),
dLdt=γ¯(t)(1ρt)I(t), (2)

and the currently detected individuals are given by

D(t)=ρtIt. (3)

Here, γ1t is the rate of mortality of detected people at moment t, γ2t stands for the rate of recovery of detected people at moment t, whereas γ¯t is the rate of removal among those undetected.

In a second step, we describe the situation where a massive testing is planned in order to increase the number of detected people, which can be placed then in quarantine. For this aim we define the new variable Tt, which stand for the number of people that have been detected at time t by the test programme among those currently infected It.

Hence, system (2) becomes:

dSdt=βtNSt(1ρt)ItT(t),
dEdt=βtNSt(1ρt)ItT(t)σEt,
dIdt=σE(t)ρtγ1(t)+γ2(t)+(1ρt)γ¯tIt,
dTdt=Δ(t)γ˜1(t)+γ˜2(t)T(t),
dF1dt=γ1(t)ρtI(t)+γ˜1(t)T(t),
dR1dt=γ2tρtI(t)+γ˜2(t)T(t),
dLdt=γ¯(t)(1ρt)I(t)(γ˜1t+γ˜2t)T(t), (4)

where Δt is the number of detected people by testing at day t and γ˜1(t) (γ˜2(t)) is the rate of death (recovery) among those detected by testing. The currently detected people are now calculated by

D(t)=ρtIt+Tt. (5)

Assuming that the individuals for the testing are chosen randomly, the variable Δ(t) is approximated by

Δ(t)α(t)(1ρ(t))I(t)T(t)NR1(t)F1tρ(t)I(t)T(t),

with α(t)0 being the number of tests performed at day t. This formula can be simplified if we assume that NR1(t)+F1t+ρI(t)+T(t), which is true in big populations. Hence,

Δ(t)α(t)(1ρ(t))I(t)T(t)N. (6)

For simplicity we could assume that the rates of death and recovery are the same among the detected and the undetected infected people. In such a case, γ¯(t)=γ1t+γ2t, and γ˜1t=γ1t,γ˜2t=γ2t. Thus, model (4) would be the following:

dSdt=βtNSt(1ρt)ItT(t),
dEdt=βtNSt(1ρt)ItT(t)σEt,
dIdt=σE(t)γ1(t)+γ2(t)It,
dTdt=Δ(t)γ1(t)+γ2(t)T(t),
dF1dt=γ1(t)ρtI(t)+T(t),
dR1dt=γ2tρtI(t)+T(t),
dLdt=(γ1t+γ2t)(1ρt)I(t)T(t). (7)

In this model we have not taken into account some important factors like the density of the population or mobility. It is shown in Li et al. (2018) that there is a weak relation between the density of the population and the rate of infection, being this rate greater if the population is more dense. Also, mobility plays a key role in the spreading of an epidemic as explained in Li, Wang et al. (2017) (see also Li, Dong et al. (2017) for its influence in other contexts). In this sense, we would like to do some comments. First, the inclusion of these factors would lead to an increment in the number of parameters to be estimated, which is problematic due to the limited amount of available data. Second, the impact of mobility is more important at the initial stage, when the virus has started its spreading. However, we estimate the model in a period when COVID-19 had already reached all the areas of the country or region and, moreover, the governmental restrictions reduced drastically people mobility. Finally, the influence of these factors can be absorbed by the other parameters of the model.

On the other hand, it is known that the latent period for COVID-19 is likely to be shorter that the exposed period (see Rai et al. (2021)), which means that infected people could transmit the virus before showing symptoms. Obviously, the SEIR model can be modified in order to take into account this fact (see Brauer and Castillo-Chávez (2012)). However, this would lead to adding a new parameter without having any observed data about the exposed individuals. Also, as before, in our model this effect is absorbed by the parameter β.

Thus, we prefer not to consider these factors explicitly and keep a simpler model which gives good enough estimations in order to study the impact of tracing and testing in the evolution of a pandemic.

3. Qualitative analysis in the case of constant coefficients

Our first aim is to prove theoretically that massive testing helps reducing the number of infected people in the long-term.

The qualitative behaviour of the solutions of system (1) is simple and well known (see Brauer and Castillo-Chávez (2012)). There exists an interval of fixed points given by

S,E,I,R=S,0,0,NS,S[0,N],

and any solution with non-negative initial condition S0,E0,I0,R0 satisfying that N=S0+E0+I0+R0 converges to one of this fixed points as the time t tends to +. Supposing that R0=0, the limit point is determined by the initial value of the susceptible S and is given by the equation

logS0S=βγ1SN. (8)

We consider system (7) in the following particular situation:

  • The coefficients β,ρ,σ,γ1,γ2,γ˜1,γ˜2, γ¯ are constant.

  • The rates of death and recovery are the same among the detected and the undetected infected people, so γ¯=γ=γ1+γ2,γ˜1=γ1,γ˜2=γ2.

  • The number of tests which are carried out per day α is constant and positive and the approximation (6) is valid.

With such assumptions and putting together the variables F1,R1,L, system (7) reads as

dSdt=βNS((1ρ)IT),
dEdt=βNS((1ρ)IT)σE,
dIdt=σEγI,
dTdt=α1ρITNγT,
dRdt=γI. (9)

What we want to show is that the solutions of system (9) behave in the same way as those of system (1) and that the limit value S is increasing with respect to the parameters α and ρ, showing in this way theoretically that more detection implies a lower number of infected people in the long-term.

The initial condition has to satisfy that S0+E0+I0+R0=N and the variable W=S+E+I+R remains equal to N for every time t0. It is straightforward to see that the unique fixed points of this system are:

S,E,I,T,R=S,0,0,0,NS,S[0,N].

We observe also that the initial condition has to be non-negative. This implies that the solution is non-negative for every forward moment of time if, moreover, 1ρI0T00.

Lemma 1

If S0,E0,I0,T0,R00 and 1ρI0T00 , then St,Et,It,Tt,Rt,yt=1ρItTt0 for all t0 . Also, St>0 , for t0 , if S0>0 , whereas St0 if S0=0 .

Moreover, if the solution is not a fixed point, then It,Tt,Rt>0 for all t>0 , and the following statements hold true:

  • 1.

    If eitherE0>0orE0=0, S0>0 and y01ρI0T0>0 , then Et,yt>0 for all t>0 .

  • 2.

    If E0=0 , S0=0 and y0>0 , then Et=0 , yt>0 for all t>0 .

  • 3.

    If E0=0 and y0=0 , then Etyt0 .

Proof

It is easy to see from the first equation that St0 for all t0. Moreover, St>0, for all t0, when S0>0 and St0 if S0=0.

If E0=I0=0, we have a fixed point. Thus, either I0>0 or E0>0.

Let E0>0. If Et>0 for all t0, then it follows from the third and fourth equations in (9) that

I(t)=I0eγt+σ0teγtrErdr>0, (10)
T(t)=T0eαN+γt+α1ρN0teαN+γtrIrdr>0, (11)

for all t>0, and the function yt=1ρItTt satisfies

dydt+αN+γy=1ρσE, (12)

so that

yt=y0eαN+γt+0teαN+γ(tr)1ρσErdr>0,t>0. (13)

From the last equation in (9) we infer that Rt>0, for every t>0, as well.

Next, we prove that the function Et cannot vanish at any point if E0>0. Assume on the contrary the existence of a first moment of time t0>0 such that Et0=0, so that Et>0 for all t[0,t0). By the equality in (13) we conclude that yt>0 for t[0,t0]. Hence, from the second equation in (9) we get

Et0=E0eσt0+βN0t0eσ(t0r)Sr((1ρ)IrTr)dr>0, (14)

which is a contradiction.

Let now I0>0 and E0=0. Here we have to study three cases: (1) y0>0,S0>0;(2) y0>0,S0=0; (3) y0=0.

Let y0>0, S0>0. By continuity there is ɛ0>0 such that It,yt>0 for t[0,ɛ0]. Making use of (11) we obtain that Tt>0 for all t(0,ɛ0] as well. From the second equation in (9) we have

Et=E0eσt+βN0teσ(tr)Sryrdr>0, for all t(0,ɛ0],

as St>0 for every t0. Thus, taking any 0<ɛɛ0 we infer arguing as before but in the interval [ɛ,t] that Et>0, for all tɛ, and that It,yt,Tt,R(t)>0, for all tɛ, as well. Since ɛ is arbitrarily small, Et,It,yt,Tt,Rt>0, for all t>0.

Let y0>0,S0=0. In this case, the first two equations in (9) imply that St=Et=0 for all t0. Then from the third equation in (9), (12) we obtain that It,yt>0 for every t0. Finally, by (11) and using the last equation in (9) the inequalities Tt,Rt>0, for all t>0, hold true.

Let y0=0. The fact that Et,yt0,0 is the unique solution of the problem

dEdt=βNS(t)yσE,dydt=αN+γy+1ρσE,E0=y0=0,

implies that yt0 and Et0. Then from the last three equations in (9) and T0=1ρI0>0 we conclude that It, Rt,Tt>0 for t>0. ■

Theorem 2

Let S0,E0,I0,T0,R00 and 1ρI0T00 . Then every solution converges as t+ to one fixed point. Moreover, S is determined uniquely by S0 , R0 and T0 as the solution of the following equation:

β(1ρ)α+γNSlogS=β(1ρ)α+γN(NR0)T0βα+γNlogS0. (15)

Remark 3

If the particular case where α=ρ=R0=T0=0 , we obtain (8) .

Proof

We divide the proof into several steps.

First, we state that It0,Tt0 as t+.

Summing up the first three equations we have

ddtS+E+I=γI. (16)

Then Wt=St+Et+It is a non-negative smooth non-increasing function, so it has a limit as t+. Also, as

Wt=γIt=γσEtγItγσN=K,K0,

the derivative of W tends to zero. Indeed, by contradiction assume that there is ν>0 and a sequence tn+ such that Wtn2ν. For ttn it is clear that

WtWtn=Wt˜ttnKttn.

Hence,

Wt2ν+Ktnt,

so Wtν if tJn=[tnνK,tn]. The sequence {tn} can be chosen in such a way that the intervals Jn are disjoint. In each of them the function Wt decreases at least in the quantity ν2K and outside them the function does not increase anywhere. Therefore,

WtnW0nν2K for any n.

For n great enough the function becomes negative, which is not possible. Thus, (16) implies that It0 as t+.

As by Lemma 1 we know that 0Tt1ρIt, it follows that Tt0 as t+.

Second, we establish that Et0 as t+.

We know that Wt=St+Et+Itη, It0 as t+. Also, as St is non-increasing and bounded from below, St converges to some S. Hence, EtηS. We will prove that ηS=0. If not, there would exists t0>0 such that

EtηS2>0 for all tt0.

From the second equation in (9) and Tt0, It0 we infer the existence of t1t0 for which

dEdtσηS4 for tt1.

Thus,

EtEt1σηS4tt1,

which is not possible because Et0.

The fact that every solution converges to a fixed point has been established.

Finally, let us prove (15). From (16) and the above convergences we have

0dSdt+dEdt+dIdtdt=γ0Itdt=SS0+E0+I0=SN+R0.

Integrating the fourth and first equations in (9) and putting all together we obtain

0dTdtdt=T0=α1ρN0ItdtαN+γ0Ttdt,
logSlogS0=β1ρN0Itdt+βN0Ttdt=β1ρN0Itdt+βα1ρN(α+γN)0Itdt+T0βα+γN=β1ρN1αα+γNSN+R0γ+T0βα+γN=β1ρα+γNNSR0+T0βα+γN,

giving rise to relation (15). ■

We aim now to analyse expression (15) in order to show that S is an increasing function of α and ρ. We denote Sα the value of S for α assuming the rest of the parameters being constant. In the same way we define the function Sρ.

Theorem 4

We assume the conditions of Theorem 2 and that S0>0 and either E0>0 or y0>0 . Then Sα1>Sα2 , if α1>α2 , and Sρ1>Sρ2 , if ρ1>ρ2 .

Proof

We write (15) in the form

α+γNlogSS0+β1ρNR0ST0β=0.

We observe that Lemma 1 implies that yt>0, for t>0, so from the first equation in (9) we obtain that St is strictly decreasing and then S<S0. Analysing the function

fx=α+γNlogxS0+β1ρNR0xT0β

we deduce that:

  • fS0=β1ρNR0S0T0β=β1ρ(I0+E0)T00.

  • f has a maximum at x0=α+γNβ1ρ, fx00.

  • f is increasing if 0<x<x0 and fx as x0+.

  • f is decreasing if x>x0 and fx as x+.

We state that fx0>0. Indeed, if fx0=0, then x0=S0 and this is moreover the only point where the function vanishes. Then S0=S, which is impossible because St is a strictly decreasing function.

Therefore, the above properties imply that the equation fx=0 has exactly two solutions: 0<x1<x0,x2>x0. It follows from S<S0 and S0(x1,x2] that S=x1.

Denote by gαx the function fx for the value α. It is easy to see that

gα1x<gα2x for any x<S0 if α1>α2,

assuming that the other parameters are constant. From here we conclude that Sα1>Sα2 if α1>α2.

In the same way, considering the functions hρx we obtain that

hρ1x<hρ2x for any x(0,NR0] if ρ1>ρ2,

while the other parameters remain constant. Hence, Sρ1>Sρ2 if ρ1>ρ2.

In the epidemic models a crucial parameter is the basic reproduction number, which is the expected number of secondary cases produced, in a completely susceptible population, by a typical infective individual over the course of its infection period.

The variables S and R are disease free, whereas E,I and T are the infective variables. We consider the equilibrium N,0,0,0,0 and the linearization around it of the subsystem of (9) corresponding to the infective variables, which is given by

dxdt=σβ1ρβσγ00α1ρNαNγx=Jx. (17)

Following van den Driessche and Watmough (2002) we calculate the reproduction number R0 by splitting the matrix J into the rest of the Jacobian matrix associated to the rate of new infections F and the one associated to the net rate out of the compartments V:

J=FV=0β1ρβ0000α1ρNαNσ00σγ000γ.

The basic reproduction number is equal to the spectral radius of the matrix FV1 when this matrix is non-negative. As the eigenvalues of the matrix

FV1=β1ργβ1ργβγ000α1ργNα1ργNαγN

are λ1=0 and λ2=1γβ1ραN, we obtain that

R0=1γβ1ραN

provided that this quantity is non-negative, which is the usual situation as α/N is small in great populations.

We can easily see that the zero solution of system (17) is asymptotically stable if and only if R0<1. Indeed, the characteristic equation for the eigenvalues of the matrix J is the following:

λ3+a2λ2+a1λ+a0=0,

where

a2=σ+2γ+αN,
a1=σ+γαN+γ+σγ,
a0=σγαN+γβ1ρ.

According to the Routh–Hurwitz stability criterion, all the eigenvalues have negative real part if and only if a2,a0>0 and a2a1a0>0. It is clear that

a2>0,
a2a1a0=σγγ+σ+σ+2γ+αNσ+γαN+γ+σγβ1ρ>0,
a0=σγ21R0.

Thus, the result follows.

When the number of susceptible individuals is less than N, we replace R0 by the effective reproduction number at time t, given by

Rt=1γStNβ1ραN.

Finally, we observe that R0<1 is equivalent to the inequality

β1ργ+αN<1.

Thus, the coefficient R˜0=β1ργ+αN could be also used as a reproduction number, which corresponds to the one used in Niazi et al. (2020).

4. Validation of the model by applying it to the COVID-19 spreading in Spain

In this section, we estimate the parameters of model (2) during the first wave of the COVID-19 pandemic in Spain. After estimating the parameters, we introduce the massive test detection. Solving system (4) with a constant number of test per day we show numerically that the final number of susceptible S would have been greater had a program for testing been implemented. We show also the increasing dependence of S with resect to the parameter ρ.

We will assume that the rates of death and recovery are the same among the detected and the undetected infected people. Therefore, γ¯(t)=γ1t+γ2t, and γ˜1t=γ1t,γ˜2t=γ2t.

Taking into account that during the pandemic the government implemented at certain moments of time restrictive measures of confinement leading to reduction of mobility, following Gutierrez and Varona, 2020, Lin et al., 2020 and Tang, Bragazzi et al. (2020) the rate of transmission βt will be a piecewise continuous function with a finite number of discontinuities such that in each interval of continuity the form of the function reads as:

βt=β0β11eαtt0.

We extend this approach to the functions γ1t,γ2t, so that they are also piecewise defined functions of the form:

γi(t)=γ0,iγ1,i1eαitt0.

As observed in Roda et al. (2020) the parameters ρt and βt are somehow dependent, which means that for a given sample there exist several combinations of these parameters that fit well the data. This is the problem of nonidentifiability. To avoid this drawback we use the study of seroprevalence in Spain for the first wave of the epidemic and choose an average value for the parameter ρ, so it is not estimated.

It has been estimated (see Lauer et al. (2019) and Rai et al. (2021)) that the mean value of the incubation period of the virus is about five days, so we take σ=1/5. There are other studies which give a larger period of incubation. For example, in Wu et al. (2020) the estimated value is around six.

Also, we choose an average value for ρ given by the study of seroprevalence in Spain of Instituto de Salud Carlos III (2020). According to this work, at the end of May of 2020 5.2% of the population of Spain had been infected by the virus (which gives about 2,400,000 infected people as the population is 47 millions), whereas an approximate number of 230,000 people were detected by the COVID tests at that moment. Thus, the average rate of detection during the first wave of the pandemic in Spain was approximately equal to 0.1.

We estimate the parameters of the model in the period from February 20, 2020 to May 17, 2020. Taking into account the points of confinement, we split this interval into the following four subintervals: (1) 20/02-12/03; (2) 12/03-1/04; (3) 1/04-21/04; (4) 21/04-17/05.

We need to estimate the parameters of the functions βt,γ1t and γ2t in each subinterval. For this aim we use the observed values of the variables Dt,F1t,R1t, that is, the number of currently infected, dead and recovered people which were detected. We have taken the sample given by the Spanish Health Ministry (see https://github.com/datadista/datasets/tree/master/COVID%2019, sections ccaa_covid19_confirmados_pcr, ccaa_covid19_fallecidos, ccaa_covid19_altas), using only the number of infected people detected by means of a PCR test. The value of the constant α is 0 and the variable Tt is equal to 0 as well (that is, there is no massive testing). The observed value at time ti will be denote by Di,F1i and R1i, respectively. We consider in each interval the pondered average of the euclidean norm of each observed variable:

Error=α1i=1nDiDti2+α2i=1nF1iF1ti2+α3i=1nR1iR1ti2, (18)

where α1+α2+α3=1. We have chosen α1=α2=0.35,α3=0.3.

On the 20th of February the number of detected active infected individuals was 3, so the estimate of the real number of infected subjects is 3/ρ=30. At that moment there were detected neither dead nor recovered people. We assume that there were no removed subjects at all at the initial stage of the pandemic. Hence, the initial value of the problem is given by:

I0=30F0=0,R0=0,L0=0,S0=NI0E0F0R0L0.

As we do not have a hint for the value of E0, we estimate it.

The estimate of the parameters is carried out by means of the minimization of the target function (18) after solving system (2) when the values of the parameters go through a grid of points. The results in each interval of time are the following:

  • 1.

    20/02-12/03: βt=β0=1.04, γ1t=γ0,1=0.0069,γ2t=γ0,2=0.014, E0=160.In this interval, we have looked for constant functions βt,γ1t,γ2t, so β1=γ1,1=γ1,2=0.

  • 2.

    12/03-1/04: βt=0.60.596(1e0.09t21), γ1t=0.0120.001(1e0.05t21), γ2t=0.016+0.04(1e0.025(t21)).

  • 3.

    1/04-21/04: βt=0.040.033(1e0.05(t41)), γ1t=0.00950.008(1e0.065(t41)), γ2t=0.0550.025(1e0.44(t41)).

  • 4.

    21/04-17/05: βt=0.020.0065(1e0.09(t61)), γ1(t)=0.00550.004(1e0.075(t61)), γ2t=0.025+0.01(1e0.93(t61)).

In Fig. 1, Fig. 2, Fig. 3 we can see the estimate of the detected currently infected, dead and recovered individuals over the whole period.

Fig. 1.

Fig. 1

Detected currently infected individuals .

Fig. 2.

Fig. 2

Detected dead individuals .

Fig. 3.

Fig. 3

Detected recovered individuals .

In Fig. 4 we can see the prediction given by the estimate of the first two intervals, that is, using only the observed data until the first of April. The number of currently infected people detected at the peak of the pandemic is lower than the one given in the estimate. This reflects the fact that on 28th of March the Spanish Government established more severe measures of confinement by forbidding any non-essential activity and allows us to know what would have happened without these measures. After some period this restriction was withdrawn, so the slope of the curve of observed values increased again. Despite these changes, the predicted values give a fairly good picture of the future evolution of the epidemic in Spain from the first of April until the end of May. First, the model predicts pretty good the moment of time when the peak of the pandemic curve is reached. Second, it correctly predicts a monotone decayment of the number of active infected individuals during May leading to a low incidence rate at the end of the month.

Fig. 4.

Fig. 4

Detected currently infected individuals .

In Fig. 5 we have the prediction of the currently infected individuals in the fourth interval when we use the observed data of the first three intervals, that is, until the 21th of April. In this case, the predicted values are close to the observed ones.

Fig. 5.

Fig. 5

Detected currently infected individuals .

Further we intend to estimate the impact of a massive random testing on the final number of infected people during the first wave in Spain. For this aim we solve system (7) with three different values of the parameter α; namely, when we carry out 50,000, 100,000 and 150,000 random tests per day.

The approximate limit values of susceptible individuals S in the long run are the following:

graphic file with name fx1001_lrg.jpg

Therefore, the number of infections which are saved with massive testing is 88,000, 171,000 and 250,000, respectively. In Fig. 6 one can see the evolution of the number of susceptible without massive tests and carrying out 100,000 tests per day.

Fig. 6.

Fig. 6

Susceptible individuals .

We can measure also the effect of increasing the value of the parameter ρ while the other parameters remain unchanged, and varying the value of α at the same time as well:

graphic file with name fx1002_lrg.jpg

It is important to observe that we have considered here the simple situation where a constant number of random test is carried out each day. This procedure is very far to be optimal. In the next section we will consider a heuristic method to optimize the distribution of the tests in order to make massive testing much more effective.

5. Distribution approach

In the previous section we have analysed the impact of massive testing when the number of tests which are carried out each day is constant. It is clear that such an homogeneous distribution is not optimal. Therefore, in this section we implement a heuristic method which allows us to increase the number of saved infections. Unlike the previous situation, where we needed to estimate the parameters of the model only once, now it will be necessary to make the estimation a lot of times. Due to this, the parameters have to be estimated using an automatic method. For this aim a genetic algorithm will be implemented.

5.1. Estimation of the parameters

The expert system that we have developed uses the Differential Evolution technique for estimating the parameters of model (7). The differences between this method and other evolutionary algorithms are mainly in the mutation and recombination phases in which weighted differences from the space vector, despite random quantities, are used to obtain perturbations. We have employed the Differential Evolution Algorithm described in Iorio and Li (2006). In this method, a given array of numbers (or genes) containing given particular values of the parameters to be estimated is called a chromosome. The initial population is a set of N chromosomes. It is randomly generated and Algorithm 1 describes the procedure which generates a new population from the current population P G formed by N chromosomes, where we denote by P G, G=0,1,2,, the population at the iterate G. We denote by xi,G, i=1,,N, the chromosome number i in the population P G. This sequence of chromosomes converges to the optimal values for the parameters.

graphic file with name fx1003_lrg.jpg

Algorithm 1 generates one descendant for each chromosome i belonging to the current population P G. Basically three different chromosomes with index r1r2r3i are randomly selected from P G. At Step 3, the descendant u i,G+1 is generated by u i,G+1= x i,G+K( x r3,G x i,G)+F( x r1,G x r2,G). The differential K( x r3,G x i,G) combines x r3,G and x i,G whereas the differential F( x r1,G x r2,G) sets the step size. The coefficients K,F(0,1] are constants.

The selected chromosome is replaced by its descendant if it is better in the sense that the value of a fitness function is lower. As in the previous section, we use as the fitness function the pondered average of the euclidean norm of each observed variable, given in (18). The algorithm is repeated until no replacement happens within a maximal number of successive generations.

5.2. Obtaining the planning distribution

The aim of the distribution method is to plan the distribution of tests among the locations of a region over the instants of time within a temporal horizon, that is, to decide how many tests is better to assign to each location at each instant of time. The objective of this planning is to minimize the total number of infected individuals in a given region and in a temporal interval or, what is the same, to maximize the number of saved infections. The main purpose of such distribution is to plan an effective massive testing combined with tracing.

The distribution method is based on forecasting the number of infected people which would be saved after the assignment of one testing team to one location at one instant of time within the temporal horizon. This procedure has to be repeated for each location and each moment of time. Let K be the number of people tested per instant (day, week or month) by a testing team, L={l1,,le} be the set of locations (counties, towns, etc.), and T={1,2,,M} be the set of instants of time, being t the current instant. Firstly, we calculate the gain matrix G={gl,t}, that forecasts the number of saved infections due to the assignment of K tests at location l and instant t. Secondly, the optimal test distribution is obtained from this matrix according to a heuristic method that provides the distribution matrix D={dl,t}, which indicates the number of tests to be distributed in each location l at each instant t. So, the procedures gain_matrix (Algorithm 2) and test_distribution (Algorithm 3) are used to obtain G and D, respectively.

Algorithm 2 returns the gain matrix G. Initially, the parameters (β(t), ρ(t), σ, etc.) and the initial state variables (S(t), E(t), I(t), etc.) are estimated by the Differential Evolution technique for each location l from the historical data of the coronavirus spreading until the current instant t. Then, for each location we solve system (7) twice with initial conditions at the moment t. Firstly, it is solved with no test assignment. Secondly, it is solved but assigning K tests only to the location l and the instant t+i, 1i14. The difference of infected cases at t+i+14 in both predictions provides the gain value gl,t+i, that is, the number of saved infections corresponding to testing K people in the location l and instant t. However, if the effective reproduction number at time t+i and the location l, Rl,t+i, is lower than 1, it is considered that there is no gain since the pandemic is in a decline phase, so gl,t+i=0. Besides, we observe that given that the circumstances are variable and the expert system uses the procedure in a dynamic way, running it every day, a myopic approach that focuses on obtaining gain values for the next 14 days after the current instant t is used.

graphic file with name fx1004_lrg.jpg

Once the gain matrix is obtained, Algorithm 3 is called in order to heuristically maximize the number of saved infections. So, first of all the location-instant pair (l,t) with largest gain value is chosen. Then, as many tests as possible are allocated to the population l and instant t>t. This quantity dl,t depends on several constraints such as the number of remaining tests, the maximum number of people which are able to be tested at this location per day, the number of available test teams, etc. At Step 4, the quantity of tests assigned to (l,t) is subtracted from the available tests. Then, the procedure carries on locating the second largest gain, and so on. Once all the available tests have been distributed the procedure ends up returning the distribution matrix D.

graphic file with name fx1005_lrg.jpg

We note that it is convenient to refresh the estimate of parameters and the state variables for each moment of time t, being necessary to compute again both the gain and distribution matrices every time new data about the coronavirus spreading are reported. Thus, the proposed approach is used in a dynamic way.

6. Computational experience: distributing tests among the New York counties

In this section we will analyse the computational results of the proposed distribution approach by measuring the number of saved infections and also by comparing it with a distribution of tests which is homogeneous in time and proportional to the size of each population. As an application, we will apply this method to the New York state, which pandemic data are disaggregated by counties and available in Data (2020). The period of time chosen for our study is from the first of April to the first of July of 2020. The reason for choosing it is that it was during this period when the pandemic spreading was most virulent. Although the first case was detected on the first of March, in the majority of the counties there were no registered cases until the 15th of March. The historical data in this period are used in order to simulate the pandemic spreading and also to measure the effectiveness of the possible distributions of tests. In this regard, the distribution obtained by our approach is compared with the distribution of tests which is homogeneous in time and proportional to the number of inhabitants of every county. It is important to observe that as the data about the recovered individuals are not reported, we have estimated them by supposing an average recovery time of 14 days. Then, the historical recovery data have been built by using the cumulative detected cases in the last 14 days for every moment of time.

Algorithm 4 shows the procedure that has been used in order to compute the number of saved infections after applying our distribution method. Initially, we estimate the parameters of model (7) for the whole period of study. However, as the conditions and parameters are different according to different circumstances as the lockdown phases, the use of masks, etc. the parameters are defined and estimated piecewise for each week. This is similar to what is done in Section 4 but now the intervals are fixed in periods of seven days because the purpose of this study is to develop a fully automatic decision system, avoiding thus the necessity to define the intervals with dependence on the government restrictions as lockdown or curfew. We will refer to these parameters as the simulation parameters.

Regarding the settings of the Differential Evolution technique which have been used in this application, a population of N=5 chromosomes has been fixed. Usually, the population size of genetic algorithms is higher but due to the large number of parameters to be estimated this size has been adjusted to the available RAM computer memory. The coefficients K,F(0,1] were randomly generated at each iteration. Finally, the estimation is stopped when a maximum number of 1000 iterations are carried out without any replacement. Our numerical experiments were performed on a PC with a 2.33 GHz Intel Xeon dual core processor, 8.5 GB of RAM, and with the operating system LINUX Ubuntu 18.04.5.

Secondly, for each day t, algorithms 2 and 3 are used for planning the test distribution but computing the gain matrix using only the historical data until t, given that we need to suppose the data of later instants to be unknown. We need to define a matrix D containing the distribution of tests on each location at each moment of time over the whole period. At any instant t we only apply the obtained test distribution for the next day, so the column t+1 in D is replaced by the column t+1 in the matrix D, that is dl,t+1=dl,t+1. We repeat this procedure for each moment of time. We note that when we advance from t to t+1 in order to obtain a new matrix distribution D we need to estimate again all the parameters, as the reported data is now available for one day more. After computing the matrix D we calculate the difference between the estimated infected cases at time M without testing I0(M) and the estimated infected cases after applying our test distribution ID(M), which provides the total number of saved infections.

graphic file with name fx1006_lrg.jpg

We note that in order to obtain the test distribution we employ the estimation of parameters at each iteration, whereas the simulation parameters are used for predicting the model and measuring the effectiveness of testing. Both estimations do not need to have the same values since the first ones are calculated with the information which is available until the current moment of time and the simulation parameters are estimated using the data until the last instant of time. Regarding the cases which are detected from the application of random tests, they are estimated by using expression (6), where, for each location and instant t, α(t) is the number of tests to be applied. Besides, in our computational experiments, the detected cases by testing have been multiplied by a factor with the following values:

Table 1.

Number of infected cases and saved infections with 10,000 tests per day limitation.

# Tests Factor Hom. Inf. Approach Inf. Hom. saving Approach saving Advantage
10,000 1 3,365,384 3,365,783 34 433 399
10,000 3 3,364,519 3,365,716 101 1,298 1197
10,000 9 3,361,923 3,365,514 303 3,894 3591

50,000 1 3,364,824 3,365,666 151 993 842
50,000 3 3,362,819 3,365,365 452 2,998 2546
50,000 9 3,357,442 3,364,462 1,355 8,375 7020

100,000 1 3,364,744 3,365,519 298 1,073 775
100,000 3 3,362,612 3,364,924 893 3,205 2312
100,000 9 3,357,124 3,363,144 2,673 8,693 6020

500,000 1 3,364,334 3,364,346 1,471 1,483 12
500,000 3 3,361,467 3,361,416 4,401 4,350 −51
500,000 9 3,354,169 3,352,700 13,117 11,648 −1469
  • 1, which means that the testing process is fully random and posteriorly there is no COVID contact tracing;

  • 3, if it exists a process of COVID contact tracing which produces an average of 2 additional detected cases from the initially detected case;

  • 9, if besides the COVID contact tracing, testing is focused on towns or groups of determined socio-demographic features, which allows to increase 3 more times the probability of detection.

It is important to take into account that the product of this factor and the number of daily tests αt cannot be greater that the total population of the location.

We have carried out two types of computational experiments for different quantities of available tests. In the first of them, the daily testing capacity is restricted to 10000 tests per day. In the second, the daily testing capacity is restricted to 10% of the total tests to apply in the whole period.

Table 1 shows the results obtained by our experiments. Column #Tests indicates the number of tests to be assigned. These have been: 10, 50, 100 and 500 thousands. So, 12 simulations, corresponding to the four possible numbers of tests and the three factor values indicated in Column Factor, have been carried out. Hom. Inf. and Approach Inf. columns show the estimated infected cases by using the homogeneous distribution and our approach, respectively. On the other hand, Hom. Saving and Approach Saving columns show the number of saved infections with both methods. The number of infected cases from the first of April to the first of June of 2020, including both those detected and not detected, have been estimated in 3,365,817. For this aim, model (7) has been used, estimating its parameters by Algorithm 1 with weekly intervals in which the parameters can be different. No prevalence study was taking into account, so ρ(t) was also estimated for each interval. Then Hom. Saving and Approach Saving were calculated as 3,365,817 minus Hom. Inf. and Approach Inf., respectively. Finally, Column Advantage shows the difference Approach Saving minus Hom. Saving. Therefore, the greater the difference the more advantageous the proposed approach is. On the contrary, negative values mean that the homogeneous distribution overcomes our approach. We can see that the proposed method improves the homogeneous distribution except in a few cases with 500,000 tests. We note that such quantity of tests practically forces a homogeneous distribution in time even with the proposed method. This explains that the results are similar except when tracing and socio-demographic features are taking into account, in which case the homogeneous method clearly leads to better results. It is also important to observe that in this particular case not all the available tests were distributed since 100,000 tests were not assigned.

Regarding the total number of tests which are employed, the more tests the more infections are saved, but the ratio saving/tests is decreasing as it can be seen in Table 2, which shows the number of saved infected cases per hundred applied tests. These ratios have been obtained for both distributions with factor=9. On the one side, the effectiveness of the proposed approach highly decreases with respect to test increments, whereas the effectiveness of the homogeneous distribution is practically independent of the total tests. On the other hand, the effectiveness of the proposed distribution is notoriously higher but finally, due to the loss of its effectiveness when increasing the number of tests, both are similar for 500,000 tests. At last, Table 3 specifies the corresponding distribution for 100,000 tests with factor=1.

Table 2.

Ratios saving/tests with 10,000 tests per day limitation.

# Tests % Homogeneous % Approach
10,000 3.03 38.94
50,000 2.71 16.75
100,000 2.67 8.69
500,000 2.62 2.33

Table 3.

Planning for 100 000 tests.

Day County # Tests
12/04 New York City 10,000
16/04 Rensselaer 10,000
20/04 Delaware 10,000
23/04 Ulster 10,000
24/04 Franklin 10,000
25/04 Cortland 10,000
26/04 Onondaga 10,000
27/04 Fulton 10,000
28/04 Oswego 10,000
29/04 New York City 10,000

Given that imposing a constant capacity of testing per day, which is independent of the total number of tests, is a very strict limitation, we have also carried out experiments in which the capacity per day was limited to 10% of the total number of tests. This computational experience is reported in Table 4. In this case the advantage of the proposed approach is always higher than the homogeneous approach even in a new case with 1,000,000 tests, which illustrates how high quantities of tests are also effective if they are not highly restricted by the daily capacity of testing.

Table 4.

Number of infected cases and saved infections with 10% tests per day limitation.

# Tests Factor Hom. Inf. Approach Inf. Hom. saving Approach saving Advantage
10,000 1 3,365,783 3,365,709 34 108 74
10,000 3 3,365,716 3,365,494 101 323 222
10,000 9 3,365,514 3,364,851 303 966 663

50,000 1 3,365,666 3,365,280 151 537 386
50,000 3 3,365,365 3,364,209 452 1,608 1,156
50,000 9 3,364,462 3,361,026 1,355 4,791 3,436

100,000 1 3,365,519 3,364,744 298 1,073 775
100,000 3 3,364,924 3,362,612 893 3,205 2,312
100,000 9 3,363,144 3,357,124 2,673 8,693 6,020

500,000 1 3,364,346 3,360,256 1,471 5,561 4,090
500,000 3 3,361,416 3,354,512 4,401 11,305 6,904
500,000 9 3,352,700 3,341,480 13,117 24,337 11,220

1,000,000 1 3,362,884 3,355,417 2,933 10,400 7,467
1,000,000 3 3,357,058 3,344,645 8,759 21,172 12,413
1,000,000 9 3,339,880 3,316,234 25,937 49,583 23,646

The saving/test ratios of the distribution with the daily limitation of 10% of the total number of tests are reported in Table 5 for factor=9. The cases with 10,000 and 50,000 tests show again how higher restrictions of the daily capacity reduce effectiveness given that its daily capacity is more restricted and as a consequence its ratio decreases. The cases with 500,000 and 1,000,000 tests illustrate how a higher daily capacity allows us to overcome the homogeneous distribution although the ratio decreases. On the contrary, the ratio of the homogeneous distribution is less variable.

Table 5.

Ratios saving/tests with 10% tests per day limitation.

# Tests % Homogeneous % Approach
10,000 3.03 9.66
50,000 2.71 9.59
100,000 2.67 8.69
500,000 2.62 4.87
1,000,000 2.60 4.96

Table 6 specifies the corresponding distribution planning for 1,000,000 tests and factor=1. The counties of Chenango, Chemung, Delaware, Franklin, Washington and Orleans are fully tested (the inhabitants for each county have been obtained from Department of Health of N.Y.S. (2010)). Note that for 100,000 tests the distribution planning is the same as the one reported in Table 3.

Table 6.

Planning for 1,000,000 tests.

Day County # Tests
12/04 Nassau 100,000
17/04 Oswego 100,000
19/04 Oneida 100,000
21/04 Ontario 100,000
22/04 Chemung 88,830
23/04 Nassau 100,000
24/04 Washington 63,216
25/04 Franklin 51,599
26/04 Delaware 47,980
26/04 Orleans 42,883
27/04 Chenango 50,477
28/04 Westchester 100,000
29/04 Rensselaer 55,015

7. Conclusions

In this work a SEIR model for analysing the efficiency of test distributions has been introduced. It contemplates both detected and non-detected infected individuals in order to measure the impact of testing. Since the values of the parameters of the model can change abruptly due to severe governments measures like lockdown, curfew, etc., the coefficients of the model are defined piecewise in given intervals of time and are functions of time. This model has been applied to the spreading of the COVID pandemic in Spain. Besides, we have theoretically proved how massive testing helps reducing the number of infected people in the long-term.

Secondly, we describe the Differential Evolution technique, which is a genetic algorithm for the estimation of parameters, and develop a heuristic approach for distributing tests. This approach have been applied to the spreading of COVID pandemic in the New York counties by an extensive computational experience showing the advantages of the proposed distribution method. Also, an interesting future research line is to adapt a similar distribution approach to the distribution of vaccines.

CRediT authorship contribution statement

José L. Sainz-Pardo: Conceptualization, Methodology, Software, Data curation, Writing – original draft, Writing – review & editing, Visualization, Supervision. José Valero: Conceptualization, Methodology, Software, Data curation, Writing – original draft, Writing – review & editing, Visualization, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

We would like to thank the reviewer for his/her useful remarks.

This work has been supported by the Generalitat Valenciana (Spain), project 2020/NAC/00022. The first author has also been partially supported by the Spanish Ministry of Science, Innovation and Universities, project PGC2018-099428-B-I00. The second author has also been partially supported by the Spanish Ministry of Science, Innovation and Universities, project PGC2018-096540-B-I00, the Spanish Ministry of Science and Innovation, project PID2019-108654GB-I00, and by Junta de Andalucía (Spain) and FEDER , projects P18-FR-2025 and P18-FR-4509.

References

  1. Annas S., b M.I.P., Rifandi M., Sanusi W., Side S. Stability analysis and numerical simulation of SEIR model for pandemic COVID-19 spread in Indonesia. Chaos, Solitons & Fractals. 2020;139 doi: 10.1016/j.chaos.2020.110072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Arcede J., Caga-anan R., Mentuda C., Mammeri Y. Accounting for symptomatic and asymptomatic in a SEIR-type model of COVID-19. Mathematical Modelling of Natural Phenomena. 2020;15 [Google Scholar]
  3. Berger D., Herkenhoff K., Mongey S. 2020. SEIR infectious disease model with testing and conditional quarantine: NBER Working Paper No. 26901. [Google Scholar]
  4. Brauer F., Castillo-Chávez C. Springer; New-York: 2012. Mathematical models in population biology and epidemiology. [Google Scholar]
  5. Britton T., Ball F., Trapman P. 2005. The disease-induced herd immunity level for Covid-19 is substantially lower than the classical herd immunity level. arXiv, 2005.03085. [Google Scholar]
  6. Chen Y., Lu P., Chang C., Liu T. A time-dependent SIR model for COVID-19 with undetectable infected persons. IEEE Transactions on Network Science and Engineering. 2020 doi: 10.1109/TNSE.2020.3024723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chen T., Rui J., Wang Q., Zhao Z., Cui J., Yin L. A mathematical model for simulating the phase-based transmissibility of a novel coronavirus. Infectious Diseases of Poverty. 2020;9 doi: 10.1186/s40249-020-00640-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Data, U.C.-. [Database] The New York Times. 2020 Retrieved from https://github.com/nytimes/covid-19-data/blob/master/us-counties.csv. [Google Scholar]
  9. Department of Health of N.Y.S. 2010. Population, land area, and population density by county, New York state - 2010. https://www.health.ny.gov/statistics/vital_statistics/2010/table02.htm. [Google Scholar]
  10. Eichenbaum M.S., Rebelo S., Trabandt M. NBER working paper. The Macroeconomics of Testing and Quarantining. 2020;27104 [Google Scholar]
  11. Falco I.D., Cioppa A.D., Scafuri U., Tarantino E. 2020. Coronavirus covid–19 spreading in Italy: optimizing an epidemiological model with dynamic social. arXiv, 2004.00553. [Google Scholar]
  12. Gomes M., Corder R., King J., K.E. Langwig C.S.-M., Carneiro J., Gonçalves G., Penha-Gonçalves C., Ferreira M., Aguas R. Individual variation in susceptibility or exposure to SARS-CoV-2 lowers the herd immunity threshold. MedRxiv. 2020 doi: 10.1016/j.jtbi.2022.111063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gutierrez J., Varona J. 2020. Analisis de la posible evolución de la epidemia de coronavirus COVID-19 por medio de un modelo SEIR. https://belenus.unirioja.es/jvarona/coronavirus/SEIR-coronavirus.pdf. [Google Scholar]
  14. He S., Peng Y., Sun K. SEIR modeling of the COVID-19 and its dynamics. Nonlinear Dynamics. 2020;101:1667–1680. doi: 10.1007/s11071-020-05743-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Instituto de Salud Carlos III . 2020. Estudio ENE-COVID: Informe Final. Estudio Nacional de sero-epidemiología de la infección por SARS-COV-2 en España. https://portalcne.isciii.es/enecovid19. [Google Scholar]
  16. Iorio, A., & Li, X. (2006). Incorporating directional information within a Differential Evolution algorithm for multi-objective optimization. In Proceeding of the genetic and evolutionary computation conference 2006 (pp. 691–697).
  17. Iorio A., Li X. 2020. Coronavirus Covid–19 spreading in Italy: optimizing an epidemiological model with dynamic social distancing through differential evolution. arXiv, 2004.00553. [Google Scholar]
  18. Ji C., Jiang X. The extinction and persistence of a stochastic SIR model. Advances in Difference Equations. 2017;2017 doi: 10.1186/s13662-021-03347-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Jiao J., Liu Z., Cai S., Dynamics Dynamics of a SEIR model with infectivity in incubation period and homestead-isolation on the susceptible. Applied Mathematics Letters. 2020;107 doi: 10.1016/j.aml.2020.106442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Kotyrba M., Volna E., Bujok P. Unconventional modelling of complex system via cellular automata and differential evolution. Swarm and Evolutionary Computation. 2015;25:52–62. [Google Scholar]
  21. Kuniya T., Nakata Y. Permanence and extinction for a nonautonomous SEIRS epidemic model. Applied Mathematics and Computation. 2012;218:9321–9331. [Google Scholar]
  22. Kwuimy C.A.K., Nazari F., Jiao X., Rohani P., Nataraj C. Nonlinear dynamic analysis of an epidemiological model for COVID-19 including public behavior and government action. Nonlinear Dynamics. 2020;101:1545–1559. doi: 10.1007/s11071-020-05815-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Lauer S., Grantz K., Q. Bi F.J., Zheng Q., Meredith H. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: Estimation and application. Annals of Internal Medicine. 2019;172:577–582. doi: 10.7326/M20-0504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Li R., Dong L., Zhang J., Wang X., Wang X., Di Z., Stanley H. Simple spatial scaling rules behind complex cities. Nature Communications. 2017;8:1841. doi: 10.1038/s41467-017-01882-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Li M., Muldowney J. Global stability for the SEIR model in epidemiology. Mathematical Biosciencies. 1995;125:155–164. doi: 10.1016/0025-5564(95)92756-5. [DOI] [PubMed] [Google Scholar]
  26. Li R., Pei S., Chen B., Song Y., Zhangf T., Yang W., Shaman J. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2) Science. 2020;368:489–493. doi: 10.1126/science.abb3221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Li R., Richmond P., Roehner B. Effect of population density on epidemics. Physica A. Statistical Mechanics and its Applications. 2018;510:713–724. [Google Scholar]
  28. Li R., Wang W., Di Z. Effects of human dynamics on epidemic spreading in Côte d’Ivoire. Physica A. Statistical Mechanics and its Applications. 2017;467:30–40. [Google Scholar]
  29. Lin Q., Zhao S., Gao D., Lou Y., Yang S., Musa S., Wang M., Cai Y., Wang W., Yang L., He D. A conceptual model for the coronavirus disease 2019 (COVID-19) outbreak in Wuhan, China with individual reaction and governmental action. International Journal of Infectious Diseases. 2020;93:211–216. doi: 10.1016/j.ijid.2020.02.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Mushayabasa S., Ngarakana-Gwasira E., Mushanyu J. On the role of governmental action and individual reaction on COVID-19 dynamics in South Africa: A mathematical modelling study. Informatics in Medicine Unlocked. 2020;20 doi: 10.1016/j.imu.2020.100387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ndaïrou F., Area I., Nieto J., Torres D. 2020. Mathematical modeling of COVID-19 transmission dynamics with a case study of wuhan. arXiv, 2004.10885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Niazi M., Kibangoua A., de Wita C.C., Nikitina D., Tumasha L., Bliman P. 2020. Modeling and control of COVID-19 epidemic through testing policies. arXiv, 2010.15438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Rai B., Shukla A., Dwivedi L. Incubation period for COVID-19: a systematic review and meta-analysis. Journal of Public Health (Berlin): From Theory to Practice. 2021 doi: 10.1007/S10389-021-01478-1. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Roda W., Varughese M., D. Han M.L. Why is it difficult to accurately predict the COVID-19 epidemic? Infectious Disease Modelling. 2020;5:271–281. doi: 10.1016/j.idm.2020.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Romagnani P., Gnone G., Guzzi F., Negrini S., Guastalla A., Annunziato F., Romagnani S., Palma R.D. The COVID-19 infection: lessons from the Italian experience. Journal of Public Health Policy. 2020:1–7. doi: 10.1057/s41271-020-00229-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Sauter T., Pacheco M. Testing informed SIR based epidemiological model for COVID-19 in Luxembourg. MedRxiv. 2020;2020 [Google Scholar]
  37. Storn, R. (1996). On the usage of differential evolution for function optimization. In Proceedings of the 1996 biennial conference of the North American fuzzy information processing society (pp. 519–523).
  38. Tang B., Bragazzi N., Li Q., Tang S., Xiao Y., Wu J. Updated estimation of the risk of transmission of the novel coronavirus (2019-nCov) Infectious Disease Modelling. 2020;5:248–255. doi: 10.1016/j.idm.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Tang B., Wang X., Li Q., Bragazzi N., Tang S., Xiao Y., Wu J. Estimation of the transmission risk of the 2019-nCoV and its implication for public health interventions. Journal of Clinical Medicine. 2020;9 doi: 10.3390/jcm9020462. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Ubaru S., Horesh L., Cohen G. 2020. Dynamic graph based epidemiological model for COVID-19 contact tracing data analysis and optimal testing prescription. arXiv, 2009.04971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. van den Driessche P., Watmough J. Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Mathematical Biosciences. 2002;180:29–48. doi: 10.1016/s0025-5564(02)00108-6. [DOI] [PubMed] [Google Scholar]
  42. Wang Y. 2020. How testing reduces infections and prevents herd immunity: a SIR model with testing. https://ssrn.com/abstract=3577765. [Google Scholar]
  43. Wang L., Zeng Y., Chen T. Back propagation neural network with adaptive differential evolution algorithm for time series forecasting. Expert Systems with Applications. 2015;42:855–863. [Google Scholar]
  44. Wu J., Leung K., Leung G. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet. 2020;395:689–697. doi: 10.1016/S0140-6736(20)30260-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Xu C., Yu Y., Chen Y., Lu Z. Forecast analysis of the epidemics trend of COVID-19 in the USA by a generalized fractional-order SEIR model. Nonlinear Dynamics. 2020;101:1621–1634. doi: 10.1007/s11071-020-05946-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Yi W., Zhou Y., Gao L., Li X., Mou J. An improved adaptive differential evolution algorithm for continuous optimization. Expert Systems with Applications. 2016:1–12. [Google Scholar]
  47. Zhang T., Teng Z. On a nonautonomous SEIRS model in epidemiology. Bulletin of Mathematical Biology. 2007;69:2537–2559. doi: 10.1007/s11538-007-9231-z. [DOI] [PubMed] [Google Scholar]
  48. Zhao Y., Jiang D., O’Regan D. The extinction and persistence of the stochastic SIS epidemic model with vaccination. Physica A: Statistical Mechanics and its Applications. 2017;392:4916–4927. [Google Scholar]

Articles from Expert Systems with Applications are provided here courtesy of Elsevier

RESOURCES