On an interval prediction of COVID-19 development based on a SEIR epidemic model

Denis Efimov; Rosane Ushirobira

doi:10.1016/j.arcontrol.2021.01.006

. 2021 Feb 18;51:477–487. doi: 10.1016/j.arcontrol.2021.01.006

On an interval prediction of COVID-19 development based on a SEIR epidemic model

Denis Efimov ¹, Rosane Ushirobira ^1,^⁎

PMCID: PMC7891093 PMID: 33623479

Abstract

In this paper, a new version of the well-known epidemic mathematical SEIR model is used to analyze the pandemic course of COVID-19 in eight different countries. One of the proposed model’s improvements is to reflect the societal feedback on the disease and confinement features. The SEIR model parameters are allowed to be time-varying, and the ranges of their values are identified by using publicly available data for France, Italy, Spain, Germany, Brazil, Russia, New York State (US), and China. The identified model is then applied to predict the SARS-CoV-2 virus propagation under various conditions of confinement. For this purpose, an interval predictor is designed, allowing variations and uncertainties in the model parameters to be taken into account. The code and the utilized data are available on Github.

Keywords: COVID-19, Epidemic model, Parameter Identification, Interval predictor

1. Introduction

The SEIR model is one of the simplest compartmental models of epidemics (Keeling & Rohani, 2008). It is a very popular model and is extensively used in various settings (Wang et al., 2016). The SEIR model represents the development of the relative proportions of four classes of individuals in a population of constant size: the susceptible individuals $S$ , capable of contracting the disease and becoming infectious; the asymptomatic (or exposed) $E$ and symptomatic $I$ infectious, capable of giving the disease to susceptible; and the recovered $R$ , permanently immune after healing or dying (if the number of deaths is of particular interest, then an additional compartment $D$ can be included). This simple model depicts a generic behavior of epidemics (as a series of transitions between these compartments), and a related advantage consists of a small number of parameters to be identified (three transition rates $σ$ , $γ$ , and $b$ ). This latter is an essential point in a virus attack when an insufficient amount of data is available. In May 2020, when the present paper was written, that was mainly the situation worldwide under the SARS-CoV-2 virus’s presence.

There exist many sorts and varieties of SEIR models (Keeling & Rohani, 2008) (e.g., in the most simplistic case, the classes $E$ and $I$ are modeled at once, leading to a SIR model). A specificity of COVID-19 pandemics is the global confinement imposed by most countries worldwide, influencing the virus dynamics (Das, Ghosh, Sen, & Mukhopadhyay, 2020). In recent literature, numerous approaches propose how to reflect the confinement characteristics in the mathematical models (Dandekar and Barbastathis, 2020, Lopez and Rodo, 2020, Nussbaumer-Streit et al., 2020). In Efimov and Ushirobira (2020), we propose a slightly similar SEIR model to analyze the course of SARS-CoV-2 in France.

This work aims to use a novel SEIR model to predict the outbreak development with different quarantine restrictions. Our preliminary attempts to identify such model parameters confirmed that their constancy hypothesis is very restrictive, motivating us to consider time-varying parameters (not much analyzed in the literature). An interval predictor is then designed to realize an efficient and reliable prediction for a SEIR model with time-varying parameters, whose set-membership forecasting abilities perfectly suit the considered scenario. The stability of the predictor and its inclusion capabilities are analytically evaluated. The performance of the proposed approach is shown in numerical experiments for some countries.

The plan of this paper is as follows. The new modified SEIR epidemic model is presented in Section 2, together with an analysis of the model parameters and their admissible values ranges, found in the literature. In Section 3, we describe the measured data applied for the parameter identification and some hypotheses used in the sequel (we fix the values of some parameters having a “physical” meaning in order to be able to identify the remaining ones). The method for parameter identification is presented in Section 4. An interval predictor is designed in Section 5, allowing us to evaluate the present situation under the variation of parameters and initial states. The application results of the proposed identification routine and the interval predictor are given in Section 6 for France, Italy, Spain, Germany, Brazil, Russia, New York State (US), and China. The accuracy of the interval prediction is also evaluated using data for identification and another part for verification. Final discussions and remarks are provided in Section 7.

2. Epidemic model and considerations

This paper proposes a modified SEIR discrete-time model based on the one in Yang et al. (2020), where it has been used to model the course of the epidemic of COVID-19 in China (other similar SIR/SEIR-type models used recently for modeling SARS-CoV-2 virus can be found in Ferguson et al., 2020, Gevertz et al., 2020, Lourenco et al., 2020, Maier and Brockmann, 2020, Peng et al., 2020). The model we propose in this work is as follows (the impact of the natural birth and mortality is not considered, since, for the short period of analysis studied here, the population may be assumed quasi-constant):

S_{t + 1} = S_{t} - b \frac{(p_{t - τ_{p}} I_{t} + r_{t - τ_{r}} E_{t})}{N} S_{t},

(1a)

E_{t + 1} = (1 - σ - σ^{'}) E_{t} + b \frac{(p_{t - τ_{p}} I_{t} + r_{t - τ_{r}} E_{t})}{N} S_{t},

(1b)

I_{t + 1} = (1 - γ - μ) I_{t} + σ E_{t},

(1c)

R_{t + 1} = R_{t} + γ I_{t} + σ^{'} E_{t},

(1d)

D_{t + 1} = D_{t} + μ I_{t},

(1e)

Open in a new tab

where $t \in N$ (the set of non-negative integers) is the time counted in days ( $t = 0$ corresponds to the beginning of measurements or prediction), $N = S + E + I + R + D$ denotes the total population, the parameter $0 < γ < + \infty$ represents the recovery rate, $0 < μ < + \infty$ is the mortality rate, the parameter $0 < b < + \infty$ corresponds to the rate of the virus transmission from infectious/exposed to susceptible individuals during a contact, $0 < σ, σ^{'} < + \infty$ are the incubation rates at which the exposed develop symptoms or directly become recovered without a viral indication, $0 \leq p_{t} < + \infty$ corresponds to the number of contacts for the infectious $I$ (it is supposed that infected people with symptoms are in quarantine, then the number of contacts is decreased), $p_{t} \leq r_{t} < + \infty$ is the number of contacts per person per day for the exposed population $E$ (in the presence of confinement and depending on its severity, this number is time-varying), and $τ_{p}, τ_{r} > 0$ are the delays in the reactions of the compartments on variations of quarantine conditions (we assume that if $t < τ_{p}$ or $t < τ_{r}$ , then $p_{t - τ_{p}} = p_{0}$ or $r_{t - τ_{r}} = r_{0}$ , respectively). Compared to the model in Yang et al. (2020), the inflow/outflow variables from/to other regions for each state are not considered in our analysis.

In the model (1), for the brevity of introduction, we assume that the parameters $σ$ , $σ^{'}$ , $γ$ , $μ$ and $b$ have constant values, and we revisit this hypothesis later.

2.1. Societal feedback and confinement influence in the model

To consider society’s reaction to confinement and virus propagation, we introduce the delays $τ_{p}$ and $τ_{r}$ in the seclusion inputs $p_{t}$ and $r_{t}$ , respectively.

The idea behind $τ_{r}$ is that after the quarantine activation, several days pass before changes in the disease propagation become detectable (such an effect can be easily observed in the data for all analyzed countries). Roughly speaking, the increase in the number of infected individuals $E$ and $I$ is predefined by the number of contacts in the previous days, when the confinement was not yet imposed, for example.

We assume that during the phase of active lockdown, $r_{t - τ_{r}} = p_{t - τ_{p}}$ always holds, i.e., the number of contacts for asymptomatic $E$ and symptomatic $I$ infected populations is the same (when the society follows Governments requirements).

The delay $τ_{p}$ is used to model the clustering effect of the confinement: under restrictions on displacement activities, people are compelled to stay in their neighborhood and visit a limited number of attractions (such as shops, pharmacies, hospitals). So the population can be considered to be divided into smaller groups. After some time the chances to meet an infected person start to decay (e.g., there is no infected person in such a group, or the individual was isolated, or the whole group can be infected, but in any case, the virus propagation is almost stopped).

Remark 1

A different way of including societal feedback on the current SARS-CoV-2 virus development is the substitution:

$b ⟶ \frac{b}{1 + η I_{t}},$

where $0 < η < + \infty$ is a tuning parameter. In this case, we model the effect of natural augmentation of confinement strictness. Many factors can lead to this increase; for instance, society becomes aware of the problem following the increased number of infected or dead people (the variable $I$ implicitly represents them, or it can also be explicitly replaced with $D$ ). To this end, we decrease the virus transmission rate $b$ with the growth of the number of infected/dead individuals. This variant has been tested, but we prefer to use the delays $τ_{p}$ and $τ_{r}$ since, in this case, the parameter identification is more straightforward.

Compared to our proposed model, the main shortcoming of other models in the literature is that they do not consider the societal feedback and delays in their computation. The countries examined in the present paper have adopted different policies all through the pandemics, and to consider such factor seems indeed quite valuable.

2.2. Model parameters

Therefore, the SEIR model (1) has seven parameters to be identified or assigned: $σ$ , $σ^{'}$ , $τ_{p}$ , $τ_{r}$ , $γ$ , $μ$ and $b$ .

2.2.1. Generic observations

The parameters $σ$ , $σ^{'}$ , $γ$ , $μ$ and $b$ represent, respectively, the rate of changes between the states $E$ to $I$ , $E$ to $R$ , $I$ to $R$ , $I$ to $D$ and $S$ to $E$ (as in Fig. 1). The parameters $σ$ and $σ^{'}$ have a physical meaning: $σ = \frac{1}{T_{S}}$ and $σ^{'} = \frac{κ}{T_{S}}$ , where $T_{s}$ is the average duration of the virus incubation period after contamination, which can be well identified in patients, and $κ \in [0, 1)$ is the ratio of recovering period for the patients with the mild form of COVID-19, which can also be found in sufferers. Similarly, the delays $τ_{r}$ and $τ_{p}$ are of order $T_{s}$ , and have a natural origin. The numbers of contacts in $p_{t}$ and $r_{t}$ (with or without (relaxed) confinement) can be evaluated heuristically based on the population density and social practices (for prediction, different profiles can be selected for testing).

2.2.2. Known or accepted quantities

The incubation period $T_{s}$ that is widely papered in the literature for COVID-19 studies, is considered to be between $2$ and $14$ days (Yang et al., 2020), or in more specialized research, between $2$ and $12$ days (Lauer et al., 2020), so we assume

\frac{1}{12} \leq σ \leq \frac{1}{2} .

It also implies that the delays can be selected in the corresponding limits:

2 \leq τ_{r} \leq 12, τ_{r} + 2 \leq τ_{p},

where the condition $τ_{p} > τ_{r}$ entails that the clustering starts to be important after the effect of confinement becomes significant (adding an incubation period).

The numbers of contacts have to be selected separately for each country. For example, we may take the values of Yang et al. (2020) and make some reduction related with a smaller population density in the considered countries:

p_{Q} = 3 (number of contacts in quarantine),

p_{N} = 15 (number of contacts in normal mode),

p_{R} = 10 (number of contacts in relaxed quarantine),

p_{C} = 0.1 (number of contacts under clustering).

Then the input $p_{t} \in {p_{Q}, p_{C}}$ and $r_{t} \in {p_{Q}, p_{N}, p_{R}, p_{C}}$ for all $t \in N$ .

The identification of the model parameters may be performed using statistics published by authorities.1 As a worthy remark, many research works devoted to the estimation and identification of SIR/SEIR models were developed by now, and several in the last few years, such as Bliman et al., 2018, Cantó et al., 2017, d’Onofrio et al., 2012, Magal and Webb, 2018 and Ushirobira, Efimov, and Bliman (2019), to mention a few.

2.3. Uncertainty and prediction

Since the measured data and parameters contain numerous uncertainties and perturbations, it is challenging to carry out a reasonable prediction based on the simulation of such a model with fixed parameters (also considering the model simplicity and generality). However, the interval predictor and observer framework (Efimov and Raïssi, 2016, Gouzé et al., 2000, Mazenc and Bernard, 2011, Mazenc et al., 2014, Raïssi et al., 2012) allows a set of trajectories corresponding to the interval values of parameters and inputs to be obtained, increasing the model validity without augmenting its complexity. This approach has already been applied to different SEIR models (see, e.g., Aronna and Bliman, 2018, Degue et al., 2016, Degue and Le Ny, 2018). In this paper, we apply the interval predictor method for the considered SEIR model (1) to improve its forecasting quality by assuming that the parameters $σ$ , $σ^{'}$ , $γ$ , $μ$ and $b$ are time-varying.

Remark 2

It is essential to emphasize that the interval predictor framework used here is not the only method oriented toward improving prediction reliability when using SEIR models. Usually, as in Ferguson et al., 2020, Hu et al., 2020, Lourenco et al., 2020, Maier and Brockmann, 2020, Peng et al., 2020 and Yang et al. (2020), stochastic and agent-based simulation procedures are used. In those cases, by assuming that the parameters and initial conditions are distributed with some given probability, multiple numerical experiments are done to restore the system’s possible trajectories. Such a methodology needs more computational effort for its realization. Additional information on the probability distribution for all parameters and variables is necessary, demanding either extra hypotheses or more measured data for estimation. As the SARS-CoV-2 virus attack currently demonstrates, it is difficult to obtain such data quickly during the epidemic development. Contrarily to these approaches, the interval predictor method does not use these extra assumptions on probability distributions. It has also been proposed to estimate a guaranteed interval, including trajectories with minimal computational effort, by the cost of a more complex mathematical analysis and design (Efimov & Raïssi, 2016).

3. Used dataset and associated parameters

Let $ℐ$ , $D$ , and $ℛ$ represent the number of total detected infected, deceased and recovered individuals, respectively (these information are published by authorities). Not all cases can be detected and documented by public health services, so there is a ratio between populations $I$ and $ℐ$ , $R$ and $ℛ$ , $D$ and $D$ , which is denoted in this work by $α$ . The interval of admissible values for $α$ is estimated from different sources as follows2 :

1 \leq α \leq 25 .

Formally, such a ratio $α$ has to be time-varying and different for $I$ , $D$ and $R$ . Due to strict and similar requirements of health services in almost all considered countries, in this paper, we take the following hypotheses:

I_{t} = α_{1} (ℐ_{t} - D_{t} - ℛ_{t}), R_{t} = α_{2} ℛ_{t}, D_{t} = α_{3} D_{t},

(2)

i.e., the number of active infected cases and the related recovered individuals can be masked due to the complexity of examination and the actual confirmation of the virus presence. At the same time, the availability or not of post-mortem tests can influence the number of registered deaths. A further reason is that in many cases, the virus symptoms result in a mild reaction of patients (approximately 80% of cases, see the sources above), hence maybe with no official virus confirmation in such a situation. In this work, we assume the following values for these parameters:

α_{2} = α_{1}, α_{3} = 1,

then, roughly speaking, such a choice corresponds to the registration of deaths exactly (see also Lourenco et al., 2020) with the same error for recovered and infected individuals (the exclusion was made only for the US). CMMID describes a technique to identify $α_{1}$ from the measurements of $ℐ$ , $ℛ$ and $D$ (see the footnote) giving for France (in July 30th):

α_{1} = 1.78 .

So, by fixing $α_{1}$ , $α_{2}$ , and $α_{3}$ ,3 the three variables of the model (1), $I$ , $D$ , and $R$ , are available from the beginning of the epidemics via (2).

Remark 3

The measured information used in the paper are $I$ , $R$ , and $D$ from (2), where the measurement noise can be modeled by time-varying gains $α_{i}$ , $i = 1$ , $2$ , $3$ , representing the different actual values of populations in these compartments. Such noise characteristics are in general unknown (country dependent), and it is difficult to estimate them during the outbreak. However, if we assume that the noise is bounded, then instead of the exact values of $I$ , $R$ , and $D$ , their intervals have to be considered, $[\underline{I}, \bar{I}]$ , $[\underline{R}, \bar{R}]$ , and $[\underline{D}, \bar{D}]$ , corresponding to possible true values of these variables. Using such intervals would lead to interval estimates for parameters (with the methods applying below). To simplify the presentation and the computations, it is assumed in this work that the measured quantities in (2) are noise-free, resulting in the identification of guess values for the parameters. Finally, for prediction, the intervals around the guesses are calculated for all initial conditions, parameters and inputs, which takes into account the presence of the noise in (2) and other uncertainties or complexity effects.

3.1. Fixed values of parameters

Note that model (1) is not identifiable with respect to all seven parameters simultaneously for the given set of measured outputs ( $I$ , $R$ , and $D$ ) and inputs ( $p$ and $r$ ). Hence, it is necessary to fix the values of some of them, those with a physical meaning, for instance, and reconstruct the sets of admissible values for others. To this end, we select an average value for the incubation rate:

σ = \frac{1}{7}

to simplify further identification (the variation in this value can be taken into account later in the interval predictor), then

σ^{'} = κ σ, κ = 0.1,

and we assume that there is a very slow transfer from exposed $E$ to recovered $R$ directly without symptom exposition. The delays’ nominal values are chosen as

τ_{r} = 5, τ_{p} = τ_{r} + 20,

and the algorithm for their identification is discussed below. The procedure for identifying $γ$ , $μ$ , and $b$ is also given in the next section.

3.2. Scenario of confinement

In Ferguson et al. (2020), the theory of a cyclic application of quarantine regimes of different severity is evaluated for COVID-19. By iterating the periods of complete isolation for everybody (suppression), which decelerates the virus advancement, with a time of mild regulation (mitigation), which allows the economy balance to be maintained on an arguable level, and when only fragile parts of the population are isolated, it is possible to attenuate the material consequences of epidemics while decreasing the load on health services. Following this idea, for simulation, we consider a cyclic scenario of confinement (e.g., with $8$ weeks of strict quarantine and $4$ weeks of a relaxed one), which is further periodically repeated. For the chosen model, this scenario impact only the input variables $p_{t - τ_{p}}$ and $r_{t - τ_{r}}$ , an example of their behavior is shown in Fig. 2 (by red dash and blue solid lines, respectively).

Fig. 2 — Variation of the number of contacts $p_{t - τ_{p}}$ and $r_{t - τ_{r}}$ .

Remark 4

In other words, $r_{t}$ and $p_{t}$ can be considered as a sort of control for the virus propagation, by imposing different periods and strictness levels for the confinement for compartments $I$ and $E$ . A more detailed analysis may also take into account age or geographic distribution.

4. Parameter identification

In this section, we assume that the parameters have constant values, which allows us to apply efficient methodologies for their identification. Next, we use these values as the nominal or average quantities passing to time-varying parameters.

For the parameter identification, we assume that the incubation rates $σ$ and $σ^{'}$ are fixed as above and that the symptomatic infectious $I_{t}$ , the dead $D_{t}$ , and the recovered $R_{t}$ persons are measured for the first $J > 0$ days of the virus attack as in (2) for $t = 0, 1, \dots, J$ .

We begin by discussing approaches to the identification of the delays $τ_{p}$ and $τ_{r}$ . Then, the method for identifying the mortality rate $μ$ , the recovery rate $γ$ , and the infection rate $b$ is presented. Finally, the model (1) with the parameters’ obtained values is validated by simulations in Section 6.

4.1. Delay identification

We propose two approaches for the estimation of $τ_{p}$ and $τ_{r}$ .

4.1.1. Method 1

From the dynamics of (1b), the increment of $E_{t}$ (i.e., $E_{t + 1} - E_{t}$ ) is directly proportional to $p_{t - τ_{p}}$ and $r_{t - τ_{r}}$ . The number of contacts $r_{t - τ_{r}}$ instantaneously changes its value after the imposition of the quarantine (it jumps from $p_{N}$ to $p_{Q}$ ). Since $τ_{p} > τ_{r}$ and $p_{t - τ_{p}} = r_{t - τ_{r}}$ in confinement, the signals $p_{t - τ_{p}}$ and $r_{t - τ_{r}}$ jump next from $p_{Q}$ to $p_{C}$ , and the same occurs after the suppression of the confinement (from $p_{C}$ to $p_{Q}$ or $p_{R}$ ), see Fig. 2. It implies that the increment of $E_{t}$ shows discontinuities in these time instants. The variable $E_{t}$ is not available for measurements, but the same (filtered) behavior is also observed in the increment of the variable $I_{t}$ . Since both variables, $I_{t}$ and $E_{t}$ in (1) have an exponential rate of changes, then the signal

d I_{t} = ln (I_{t}) - ln (I_{t - 1})

for $t = 2, \dots, J$ should have a step-like form (the logarithm of the increment of an exponentially growing or decaying signal is a constant) with the change of value in the time instant $t_{c} \geq 2$ , when a modification of the confinement rules starts to influence the variable $I_{t}$ . Therefore, the delay can be estimated as (with a mild ambiguity in this work, we use the same symbol to denote a parameter and its estimate)

τ_{r} = t_{c} - t^{'},

where $t^{'} \geq 0$ is the instant of application of the new confinement rule. Hence, to estimate the value $t_{c}$ , the following algorithm is proposed:

t_{c} = \underset{t = 3, \dots, J}{arg min} \sqrt{\sum_{ℓ = 2}^{J} {| d I_{ℓ} - d I_{ℓ}^{t} |}^{2}}, where d I_{ℓ}^{t} = \{\begin{matrix} \frac{1}{t - 2} \sum_{s = 2}^{t - 1} d I_{s} & , if ℓ < t \\ \frac{1}{J - t + 1} \sum_{s = t}^{J} d I_{s} & , if ℓ \geq t \end{matrix})

is a step-like varying signal, which jumps at the instant $t$ . This approach’s main drawback is the noise in the measurements (as for any approach that indirectly uses a derivative estimation).

Remark 5

Note that if the values of $γ$ and $μ$ are known (see below how we can estimate them), then using (1c) the variable $E_{t} = \frac{1}{σ} (I_{t + 1} - (1 - γ - μ) I_{t})$ can be reconstructed from the measurements, and the same approach can be applied to the increment $d E_{t} = ln (E_{t}) - ln (E_{t - 1})$ , which explicitly depends on $p_{t - τ_{p}}$ and $r_{t - τ_{r}}$ . Unfortunately, we have very noisy data for COVID-19, so the calculated variables $E_{t}$ contain many perturbations, and the above (derivative-based) approach does not provide a reliable estimation using $d E_{t}$ .

4.1.2. Method 2

This method also uses the estimated values of $E_{t}$ (see (3) for the detailed description), but it does not use (approximated) derivatives. The idea of this approach is based on the observation that a straight line can approximate $ln (E_{t})$ (the variable $E_{t}$ is exponentially growing) for any constant values of $p_{t - τ_{p}}$ and $r_{t - τ_{r}}$ :

ln (E_{t}) = a t + b,

for some $a, b \in R$ . Such an approximation filters the noise contrarily to the derivative-based method presented in the previous subsection. Then the initial phase of the epidemics can be decomposed on three intervals of time:

T_{1} = [0, T_{1} + τ_{r}), T_{2} = [T_{1} + τ_{r}, T_{1} + τ_{p}), T_{3} = [T_{1} + τ_{p}, T_{2}],

where $T_{1}$ is the day of confinement activation, $T_{2}$ is the day of commutation to the relaxed quarantine, and on each interval

ln (E_{t}) = a_{i} t + b_{i},

for $t \in T_{i}$ and some coefficients $a_{i}, b_{i} \in R$ with $i = 1, 2, 3$ , is a reliable approximation. The coefficients $a_{i}, b_{i}$ can be calculated using the Least Square Method (LSM), or any other approach of solving this system of linear equations with known reconstructed values of $E_{t}$ . Next, we can calculate the instants of these lines intersection:

τ_{r} = \frac{a_{2} b_{1} - a_{1} b_{2}}{a_{2} - a_{1}} - T_{1}, τ_{p} = \frac{a_{3} b_{2} - a_{2} b_{3}}{a_{3} - a_{2}} - T_{1} .

Note that the intervals $T_{i}$ , $i = 1, 2, 3$ are unknown (their definitions depend on the values of $τ_{p}$ and $τ_{r}$ ), then we can introduce two tuning parameters $Z \in (0, τ_{r})$ and $J_{Z} \in (0, J)$ such that

{\hat{T}}_{1} = [0, Z), {\hat{T}}_{2} = [J_{Z} - Z, J_{Z}), {\hat{T}}_{3} = [J - Z, J]

are the estimates for $T_{1}$ , $T_{2}$ and $T_{3}$ , respectively, which are utilized for calculation of $a_{i}, b_{i}$ . These auxiliary parameters can be rather easily selected having the plot of $ln (E_{t})$ in sight.

This method provides rather good guesses for $τ_{p}$ and $τ_{r}$ , as we demonstrate at the end of this section. In general, these estimates are very sensitive to the noise.

4.2. Rates identification

From Eq. (1e), we can identify the value of the mortality rate $μ$ :

μ = \frac{D_{t + 1} - D_{t}}{I_{t}},

whose LSM estimation is

μ_{k} = \frac{\sum_{t = 0}^{J - k - 1} I_{t} (D_{t + 1} - D_{t})}{\sum_{t = 0}^{J - k - 1} I_{t}^{2}}

for $k = 0, 1, \dots, K$ , where $0 < K < J - 1$ is the number of the last days used for identification (in this work we selected $K = J - 10$ ). Another possible approach is the moving window estimation:

μ_{k} = \frac{\sum_{t = k}^{k + K_{w}} I_{t} (D_{t + 1} - D_{t})}{\sum_{t = k}^{k + K_{w}} I_{t}^{2}}

for $k = 0, 1, \dots, K$ with $K = J - K_{w}$ , where $K_{w} > 1$ is the window length. Then the average value is used for further analysis and design:

μ = \frac{1}{K + 1} \sum_{k = 0}^{K} μ_{k} .

Since $σ^{'} = κ σ$ , multiplying Eq. (1c) by $κ$ and subtracting it from (1d), we can identify the value of the parameter $γ$ :

γ = \frac{R_{t + 1} - R_{t} - κ I_{t + 1} + κ (1 - μ) I_{t}}{(1 + κ) I_{t}},

whose LSM estimation is

γ_{k} = \frac{\sum_{t = 0}^{J - k - 1} I_{t} (R_{t + 1} - R_{t} - κ I_{t + 1} + κ (1 - μ) I_{t})}{(1 + κ) \sum_{t = 0}^{J - k - 1} I_{t}^{2}}

for $k = 0, 1, \dots, K$ , or the moving window estimation:

γ_{k} = \frac{\sum_{t = k}^{k + K_{w}} I_{t} (R_{t + 1} - R_{t} - κ I_{t + 1} + κ (1 - μ) I_{t})}{(1 + κ) \sum_{t = k}^{k + K_{w}} I_{t}^{2}}

for $k = 0, 1, \dots, K$ with $K = J - K_{w}$ . As for $μ$ the average value is used for further analysis and design:

γ = \frac{1}{K + 1} \sum_{k = 0}^{K} γ_{k} .

Next, the sum of Eqs. (1c), (1c) allows us to calculate the related number of asymptomatic infectious ( $σ$ and $σ^{'}$ are chosen, $μ$ is estimated):

E_{t} = \frac{1}{σ + σ^{'}} (I_{t + 1} - (1 - μ) I_{t} + R_{t + 1} - R_{t}),

(3)

while the number of susceptible individuals can be evaluated using the total population:

S_{t} = N - I_{t} - R_{t} - E_{t} - D_{t} .

(4)

If we take into account (3), (4), the state of (1) can be considered as available for direct measurements, shifting the focus to the problems of parameter identification and prediction explored in this work. At this point, having derived quantities $E_{t}$ , we can estimate the delays $τ_{r}$ and $τ_{p}$ using one of the methods presented above. From Eq. (1b), we can derive the infection rate (for the selected values $p$ , $r$ , $σ$ and $σ^{'}$ ):

b = N \frac{E_{t + 1} - (1 - σ - σ^{'}) E_{t}}{(p_{t - τ_{p}} I_{t} + r_{t - τ_{r}} E_{t}) S_{t}},

whose LSM estimation is

b_{k} = N \frac{\sum_{t = 0}^{J - k - 1} (p_{t - τ_{p}} I_{t} + r_{t - τ_{r}} E_{t}) (E_{t + 1} - (1 - σ - σ^{'}) E_{t}) S_{t}}{\sum_{t = 0}^{J - k - 1} {(p_{t - τ_{p}} I_{t} + r_{t - τ_{r}} E_{t})}^{2} S_{t}^{2}}

for $k = 0, 1, \dots, K$ , or the moving window estimation version:

b_{k} = N \frac{\sum_{t = k}^{k + K_{w} - 1} (p_{t - τ_{p}} I_{t} + r_{t - τ_{r}} E_{t}) (E_{t + 1} - (1 - σ - σ^{'}) E_{t}) S_{t}}{\sum_{t = k}^{k + K_{w} - 1} {(p_{t - τ_{p}} I_{t} + r_{t - τ_{r}} E_{t})}^{2} S_{t}^{2}}

for $k = 0, 1, \dots, K$ with $K = J - K_{w}$ , then the identified value is again the average of these estimates:

b = \frac{1}{K + 1} \sum_{k = 0}^{K} b_{k} .

Remark 6

Due to measurement noise, the derived values of $E_{t}$ , $γ_{k}$ , and $b_{k}$ can be negative (that is physically impossible), then a previous positive estimate can be taken into account, i.e., $E_{t} = E_{t - 1}$ , or only positive quantities for the average calculation can be used: $b = \frac{1}{K + 1} \sum_{k = 0}^{K} ϱ_{k} b_{k}$ with $ϱ_{k} = 0.5 (sign (b_{k}) + 1)$ (it is $0$ for negative $b_{k}$ and $1$ otherwise).

The results of identification for all considered countries, and simulation and validation can be found in Section 6. Next, let us enlarge the prediction’s validity based on (1) by considering intervals of admissible values for the parameters and initial conditions.

5. Interval prediction

In the previous section, the values of parameters $b, γ$ , $μ$ , $τ_{p}$ , $τ_{r}$ for the model (1) were identified for selected guesses of $α_{1}, α_{2}, α_{3}, σ, σ^{'}$ . The model’s initial conditions, $S_{0}$ , $I_{0}$ , $E_{0}$ , $D_{0}$ , and $R_{0}$ , were chosen from measured/reconstructed sets. However, as we can conclude from the results of the identification (see Section 6), the variation of the estimated values of $b, γ$ , $μ$ , $τ_{p}$ , $τ_{r}$ is rather significant. It is related to the model’s generic structure, uncertainties in the auxiliary parameters’ values, and noises in the measured information, but not only. A possible interpretation of these results is that the parameters have to be considered time-varying in the model (1). Indeed, if we focus on the mortality rate $μ$ : obviously, it does not stay constant during the whole period of epidemics, and at the outbreak peak, its value is usually higher since it is related to an increased load on the health system. Unfortunately, practical identification and utilization of time-varying parameters are rather tricky (additionally, it is difficult to forecast their future values). However, for an interval prediction, we need just the set of admissible values of the parameters (Efimov and Raïssi, 2016, Leurent et al., 2019). The interval predictors can generate the envelope of trajectories, including any possible run with parameters and/or initial conditions taking values in the selected intervals. Such an approach dramatically improves the validity of the prediction. In such a case, we calculate/evaluate the sets of the resulted trajectories.

Further in this section, we continue referencing the model (1) assuming the parameters $σ, σ^{'}, b, γ, μ$ to be time-varying (with a small ambiguity, the notation is kept the same). The obtained nominal identified values of $σ, σ^{'}, b, γ, μ$ are interpreted as the middles of the intervals of admissible values for these parameters. We pursue to design an interval predictor that evaluates all possible trajectories for (1) with such time-varying parameters under interval inputs $r_{t}$ and $p_{t}$ (the previously selected values are also chosen as the middles of the admissible sets) and interval initial conditions for the states (that represents the measurement noise or time variation of $α_{i}$ , $i = 1$ , $2$ , $3$ , see Remark 3).

5.1. Explanation of idea

In the sequel, for two vectors $x_{1}, x_{2} \in R^{n}$ or matrices $A_{1}, A_{2} \in R^{n \times n}$ , the relations $x_{1} \leq x_{2}$ and $A_{1} \leq A_{2}$ are understood element-wise. Given a matrix $A \in R^{m \times n}$ , define $A^{+} = max {0, A}$ also element-wise and $A^{-} = A^{+} - A$ (similarly for vectors).

Lemma 1 Efimov & Raïssi, 2016 —

Let $x \in R^{n}$ be a vector variable, satisfying $\underline{x} \leq x \leq \bar{x}$ for some $\underline{x}, \bar{x} \in R^{n}$ .

(1) If $A \in R^{m \times n}$ is a constant matrix, then

$A^{+} \underline{x} - A^{-} \bar{x} \leq A x \leq A^{+} \bar{x} - A^{-} \underline{x} .$ (5)

(2) If $A \in R^{m \times n}$ is a matrix variable and $\underline{A} \leq A \leq \bar{A}$ for some $\underline{A}, \bar{A} \in R^{m \times n}$ , then

${\underline{A}}^{+} {\underline{x}}^{+} - {\bar{A}}^{+} {\underline{x}}^{-} - {\underline{A}}^{-} {\bar{x}}^{+} + {\bar{A}}^{-} {\bar{x}}^{-} \leq A x$ (6)

$\leq {\bar{A}}^{+} {\bar{x}}^{+} - {\underline{A}}^{+} {\bar{x}}^{-} - {\bar{A}}^{-} {\underline{x}}^{+} + {\underline{A}}^{-} {\underline{x}}^{-} .$

The idea of the interval prediction for a discrete-time system with time-varying parameters can be illustrated on a simple scalar case (all equations of (1) can be rewritten is this form):

x_{t + 1} = a_{t} x_{t} + d_{t},

where $x_{t} \in R_{+}$ is a non-negative system state, whose initial conditions belong to a given interval:

x_{0} \in [{\underline{x}}_{0}, {\bar{x}}_{0}],

$a_{t} \in R_{+}$ and $d_{t} \in R$ are uncertain parameters and input, which also take values in known intervals:

a_{t} \in [{\underline{a}}_{t}, {\bar{a}}_{t}], d_{t} \in [{\underline{d}}_{t}, {\bar{d}}_{t}]

for all $t \in N$ . We assume that $0 \leq {\underline{x}}_{0} \leq {\bar{x}}_{0}$ , $0 \leq {\underline{a}}_{t} \leq {\bar{a}}_{t}$ and ${\underline{d}}_{t} \leq {\bar{d}}_{t}$ are known for all $t \in N$ . The imposed non-negativity constraints on $x_{t}$ and $a_{t}$ correspond to the case of the model (1). We want to calculate the lower ${\underline{x}}_{t}$ and upper ${\bar{x}}_{t}$ predictions of the state $x_{t}$ of this system under the introduced hypotheses on all uncertain variables, requiring the relations:

0 \leq {\underline{x}}_{t} \leq x_{t} \leq {\bar{x}}_{t} \forall t \in N .

Applying Lemma 1 to the term $a_{t} x_{t}$ under introduced sign restrictions, we obtain

{\underline{a}}_{t} {\underline{x}}_{t} \leq a_{t} x_{t} \leq {\bar{a}}_{t} \bar{x},

then a possible structure of interval predictor is as follows:

{\underline{x}}_{t + 1} = {\underline{a}}_{t} {\underline{x}}_{t} + {\underline{d}}_{t} and {\bar{x}}_{t + 1} = {\bar{a}}_{t} {\bar{x}}_{t} + {\bar{d}}_{t} .

To substantiate the desired interval inclusion for $x_{t}$ by ${\underline{x}}_{t}, {\bar{x}}_{t}$ , we can consider the lower ${\underline{e}}_{t} = x_{t} - {\underline{x}}_{t}$ and the upper ${\bar{e}}_{t} = {\bar{x}}_{t} - x_{t}$ prediction errors, whose dynamics take the form:

{\underline{e}}_{t + 1} = (a_{t} x_{t} - {\underline{a}}_{t} {\underline{x}}_{t}) + (d_{t} - {\underline{d}}_{t}) and {\bar{e}}_{t + 1} = ({\bar{a}}_{t} {\bar{x}}_{t} - a_{t} x_{t}) + ({\bar{d}}_{t} - d_{t}) .

Then it is easy to verify that the terms $d_{t} - {\underline{d}}_{t}$ and ${\bar{d}}_{t} - d_{t}$ are non-negative by the definition of ${\underline{d}}_{t}, {\bar{d}}_{t}$ , and the terms $a_{t} x_{t} - {\underline{a}}_{t} {\underline{x}}_{t}$ and ${\bar{a}}_{t} {\bar{x}}_{t} - a_{t} x_{t}$ have the same property for $t = 0$ by the definition of ${\underline{a}}_{t}, {\bar{a}}_{t}$ and ${\underline{x}}_{0}, {\bar{x}}_{0}$ . Therefore, ${\underline{e}}_{1} \geq 0$ , ${\bar{e}}_{1} \geq 0$ (that implies $x_{1} \in [{\underline{x}}_{1}, {\bar{x}}_{1}]$ ) and the analysis can be iteratively repeated for all $t \in N$ . Obviously, the estimates ${\underline{x}}_{t}, {\bar{x}}_{t}$ are bounded provided that

{\bar{a}}_{t} \leq 1 - ϵ

for some $ϵ \in (0, 1)$ , and the Lyapunov function $V_{t} = {\underline{x}}_{t} + {\bar{x}}_{t}$ can be used to support this claim.

Let us apply this method to the model (1), where each equation there has the form as above.

5.2. Equations of interval predictor and its properties

To this end, we assume that all parameters belong to the known intervals (for simplicity we do not deviate the values of $τ_{p}$ , $τ_{r}$ and $κ$ ):

σ \in [\underline{σ}, \bar{σ}], γ \in [\underline{γ}, \bar{γ}], b \in [\underline{b}, \bar{b}], p_{t} \in [{\underline{p}}_{t}, {\bar{p}}_{t}], r_{t} \in [{\underline{r}}_{t}, {\bar{r}}_{t}], \forall t \in N,

(7)

together with the initial conditions in (1):

S_{0} \in [{\underline{S}}_{0}, {\bar{S}}_{0}], I_{0} \in [{\underline{I}}_{0}, {\bar{I}}_{0}], E_{0} \in [{\underline{E}}_{0}, {\bar{E}}_{0}], D_{0} \in [{\underline{D}}_{0}, {\bar{D}}_{0}], R_{0} \in [{\underline{R}}_{0}, {\bar{R}}_{0}],

(8)

where non-negative values $\underline{σ}, \bar{σ}$ , $\underline{γ}, \bar{γ}$ , $\underline{b}, \bar{b}$ , ${\underline{p}}_{t}, {\bar{p}}_{t}$ , ${\underline{r}}_{t}, {\bar{r}}_{t}$ , ${\underline{S}}_{0}, {\bar{S}}_{0}$ , ${\underline{I}}_{0}, {\bar{I}}_{0}$ , ${\underline{E}}_{0}, {\bar{E}}_{0}$ , ${\underline{D}}_{0}, {\bar{D}}_{0}$ and ${\underline{R}}_{0}, {\bar{R}}_{0}$ are obtained from the ones used in the previous section by applying $\pm δ %$ deviation from those nominal quantities (we can also use the variation of the identified values). Then, applying the approach explained just above, we derive the equations of the interval predictor:

{\underline{S}}_{t + 1} = (1 - \bar{b} \frac{({\bar{p}}_{t - τ_{p}} {\bar{I}}_{t} + {\bar{r}}_{t - τ_{r}} {\bar{E}}_{t})}{N}) {\underline{S}}_{t},

(9)

{\underline{E}}_{t + 1} = (1 - (1 + κ) \bar{σ} + \underline{b} \frac{{\underline{r}}_{t - τ_{r}}}{N} {\underline{S}}_{t}) {\underline{E}}_{t} + {\underline{p}}_{t - τ_{p}} \underline{b} \frac{{\underline{I}}_{t} {\underline{S}}_{t}}{N},

{\underline{I}}_{t + 1} = (1 - \bar{γ} - \bar{μ}) {\underline{I}}_{t} + \underline{σ} {\underline{E}}_{t},

{\underline{R}}_{t + 1} = {\underline{R}}_{t} + \underline{γ} {\underline{I}}_{t} + κ \underline{σ} {\underline{E}}_{t},

{\underline{D}}_{t + 1} = {\underline{D}}_{t} + \underline{μ} {\underline{I}}_{t},

{\bar{S}}_{t + 1} = (1 - \underline{b} \frac{({\underline{p}}_{t - τ_{p}} {\underline{I}}_{t} + {\underline{r}}_{t - τ_{r}} {\underline{E}}_{t})}{N}) {\bar{S}}_{t},

{\bar{E}}_{t + 1} = min \{N, (1 - (1 + κ) \underline{σ} + \bar{b} \frac{{\bar{r}}_{t - τ_{r}}}{N} {\bar{S}}_{t}) {\bar{E}}_{t} + {\bar{p}}_{t - τ_{p}} \bar{b} \frac{{\bar{I}}_{t} {\bar{S}}_{t}}{N}\},

{\bar{I}}_{t + 1} = min \{N, (1 - \underline{γ} - \underline{μ}) {\bar{I}}_{t} + \bar{σ} {\bar{E}}_{t}\},

{\bar{R}}_{t + 1} = min \{N, {\bar{R}}_{t} + \bar{γ} {\bar{I}}_{t} + κ \bar{σ} {\bar{E}}_{t}\},

{\bar{D}}_{t + 1} = min \{N, {\bar{D}}_{t} + \bar{μ} {\bar{I}}_{t}\},

where ${\underline{S}}_{t}, {\bar{S}}_{t}$ , ${\underline{I}}_{t}, {\bar{I}}_{t}$ , ${\underline{E}}_{t}, {\bar{E}}_{t}$ , ${\underline{D}}_{t}, {\bar{D}}_{t}$ and ${\underline{R}}_{t}, {\bar{R}}_{t}$ are the lower and upper interval predictions for $S_{t}$ , $I_{t}$ , $E_{t}$ , $D_{t}$ and $R_{t}$ , respectively.

Theorem 1

For the model (1) satisfying the relations (7), (8) with

$2 \bar{b} sup_{t \in N} {\bar{r}}_{t} \leq 1, \bar{σ} \leq \frac{1}{1 + κ}, \bar{γ} + \bar{μ} \leq 1,$ (10)

the interval predictor (9) guarantees the interval inclusions for the state of (1) for all $t \in N$ :

$S_{t} \in [{\underline{S}}_{t}, {\bar{S}}_{t}], I_{t} \in [{\underline{I}}_{t}, {\bar{I}}_{t}], E_{t} \in [{\underline{E}}_{t}, {\bar{E}}_{t}], D_{t} \in [{\underline{D}}_{t}, {\bar{D}}_{t}], R_{t} \in [{\underline{R}}_{t}, {\bar{R}}_{t}]$

with boundedness of all predictions for all $t \in N$ :

${\underline{S}}_{t}, {\bar{S}}_{t}, {\underline{I}}_{t}, {\bar{I}}_{t}, {\underline{E}}_{t}, {\bar{E}}_{t}, {\underline{D}}_{t}, {\bar{D}}_{t}, {\underline{R}}_{t}, {\bar{R}}_{t} \in [0, N] .$

Proof

By direct calculation and applying Lemma 1, we can check that

$\underline{b} \frac{({\underline{p}}_{t - τ_{p}} {\underline{I}}_{t} + {\underline{r}}_{t - τ_{r}} {\underline{E}}_{t})}{N} \leq b \frac{(p_{t - τ_{p}} I_{t} + r_{t - τ_{r}} E_{t})}{N} \leq \bar{b} \frac{({\bar{p}}_{t - τ_{p}} {\bar{I}}_{t} + {\bar{r}}_{t - τ_{r}} {\bar{E}}_{t})}{N},$

$\underline{b} \frac{{\underline{r}}_{t - τ}}{N} {\underline{S}}_{t} - (1 + κ) \bar{σ} \leq b \frac{r_{t - τ}}{N} S_{t} - (1 + κ) σ \leq \bar{b} \frac{{\bar{r}}_{t - τ}}{N} {\bar{S}}_{t} - (1 + κ) \underline{σ},$

${\underline{p}}_{t - τ_{p}} \underline{b} \frac{{\underline{I}}_{t} {\underline{S}}_{t}}{N} \leq p_{t - τ_{p}} b \frac{I_{t} S_{t}}{N} \leq {\bar{p}}_{t - τ_{p}} \bar{b} \frac{{\bar{I}}_{t} {\bar{S}}_{t}}{N},$

$\underline{σ} {\underline{E}}_{t} \leq σ E_{t} \leq \bar{σ} {\bar{E}}_{t},$

$\underline{γ} {\underline{I}}_{t} \leq γ I_{t} \leq \bar{γ} {\bar{I}}_{t},$

$\underline{μ} {\underline{I}}_{t} \leq μ I_{t} \leq \bar{μ} {\bar{I}}_{t}$

due to (7), (8) for $t = 0$ . Since (recall that $r_{t} \geq p_{t}$ , ${\bar{I}}_{t} + {\bar{E}}_{t} \leq 2 N$ , thus ${\underline{S}}_{t} \geq 0$ )

$\bar{b} \frac{({\bar{p}}_{t - τ_{p}} {\bar{I}}_{t} + {\bar{r}}_{t - τ_{r}} {\bar{E}}_{t})}{N} \leq \bar{b} {\bar{r}}_{t - τ_{r}} \frac{{\bar{I}}_{t} + {\bar{E}}_{t}}{N} \leq 2 \bar{b} {\bar{r}}_{t - τ_{r}} \leq 2 \bar{b} sup_{t \in N} {\bar{r}}_{t},$

$1 - (1 + κ) \bar{σ} + \underline{b} \frac{{\underline{r}}_{t - τ_{r}}}{N} {\underline{S}}_{t} \geq 1 - (1 + κ) \bar{σ},$

we obtain that

$1 \geq \bar{b} \frac{({\bar{p}}_{t - τ_{p}} {\bar{I}}_{t} + {\bar{r}}_{t - τ_{r}} {\bar{E}}_{t})}{N}, 1 + \underline{b} \frac{{\underline{r}}_{t - τ_{r}}}{N} {\underline{S}}_{t} \geq (1 + κ) \bar{σ}$

due to (10), then as we demonstrated above

$S_{1} \in [{\underline{S}}_{1}, {\bar{S}}_{1}], I_{1} \in [{\underline{I}}_{1}, {\bar{I}}_{1}], E_{1} \in [{\underline{E}}_{1}, {\bar{E}}_{1}],$

$D_{1} \in [{\underline{D}}_{1}, {\bar{D}}_{1}], R_{1} \in [{\underline{R}}_{1}, {\bar{R}}_{1}],$

and such a verification can be repeated for all $t \in N$ . In the same way we can show that if the relations

$0 \leq {\underline{S}}_{t} \leq {\bar{S}}_{t}, 0 \leq {\underline{I}}_{t} \leq {\bar{I}}_{t}, 0 \leq {\underline{E}}_{t} \leq {\bar{E}}_{t}, 0 \leq {\underline{D}}_{t} \leq {\bar{D}}_{t}, 0 \leq {\underline{R}}_{t} \leq {\bar{R}}_{t}$

are satisfied for some $t \in N$ , then they also hold for $t + 1$ in (9).

To substantiate boundedness of the state of the interval predictor, we can first consider a Lyapunov function candidate for the lower bounds:

${\underline{V}}_{t} = {\underline{S}}_{t} + {\underline{I}}_{t} + {\underline{E}}_{t} + {\underline{D}}_{t} + {\underline{R}}_{t},$

which is well-defined since, as we have shown above, all variables are nonnegative for $t \in N$ . Next, the increment of this Lyapunov function admits a non-positive upper estimate:

${\underline{V}}_{t + 1} - {\underline{V}}_{t} = - \frac{(\bar{b} {\bar{p}}_{t - τ_{p}} {\bar{I}}_{t} - \underline{b} {\underline{p}}_{t - τ_{p}} {\underline{I}}_{t} + \bar{b} {\bar{r}}_{t - τ_{r}} {\bar{E}}_{t} - \underline{b} {\underline{r}}_{t - τ_{r}} {\underline{E}}_{t})}{N} {\underline{S}}_{t} - (\bar{γ} - \underline{γ} + \bar{μ} - \underline{μ}) {\underline{I}}_{t} - (1 + κ) (\bar{σ} - \underline{σ}) {\underline{E}}_{t} \leq - (\bar{γ} - \underline{γ} + \bar{μ} - \underline{μ}) {\underline{I}}_{t} - (1 + κ) (\bar{σ} - \underline{σ}) {\underline{E}}_{t} \leq 0,$

which implies boundedness of all variables ${\underline{S}}_{t}, {\underline{I}}_{t}, {\underline{E}}_{t}, {\underline{D}}_{t}, {\underline{R}}_{t}$ . Applying LaSalle Invariance Principle (La Salle, 1976), we conclude that all trajectories converge to the set with ${\underline{I}}_{t} = {\underline{E}}_{t} = 0$ , that leads to the dynamics

${\underline{R}}_{t + 1} = {\underline{R}}_{t}, {\underline{D}}_{t + 1} = {\underline{D}}_{t}$

reproducing a steady-state solution. Finally, the condition $2 \bar{b} {sup}_{t \in N} {\bar{r}}_{t} \leq 1$ introduced in the formulation of the theorem results in

$0 \leq 1 - \bar{b} \frac{({\bar{p}}_{t - τ_{p}} {\bar{I}}_{t} + {\bar{r}}_{t - τ_{r}} {\bar{E}}_{t})}{N} \leq 1$

that ensures the boundedness of ${\underline{S}}_{t}$ . Second, for the upper bound variables, consider a Lyapunov function candidate

${\bar{V}}_{t} = {\bar{S}}_{t} + {\bar{I}}_{t} + {\bar{E}}_{t} + {\bar{D}}_{t} + {\bar{R}}_{t},$

which is also well-defined and whose increment for non-saturated dynamics in (9) admits an estimate:

${\bar{V}}_{t + 1} - {\bar{V}}_{t} = \frac{(\bar{b} {\bar{p}}_{t - τ_{p}} {\bar{I}}_{t} - \underline{b} {\underline{p}}_{t - τ_{p}} {\underline{I}}_{t} + \bar{b} {\bar{r}}_{t - τ_{r}} {\bar{E}}_{t} - \underline{b} {\underline{r}}_{t - τ_{r}} {\underline{E}}_{t})}{N} {\bar{S}}_{t}$

$+ (1 + κ) (\bar{σ} - \underline{σ}) {\bar{E}}_{t} + (\bar{μ} - \underline{μ} + \bar{γ} - \underline{γ}) {\bar{I}}_{t} \geq 0 .$

Hence, the upper bound variables ${\bar{S}}_{t}, {\bar{I}}_{t}, {\bar{E}}_{t}, {\bar{D}}_{t}, {\bar{R}}_{t}$ may become unbounded, and that is why the saturation is explicitly introduced for ${\bar{I}}_{t}, {\bar{E}}_{t}, {\bar{D}}_{t}, {\bar{R}}_{t}$ . For ${\bar{S}}_{t}$ , since

$1 - \underline{b} \frac{({\underline{p}}_{t - τ_{p}} {\underline{I}}_{t} + {\underline{r}}_{t - τ_{r}} {\underline{E}}_{t})}{N} \leq 1,$

the variable stays always bounded. □

Remark 7

The dynamics of lower and upper interval predictions are interrelated through the update equations of ${\underline{S}}_{t}, {\bar{S}}_{t}$ . Thus, the predictor (9) dimension is twice higher than in the system (1). The values of the variables ${\underline{S}}_{t}, {\bar{S}}_{t}$ can be evaluated using the population equation $S_{t} + E_{t} + I_{t} + R_{t} + D_{t} = N$ :

${\underline{S}}_{t} = N - {\bar{I}}_{t} - {\bar{E}}_{t} - {\bar{R}}_{t} - {\bar{D}}_{t},$

${\bar{S}}_{t} = N - {\underline{I}}_{t} - {\underline{E}}_{t} - {\underline{R}}_{t} - {\underline{D}}_{t},$

which, however, does not isolate the dynamics of lower and upper interval predictions. Also, preliminary simulations show that such modification leads to more conservative results, so we keep (9) for all further utilization.

6. Numerical results

Table 1 gives the current population in each of the considered countries and state,4 the parameter $α_{1}$ , and the delays $τ_{r}$ and $τ_{p}$ , as from July 30th.

Table 1.

Time period.

Region	N	$α_{1}$	$τ_{r}$	$τ_{p}$
France	67 064 000	1.78	5	25
Italy	60 359 546	4	10	30
Spain	46 600 396	6.7	8	30
Germany	46 600 396	1.02	3	21
Brazil	212 559 417	2.44	3	35
Russia	146 745 098	1.56	15	20
New York State	19 453 561	1.28	5	20
China	143 807 089	1.0	1	15

Open in a new tab

In this section, we introduce the used data together with the selected parameters, identify the parameters (as illustrated for France in Fig. 3) and simulate the interval predictor (as in Fig. 6 together with the plots of validation Fig. 7). The common parameters assigned to all countries (to simplify the analysis) are:

σ = \frac{1}{7}, κ = 0.1,

for chosen values of $p_{Q}, p_{N}, p_{R}, p_{M}$ .5 Adjusting these values for each country improves the forecast precision, but our goal here is to illustrate the proposed method’s broad applicability for the virus propagation interval prediction.

Fig. 3 — The identified parameters for France.

Fig. 6 — The results of simulation of (9) for France under $\pm 7.5 %$ variation of all parameters.

Fig. 7 — Validation of prediction of $I$ for France with $J - 120$ points of data under deviations of values of all parameters.

For most countries, the first date of data acquisition is March 12th, except for Italy (March 5th), New York State (March 16th), and China (January 16th). For all eight regions, the period considered for our analysis ended on July 30th. The data available from public sources is provided in Github.

Applying the proposed procedure to the parameter identification gives the results in Table 2.

Table 2.

Parameters estimation.

Region	$μ$	$γ$	$b$
France	$5.3345 \times 1 0^{- 4}$	0.0184	0.0918
Italy	$9.3987 \times 1 0^{- 4}$	0.0223	0.0159
Spain	$10.00 \times 1 0^{- 4}$	0.0275	0.1041
Germany	$8.9617 \times 1 0^{- 4}$	0.0693	0.1152
Brazil	$12.00 \times 1 0^{- 4}$	0.0579	0.1473
Russia	$8.5619 \times 1 0^{- 4}$	0.0152	0.0870
New York State	$6.5199 \times 1 0^{- 4}$	0.0271	0.0815
China	$10.40 \times 1 0^{- 4}$	0.0760	0.0238

Open in a new tab

6.1. Results of identification

For France, the obtained values $γ_{k}, b_{k}$ and $μ_{k}$ (solid lines) together with the selected average estimates $γ, b$ and $μ$ (dot lines), and the signal $ln (E_{t})$ (solid line) with approximations $a_{i} t + b_{i}$ (dash lines) are shown in Fig. 3. As we can conclude from these results, the identification of the value of $γ$ is relatively reliable and converging. The mortality rate $μ$ follows the gravity of the outbreak (it was maximal during the most severe virus propagation at the beginning of April). Also, the value of $b$ is more complicated to estimate since it depends on all quantities (we stop the identification if $p_{t - τ_{p}}$ and $r_{t - τ_{r}}$ are sufficiently small to avoid very noisy results; see the missing values in the plot). Finally, delays $τ_{r}$ and $τ_{p}$ are noticeable from the plot, and the line approximations are reasonable (if at a stage some delay cannot be recognized, then we can use a nominal value).

6.2. Simulation and validation

The simulation results, for France, of the model (1) with the identified parameters are given in Fig. 4 (for better visibility, all populations are plotted in the logarithmic scale), a zoomed comparison of the measured and reconstructed data is shown in Fig. 5 (as we can see, the measured data for $I$ , $R$ , and $D$ has a smooth shape, while the reconstructed variable $E$ , also used for identification, is rather noisy). In this case, the model can approximate the virus propagation reasonably well since the identified parameters are consistent with France’s available statistics.

Fig. 5 — The results of verification with identified parameters.

The obtained curves also demonstrate the lack of efficiency of the confinement. The number of asymptomatic infectious can be reduced quickly, but symptomatic patients may persist a long time giving rise to a second wave. This conclusion might be related to the model’s probable weak validity for the decreasing phase of the outbreak.

6.3. Simulation and validation results of the interval predictor

For France, the simulation results of the interval predictor (9) with $δ = 7.5 %$ is presented in Fig. 6 (the dashed and dotted lines represent, respectively, upper and lower interval bounds, the solid lines correspond to the average behavior, the circles depict measured and reconstructed data points used for identification). The width of the predicted interval of admissible values for the state of (1) is growing, which is related with a high level of uncertainty reflected by $δ$ and chosen for these simulations (according to Theorem 1, the dynamics of upper bounds of these variables are unstable, and the lower ones are converging to zero). For the sake of brevity, the simulation results for the remaining geographic regions are not presented here: the obtained model follows well the measured statistics for all countries and state.

As we can conclude from these curves, under sufficiently significant deviations of the parameters (which correspond to the amount and quality of data publicly available), the confinement may slow down the epidemics. The measurements are nearly included in the obtained intervals validating the prediction (the value of $δ$ was selected to ensure this property). There are two variants of epidemic development demonstrated in these results: optimistic, which corresponds to the lower bounds of $I$ and $E$ , and pessimistic presented by the respective upper bounds.

To check the prediction accuracy, we can select a part of the data for identification and another part for verification of prediction reliability. Such validation results are shown in Fig. 7, where the interval prediction for the infectious population $I$ is presented with a deviation of all parameters. As previous, blue dashed and dotted lines correspond do the upper $\bar{I}$ and the lower bounds $\underline{I}$ , the bold lines are calculated using $J - 1$ day initial conditions), the blue circles and squares are the measured information used for identification and validation, and the red line is the average behavior. In the plot, only the data points for $t = 0, 1, \dots, J - 120$ are used, shown by circles, and the interval predictor is initiated with the data for $t = J - 121$ . Then, square data points (which were not taken into account during identification for $t = J - 120, \dots, J$ ) can be compared with the predictor trajectories (bold dashed and dotted blue lines and the red one). As we can see, the points marked by squares are well included in the predicted interval, which confirms the reliability of (9) at least for $120$ days.

In general, further precision of the model and the parameters is needed. However, as a recommendation after these preliminary simulations, the preservation of the quarantine rules is desirable (the simulation clearly demonstrates the epidemics decreasing during lockdown only). The model shows a relatively low decrease in the number of infected individuals, then prolonging the isolation of the fragile part of the population, and social distancing is reasonable (it is worth noting that the value of $p_{R}$ is selected ad-hoc and probably too high).

In the sequel, an analysis of the model fitting to the data for other countries and state is demonstrated in Figs. 8, 9, 10, 11, 12, 13, 14: blue dashed and dotted lines correspond to the upper $\bar{I}$ and the lower bounds $\underline{I}$ (the bold lines are calculated using the last day included in the identification data). The red line is the average, the blue circles and squares are the measured information used for identification and validation. A reasonable fit of the model to the data for Italy is demonstrated in Fig. 8. The square points belong to the middle of the predicted interval in the plot.

Fig. 8 — Validation of prediction of $I$ for Italy with $J - 60$ points of data under deviations of values of all parameters.

Fig. 9 — Validation of prediction of $I$ for Spain with $J - 60$ points of data under deviations of values of all parameters.

Fig. 10 — Validation of prediction of $I$ for Germany with $J - 90$ points of data under deviations of values of all parameters.

Fig. 11 — Validation of prediction of $I$ for Brazil with $J - 70$ points of data under deviations of values of all parameters.

Fig. 12 — Validation of prediction of $I$ for Russia with $J - 120$ points of data under deviations of values of all parameters.

Fig. 13 — Validation of prediction of $I$ for NY State with $J - 40$ points of data under deviations of values of all parameters.

Fig. 14 — Validation of prediction of $I$ for China with $J - 120$ points of data under deviations of values of all parameters.

For Spain, a good fit of the model to the data is demonstrated in Fig. 9: the square points lie close to the middle of the predicted interval. For Germany, the square points in Fig. 10 are not included at the end in the predicted interval in the plot, which is related to the start of the second wave that is noticeable from the data.

For Brazil, the square points belong to the predicted interval in the plot, as shown in Fig. 11. A good fit of the model for Russia is shown in Fig. 12, where the square points belong to the lower part of the predicted interval in the plot.

A fit of the model for the NY State’s data is demonstrated in Fig. 13, where the square points belong to the middle of the predicted interval in the plot. A fit of the model to China’s data is demonstrated in Fig. 14, where the square points are not included at the end in the predicted interval in the plot, which is related to the start of the second wave that is noticeable from the data. As for Germany, this issue originated because the model parameters were identified several months before the beginning of the second wave, and in the end they lost their validity. The societal feedback and reactions also changed at that time, which is not reflected by the predictor’s inputs.

7. Conclusion

A simple new discrete-time SEIR epidemic model was identified and used to predict the quarantine’s influence on the SARS-CoV-2 virus propagation in France, Italy, Spain, Germany, Brazil, Russia, New York State, and China. An interval predictor method was developed to analyze the COVID-19 course – whose ability to take into account the sets of admissible values for initial conditions, inputs, and parameters – enlarges the prediction performance. It was demonstrated that the reliability of the interval prediction for $30 - 120$ days is rather good, even by such a simple model. The prediction showed that more extended confinement might be a bit more efficient, but a more strict as possible quarantine seemed to be advisable under the uncertainty level. The obtained results show that predicting the outbreak development with reasonable accuracy is possible by selecting different contact profiles between the countries’ compartments.

The eight considered countries can be divided into two groups: four European states (France, Italy, Spain, and Germany) and China, where the virus presence is already well developed with several weeks of quarantine, and two BRICS countries (Brazil and Russia) with the US, where the epidemics started later and somewhat general confinement has also been imposed later. The identified models for these groups of countries have common patterns (e.g., a significant variation of the recovery rate $γ$ for Brazil and Russia). Our prediction showed that in European countries, the peak of infections occurred in April–May in the optimistic scenario. Increased severity of the confinement could significantly decrease the amplitude of the peak discharging the health services load.

Machine learning tools can be further used to identify and optimize the time profile for the confinement. Another possible direction of improvement of the proposed approach is to consider a SEIR model with population separation either by age or by region (or by both), but this implies an increasing number of parameters to be identified (that can be impossible) and also needs specially structured data to be available. The introduction of delays in the proposed model dynamics to better describe the virus propagation lags between compartments is also a promising investigation area.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

As in the Report 13 by the Imperial College London, for example.

See, for example, these arguments, or a dedicated analysis in the Report 13 by the Imperial College of London, the works in Bohk-Ewald, Dudel, and Myrskyla (2020) and Magal and Webb (2020), a report by CMMID, or this article by University of Melbourne.

A way to determine $α_{3}$ is given in https://github.com/sebastianhohmann.

⁴

Source: www.en.wikipedia.org/wiki/.

⁵

Check the code in Github.

References

Aronna, M. S., & Bliman, P.-A. Interval observer for uncertain time-varying SIR-SI epidemiological model of vector-borne disease. In 2018 16th European control conference. Limassol.
Bliman, P.-A., Efimov, D., & Ushirobira, R. (2018). A class of nonlinear adaptive observers for SIR epidemic model. In Proceedings of ECC’18, the 16th annual European control conference.
Bohk-Ewald C., Dudel C., Myrskyla M. 2020. A demographic scaling model for estimating the total number of COVID-19 infections. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cantó B., Coll C., Sánchez E. Estimation of parameters in a structured SIR model. Advances in Difference Equations. 2017;2017(1):33. [Google Scholar]
Dandekar R., Barbastathis G. 2020. Neural Network aided quarantine control model estimation of global Covid-19 spread.arXiv:2004.02752 [DOI] [PMC free article] [PubMed] [Google Scholar]
Das S., Ghosh P., Sen B., Mukhopadhyay I. 2020. Critical community size for COVID-19 – a model based approach to provide a rationale behind the lockdown.arXiv:2004.03126 [Google Scholar]
Degue, K. H., Efimov, D., & Iggidr, A. (2016). Interval estimation of sequestered infected erythrocytes in malaria patients. In 2016 European control conference (pp. 1141–1145).
Degue, K. H., & Le Ny, J. (2018). An interval observer for discrete-time SEIR epidemic models. In 2018 annual american control conference (pp. 5934–5939).
d’Onofrio A., Manfredi P., Poletti P. The interplay of public intervention and private choices in determining the outcome of vaccination programmes. PLoS One. 2012;7(10) doi: 10.1371/journal.pone.0045653. [DOI] [PMC free article] [PubMed] [Google Scholar]
Efimov D., Raïssi T. Design of interval observers for uncertain dynamical systems. Automation and Remote Control. 2016;77(2):191–225. [Google Scholar]
Efimov, D., & Ushirobira, R. (2020). On interval prediction of COVID-19 development in France based on a SEIR epidemic model. In Proc. IEEE conference on decision and control. Jeju Island, Korea. [DOI] [PMC free article] [PubMed]
Ferguson N.M., Laydon D., Nedjati-Gilani G., Imai N., Ainslie K., Baguelin M., et al. WHO Collaborating Centre for Infectious Disease Modelling, MRC Centre for Global Infectious Disease Analysis, Abdul Latif Jameel Institute for Disease and Emergency Analytics Imperial College London; 2020. Impact of non-pharmaceutical interventions (NPIs) to reduce COVID-19 mortality and healthcare demand. [Google Scholar]
Gevertz J., Greene J., Hixahuary Sanchez Tapia C., Sontag E.D. 2020. A novel COVID-19 epidemiological model with explicit susceptible and asymptomatic isolation compartments reveals unexpected consequences of timing social distancing. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gouzé J., Rapaport A., Hadj-Sadok M. Interval observers for uncertain biological systems. Ecological Modelling. 2000;133:46–56. [Google Scholar]
Hu Z., Ge Q., Li S., Boerwincle E., Jin L., Xiong M. 2020. Forecasting and evaluating intervention of Covid-19 in the World. arXiv e-prints, arXiv:2003.09800. [DOI] [PMC free article] [PubMed] [Google Scholar]
Keeling M.J., Rohani P. Princeton University Press; 2008. Modeling infectious diseases in humans and animals. [Google Scholar]
La Salle J.P. Society for Industrial and Applied Mathematics; 1976. The stability of dynamical systems. [DOI] [Google Scholar]
Lauer S.A., Grantz K.H., Bi Q., Jones F.K., Zheng Q., Meredith H.R., et al. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: Estimation and application. Annals of Internal Medicine. 2020 doi: 10.7326/M20-0504. [DOI] [PMC free article] [PubMed] [Google Scholar]
Leurent, E., Efimov, D., Raïssi, T., & Perruquetti, W. (2019). Interval prediction for continuous-time systems with parametric uncertainties. In Proc. IEEE conference on decision and control. Nice.
Lopez L.R., Rodo X. 2020. A modified SEIR model to predict the COVID-19 outbreak in Spain and Italy: simulating control scenarios and multi-scale epidemics. medRxiv, arXiv:https://www.medrxiv.org/content/early/2020/04/16/2020.03.27.20045005.full.pdf. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lourenco J., Paton R., Ghafari M., Kraemer M., Thompson C., Simmonds P., et al. 2020. Fundamental principles of epidemic spread highlight the immediate need for large-scale serological surveys to assess the stage of the SARS-CoV-2 epidemic. medRxiv. [DOI] [Google Scholar]
Magal P., Webb G. The parameter identification problem for SIR epidemic models: Identifying unreported cases. Journal of Mathematical Biology. 2018;77:1629–1648. doi: 10.1007/s00285-017-1203-9. [DOI] [PubMed] [Google Scholar]
Magal P., Webb G. 2020. Predicting the number of reported and unreported cases for the COVID-19 epidemic in South Korea, Italy, France and Germany. medRxiv. arXiv:https://www.medrxiv.org/content/early/2020/03/24/2020.03.21.20040154.full.pdf. [DOI] [PMC free article] [PubMed] [Google Scholar]
Maier B.F., Brockmann D. 2020. Effective containment explains sub-exponential growth in confirmed cases of recent COVID-19 outbreak in Mainland China. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mazenc F., Bernard O. Interval observers for linear time-invariant systems with disturbances. Automatica. 2011;47(1):140–147. [Google Scholar]
Mazenc F., Dinh T.N., Niculescu S.I. Interval observers for discrete-time systems. International Journal of Robust and Nonlinear Control. 2014;24:2867–2890. [Google Scholar]
Nussbaumer-Streit B., Mayr V., Dobrescu A., Chapman A., Persad E., Klerings I., et al. Quarantine alone or in combination with other public health measures to control COVID-19: A rapid review. Cochrane Database of Systematic Reviews. 2020;(4) doi: 10.1002/14651858.CD013574. [DOI] [PMC free article] [PubMed] [Google Scholar]
Peng L., Yang W., Zhang D., Zhuge C., Hong L. 2020. Epidemic analysis of COVID-19 in China by dynamical modeling.arXiv:2002.06563 [Google Scholar]
Raïssi T., Efimov D., Zolghadri A. Interval state estimation for a class of nonlinear systems. IEEE Transactions on Automatic Control. 2012;57(1):260–265. [Google Scholar]
Ushirobira, R., Efimov, D., & Bliman, P. (2019). Estimating the infection rate of a SIR epidemic model via differential elimination. In 2019 18th European control conference (pp. 1170–1175).
Wang Z., Bauch C.T., Bhattacharyya S., d’Onofrio A., Manfredi P., Perc M., et al. Statistical physics of vaccination. Physics Reports. 2016;664:1–113. [Google Scholar]
Yang Z., Zeng Z., Wang K., Wong S.-S., Liang W., Zanin M., et al. Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions. Journal of Thoracic Disease. 2020;12(3) doi: 10.21037/jtd.2020.02.64. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b1] Aronna, M. S., & Bliman, P.-A. Interval observer for uncertain time-varying SIR-SI epidemiological model of vector-borne disease. In 2018 16th European control conference. Limassol.

[b2] Bliman, P.-A., Efimov, D., & Ushirobira, R. (2018). A class of nonlinear adaptive observers for SIR epidemic model. In Proceedings of ECC’18, the 16th annual European control conference.

[b3] Bohk-Ewald C., Dudel C., Myrskyla M. 2020. A demographic scaling model for estimating the total number of COVID-19 infections. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b4] Cantó B., Coll C., Sánchez E. Estimation of parameters in a structured SIR model. Advances in Difference Equations. 2017;2017(1):33. [Google Scholar]

[b5] Dandekar R., Barbastathis G. 2020. Neural Network aided quarantine control model estimation of global Covid-19 spread.arXiv:2004.02752 [DOI] [PMC free article] [PubMed] [Google Scholar]

[b6] Das S., Ghosh P., Sen B., Mukhopadhyay I. 2020. Critical community size for COVID-19 – a model based approach to provide a rationale behind the lockdown.arXiv:2004.03126 [Google Scholar]

[b7] Degue, K. H., Efimov, D., & Iggidr, A. (2016). Interval estimation of sequestered infected erythrocytes in malaria patients. In 2016 European control conference (pp. 1141–1145).

[b8] Degue, K. H., & Le Ny, J. (2018). An interval observer for discrete-time SEIR epidemic models. In 2018 annual american control conference (pp. 5934–5939).

[b9] d’Onofrio A., Manfredi P., Poletti P. The interplay of public intervention and private choices in determining the outcome of vaccination programmes. PLoS One. 2012;7(10) doi: 10.1371/journal.pone.0045653. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b10] Efimov D., Raïssi T. Design of interval observers for uncertain dynamical systems. Automation and Remote Control. 2016;77(2):191–225. [Google Scholar]

[b11] Efimov, D., & Ushirobira, R. (2020). On interval prediction of COVID-19 development in France based on a SEIR epidemic model. In Proc. IEEE conference on decision and control. Jeju Island, Korea. [DOI] [PMC free article] [PubMed]

[b12] Ferguson N.M., Laydon D., Nedjati-Gilani G., Imai N., Ainslie K., Baguelin M., et al. WHO Collaborating Centre for Infectious Disease Modelling, MRC Centre for Global Infectious Disease Analysis, Abdul Latif Jameel Institute for Disease and Emergency Analytics Imperial College London; 2020. Impact of non-pharmaceutical interventions (NPIs) to reduce COVID-19 mortality and healthcare demand. [Google Scholar]

[b13] Gevertz J., Greene J., Hixahuary Sanchez Tapia C., Sontag E.D. 2020. A novel COVID-19 epidemiological model with explicit susceptible and asymptomatic isolation compartments reveals unexpected consequences of timing social distancing. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b14] Gouzé J., Rapaport A., Hadj-Sadok M. Interval observers for uncertain biological systems. Ecological Modelling. 2000;133:46–56. [Google Scholar]

[b15] Hu Z., Ge Q., Li S., Boerwincle E., Jin L., Xiong M. 2020. Forecasting and evaluating intervention of Covid-19 in the World. arXiv e-prints, arXiv:2003.09800. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b16] Keeling M.J., Rohani P. Princeton University Press; 2008. Modeling infectious diseases in humans and animals. [Google Scholar]

[b17] La Salle J.P. Society for Industrial and Applied Mathematics; 1976. The stability of dynamical systems. [DOI] [Google Scholar]

[b18] Lauer S.A., Grantz K.H., Bi Q., Jones F.K., Zheng Q., Meredith H.R., et al. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: Estimation and application. Annals of Internal Medicine. 2020 doi: 10.7326/M20-0504. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b19] Leurent, E., Efimov, D., Raïssi, T., & Perruquetti, W. (2019). Interval prediction for continuous-time systems with parametric uncertainties. In Proc. IEEE conference on decision and control. Nice.

[b20] Lopez L.R., Rodo X. 2020. A modified SEIR model to predict the COVID-19 outbreak in Spain and Italy: simulating control scenarios and multi-scale epidemics. medRxiv, arXiv:https://www.medrxiv.org/content/early/2020/04/16/2020.03.27.20045005.full.pdf. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b21] Lourenco J., Paton R., Ghafari M., Kraemer M., Thompson C., Simmonds P., et al. 2020. Fundamental principles of epidemic spread highlight the immediate need for large-scale serological surveys to assess the stage of the SARS-CoV-2 epidemic. medRxiv. [DOI] [Google Scholar]

[b22] Magal P., Webb G. The parameter identification problem for SIR epidemic models: Identifying unreported cases. Journal of Mathematical Biology. 2018;77:1629–1648. doi: 10.1007/s00285-017-1203-9. [DOI] [PubMed] [Google Scholar]

[b23] Magal P., Webb G. 2020. Predicting the number of reported and unreported cases for the COVID-19 epidemic in South Korea, Italy, France and Germany. medRxiv. arXiv:https://www.medrxiv.org/content/early/2020/03/24/2020.03.21.20040154.full.pdf. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b24] Maier B.F., Brockmann D. 2020. Effective containment explains sub-exponential growth in confirmed cases of recent COVID-19 outbreak in Mainland China. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b25] Mazenc F., Bernard O. Interval observers for linear time-invariant systems with disturbances. Automatica. 2011;47(1):140–147. [Google Scholar]

[b26] Mazenc F., Dinh T.N., Niculescu S.I. Interval observers for discrete-time systems. International Journal of Robust and Nonlinear Control. 2014;24:2867–2890. [Google Scholar]

[b27] Nussbaumer-Streit B., Mayr V., Dobrescu A., Chapman A., Persad E., Klerings I., et al. Quarantine alone or in combination with other public health measures to control COVID-19: A rapid review. Cochrane Database of Systematic Reviews. 2020;(4) doi: 10.1002/14651858.CD013574. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b28] Peng L., Yang W., Zhang D., Zhuge C., Hong L. 2020. Epidemic analysis of COVID-19 in China by dynamical modeling.arXiv:2002.06563 [Google Scholar]

[b29] Raïssi T., Efimov D., Zolghadri A. Interval state estimation for a class of nonlinear systems. IEEE Transactions on Automatic Control. 2012;57(1):260–265. [Google Scholar]

[b30] Ushirobira, R., Efimov, D., & Bliman, P. (2019). Estimating the infection rate of a SIR epidemic model via differential elimination. In 2019 18th European control conference (pp. 1170–1175).

[b31] Wang Z., Bauch C.T., Bhattacharyya S., d’Onofrio A., Manfredi P., Perc M., et al. Statistical physics of vaccination. Physics Reports. 2016;664:1–113. [Google Scholar]

[b32] Yang Z., Zeng Z., Wang K., Wong S.-S., Liang W., Zanin M., et al. Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions. Journal of Thoracic Disease. 2020;12(3) doi: 10.21037/jtd.2020.02.64. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

On an interval prediction of COVID-19 development based on a SEIR epidemic model

Denis Efimov

Rosane Ushirobira

Abstract

1. Introduction

2. Epidemic model and considerations

2.1. Societal feedback and confinement influence in the model

Remark 1

2.2. Model parameters

2.2.1. Generic observations

Fig. 1.

2.2.2. Known or accepted quantities

2.3. Uncertainty and prediction

Remark 2

3. Used dataset and associated parameters

Remark 3

3.1. Fixed values of parameters

3.2. Scenario of confinement

Fig. 2.

Remark 4

4. Parameter identification

4.1. Delay identification

4.1.1. Method 1

Remark 5

4.1.2. Method 2

4.2. Rates identification

Remark 6

5. Interval prediction

5.1. Explanation of idea

Lemma 1 Efimov & Raïssi, 2016 —

5.2. Equations of interval predictor and its properties

Theorem 1

Proof

Remark 7

6. Numerical results

Table 1.

Fig. 3.

Fig. 6.

Fig. 7.

Table 2.

6.1. Results of identification

6.2. Simulation and validation

Fig. 4.

Fig. 5.

6.3. Simulation and validation results of the interval predictor

Fig. 8.

Fig. 9.

Fig. 10.

Fig. 11.

Fig. 12.

Fig. 13.

Fig. 14.

7. Conclusion

Declaration of Competing Interest

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases