Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 Sep 14;101(3):1621–1634. doi: 10.1007/s11071-020-05946-3

Forecast analysis of the epidemics trend of COVID-19 in the USA by a generalized fractional-order SEIR model

Conghui Xu 1, Yongguang Yu 1, YangQuan Chen 2,, Zhenzhen Lu 1
PMCID: PMC7487266  PMID: 32952299

Abstract

In this paper, a generalized fractional-order SEIR model is proposed, denoted by SEIQRP model, which divided the population into susceptible, exposed, infectious, quarantined, recovered and insusceptible individuals and has a basic guiding significance for the prediction of the possible outbreak of infectious diseases like the coronavirus disease in 2019 (COVID-19) and other insect diseases in the future. Firstly, some qualitative properties of the model are analyzed. The basic reproduction number R0 is derived. When R0<1, the disease-free equilibrium point is unique and locally asymptotically stable. When R0>1, the endemic equilibrium point is also unique. Furthermore, some conditions are established to ensure the local asymptotic stability of disease-free and endemic equilibrium points. The trend of COVID-19 spread in the USA is predicted. Considering the influence of the individual behavior and government mitigation measurement, a modified SEIQRP model is proposed, defined as SEIQRPD model, which is divided the population into susceptible, exposed, infectious, quarantined, recovered, insusceptible and dead individuals. According to the real data of the USA, it is found that our improved model has a better prediction ability for the epidemic trend in the next two weeks. Hence, the epidemic trend of the USA in the next two weeks is investigated, and the peak of isolated cases is predicted. The modified SEIQRP model successfully capture the development process of COVID-19, which provides an important reference for understanding the trend of the outbreak.

Keywords: COVID-19, Fractional order, Generalized SEIR model, Epidemic, Peak prediction

Introduction

The outbreak of the coronavirus disease (COVID-19) in 2019 occurred in Wuhan, China, at the end of 2019. This is a severe respiratory syndrome caused by the novel coronavirus of zoonotic origin [1]. The Chinese government has implemented many measures, including the establishment of specialized hospitals and restrictions on travel, to reduce the spread. By April 20, 2020, the outbreak in China has been basically controlled. However, the outbreak is still rampant all over the world. At present, the USA, Italy, Spain and other countries are still in the rising stage of the outbreak. It has posed a great threat to the public health and safety of the world.

At present, many countries have adopted mitigation measures to restrict travel and public gatherings, which have a serious impact on the economy. Therefore, it is very important to predict the development trend of this epidemic and estimate the peak of the isolated cases. The epidemic model is a basic tool to research the dynamic behaviors of disease and predict the spreading trend of disease. Establishing a reasonable epidemic model can effectively characterize the development process of the disease. Ricardo [2] researched a fractional SEIR model with treatment and given some sufficient conditions to ensure the local stability of equilibrium points. Yang and Xu [3] considered a fractional-order SEIR model with general incidence rate and established the local and uniform asymptotic stability of equilibria. However, these studies remain at the theoretical stage and do not use real data to analyze the trend of specific epidemics. In the outbreak of COVID-19, three types of data are usually available: isolated cases, recovered cases and death cases. The traditional fractional SEIR model cannot describe the trend of isolated individuals and dead individuals caused by diseases. Hence, it difficult for the traditional SEIR model to use the information contained in isolated cases and death cases. Therefore, a generalized SEIR epidemic model is proposed in this paper and is defined as SEIQRP model. Some qualitative properties of this model are first analyzed, including the existence and uniqueness of the disease-free and the endemic equilibrium points. Then, conditions are also established to ensure the local asymptotic stability of both disease-free and endemic equilibrium points.

So far, many scholars have researched COVID-19 from different perspectives [1, 48]. In [9], the epidemics trend of COVID-19 in China was predicted under public health interventions. In [10], the basic reproduction number of COVID-19 in China was estimated and the data-driven analysis was performed in the early phase of the outbreak. Peng [11] proposed a generalized SEIR model to analyze the spread of COVID-19 in China. The model can describe the trends of isolated individuals, recovered individuals and dead individuals, and make full use of the information contained in the real data. However, the model does not take into account the infectivity of patients in the incubation period. As we all know, one of the important reasons that COVID-19 can cause a global outbreak is that the epidemic has a long incubation period. Hence, considering the infectivity of the patients in the incubation period is in line with the actual situation. In order to predict the cumulative number of confirmed cases and combine the actual measures taken by the government on the outbreak of COVID-19, we further put forward an improved SEIQRP model, denoted by SEIQRPD model. At present, we can obtain the epidemic data of COVID-19 outbreak in the USA before April 20, 2020. We use these data and the improved model to predict the epidemic trend of the USA in the next two weeks, and estimate the peak of isolated cases. Firstly, the data before April 5 were selected to identify the model parameters, and the prediction ability of the improved model with the epidemiological data from April 6 to April 20 is verified. Then, the cumulative number of confirmed cases and isolated cases are predicted in the next two weeks. The peak of isolated cases is thus predicted.

In recent years, with the continuous development of fractional calculus theory, fractional-order system modeling approaches have been applied in various engineering and non-engineering fields [1214]. Compared with the short memory of the integer derivative, the fractional derivative has the information of the whole time interval or long memory. It is more accurate to describe the biological behavior of population by using fractional differential equation.

The rest of this manuscript is structured as below. In Sect. 2.1, the fractional SEIQRP model and the modified SEIQRP model is proposed. Some qualitative properties of the SEIQRP model are discussed in Sect. 2.2. In Sect. 2.3, the prediction ability of the modified SEIQRP model is verified by using real data. The disease development in the USA in the next two weeks after April 21, 2020, is predicted. Finally, conclusion and some future works are discussed in Sect. 3.

Preliminaries and model derivation

Preliminaries

In this section, some useful lemmas and definitions will be given to analyze some results of this paper.

Definition 1

[15] The Caputo fractional-order derivative is given below

0CDtαf(t)=1Γ(n-α)0tf(n)(ξ)(t-ξ)1+α-ndξ,n-1<α<n,dndtnf(t),α=n,

where n is the smallest positive integer not less than α.

Definition 2

[15, 16] The Mittag–Leffler function is given below

Eα(z)=k=0zkΓ(kα+1),

where n-1αn.

Consider the following n-dimensional fractional-order differential equation system

0CDtαX(t)=AX(t),x(0)=x0, 1

where α(0,1), X(t)=(x1(t),x2(t),,xn(t))T is an n-dimensional state vector and A is an n×n constant matrix.

Lemma 1

[17] For the corresponding linear time-invariant system (1), the following results are true:

(i) The zero solution is asymptotically stable, if and only if all eigenvalues sj (j=1,2,,n) of A satisfy |arg(sj)|>απ2.

(ii) The zero solution is stable, if and only if all eigenvalues sj of A satisfy |arg(sj)|απ2 and eigenvalues with |arg(sj)|=απ2 have the same algebraic multiplicity and geometric multiplicity.

SEIQRP model

The outbreak of COVID-19 has had a great impact on the economic growth of any country and daily life of any human. In order to control and prevent the possible outbreak of infectious diseases like the COVID-19 or other insect diseases in the future, it is very important to establish an appropriate model. The transmission diagram of the generalized SEIR model proposed in this paper is shown in Fig. 1. We divide the total population into six distinct epidemic classes: susceptible, exposed, infectious, quarantined, recovered and insusceptible. We will represent the number of individuals at time t in the above classes by S(t), E(t), I(t), Q(t), R(t) and P(t), respectively. The specific explanations of the above six categories are as follows:

  • Susceptible S(t): the number of uninfected individuals at the time t.

  • Exposed E(t): the number of infected individuals at the time t, but still in incubation period (without clinical symptoms and low infectivity).

  • Infectious I(t): the number of infected individuals at the time t (with obvious clinical symptoms).

  • Quarantined Q(t): the number of individuals who have been diagnosed and isolated at the time t.

  • Recovered R(t): the number of recovered individuals at the time t.

  • Insusceptible P(t): the number of susceptible individuals who are not exposed to the external environment at the time t.

The incidence rate plays a very important role in the epidemic, which can describe the evolution of infectious disease. According to the spread of different diseases in different regions, there are many forms of incidence rate [1820]. We expect to establish a generalized incidence rate, which can contain most of these specific forms. Further, in order to match the dimensions of fractional-order model with respect to time t, Diethelm[21] pointed out that both the left-hand and the right-hand sides of fractional-order epidemic model must have the same dimensions. Inspired by the work of[3, 21, 22], the fractional-order SEIQRP model with general incidence rate is given by

0CDtαS(t)=Λ-β1αf1(S(t))g1(I(t))-β2αf2(S(t))g2(E(t))-μαS(t)-ραS(t),0CDtαE(t)=β1αf1(S(t))g1(I(t))+β2αf2(S(t))g2(E(t))-εαE(t)-μαE(t),0CDtαI(t)=εαE(t)-(δα+μα)I(t),0CDtαQ(t)=δαI(t)-(λα+κα+μα)Q(t),0CDtαR(t)=λαQ(t)-μαR(t),0CDtαP(t)=ραS(t)-μαP(t), 2

where α(0,1), Λ is the inflow number of susceptible individuals, β1 and β2 denote the infection rates of the infected individuals and the exposed individuals, respectively, ρ is the protection rate and ε represents the incubation rate. δ is the rate at which symptomatic infections are diagnosed and quarantined. λ and κ represent the cure rate of isolated individuals and the death rate caused by the disease, respectively, μ is the natural mortality, and the incidence rates β1f1(S)g1(I) and β2f2(S)g2(E) are used to describe the transmission of diseases, which satisfy the following conditions [23]:

Fig. 1.

Fig. 1

Transmission diagram for the model involving six population classes

(i) f1(0)=f2(0)=g1(0)=g2(0)=0,

(ii) f1(S)>0, f2(S)>0, g1(I)>0, g2(E)>0, for any S, E, I>0,

(iii) f1(S)>0, f2(S)>0, g1(I)>0, g2(E)>0, for any S, E, I>0,

(iv) g1(I)-g1(I)I0, g2(E)-g2(E)E0, for any E, I>0.

Modified SEIQRP model

In the following, we will combine the actual situation and government mitigation policy to improve the SEIQRP model (2). Firstly, during the outbreak of COVID-19, we need to make reasonable assumptions according to the actual mitigation policies and circumstances.

  • During the COVID-19 outbreak, population mobility is strictly controlled by many countries. In particular, the policy of city closure was implemented in Hubei Province, China. Therefore, the impact of migration will not be considered in the improved model.

  • For the prediction of this short-term virus transmission, the impact of natural mortality will not be considered. It is assumed that novel coronavirus is the only cause of death during the outbreak.

  • In order to predict the trend of cumulative confirmed cases, it is necessary to simulate the number of death cases. Therefore, a new class D(t) to the SEIQRP model will be added in the modified model, which denotes the number of death cases at time t.

  • The time-varying cure rate, mortality rate and infection rate will be applied to the improved model. This can better simulate the impact of the improvement of medical conditions and government control on individuals in reality.

Based on the above assumptions and analysis, the following improved model for COVID-19 is proposed.

0CDtαS(t)=-β1α(t)f1(S(t))g1(I(t))-β2αf2(S(t))g2(E(t))-ραS(t),0CDtαE(t)=β1α(t)f1(S(t))g1(I(t))+β2αf2(S(t))g2(E(t))-εαE(t),0CDtαI(t)=εαE(t)-δαI(t),0CDtαQ(t)=δαI(t)-(λα(t)+κα(t))Q(t),0CDtαR(t)=λα(t)Q(t),0CDtαP(t)=ραS(t),0CDtαD(t)=κα(t)Q(t), 3

where β1(t)=σ1exp(-σ2t), λ(t)=λ1(1-exp(λ2t)) and κ(t)=κ1exp(-κ2t). The parameters σ1, σ2, λ1, λ2, κ1 and κ2 are all positive, where σ is used to simulate the intensity of government control. It should be emphasized that the protection rate ρ for susceptible individuals also reflects the intensity of government control.

Qualitative analysis of the SEIQRP model

The existence and uniqueness of equilibrium point

Obviously, the right-hand side of system (2) satisfies the local Lipschitz condition; then, there exists a unique solution of system (2) [15, 24].

Theorem 1

The solutions of system (2) are nonnegative, and the closed set Ω={(S,E,I,Q,R,P)R+6:0S+E+I+Q+R+DΛμα} is a positive invariant set of system (2).

Proof

In order to investigate the non-negativity of solutions of system (2), we consider the following system

0CDtαS1(t)=-β1αf1(S(t))g1(I(t))-β2αf2(S(t))g2(E(t))-μαS(t)-ραS(t),0CDtαE1(t)=β1αf1(S(t))g1(I(t))+β2αf2(S(t))g2(E(t))-εαE(t)-μαE(t),0CDtαI1(t)=εαE(t)-(δα+μα)I(t),0CDtαQ1(t)=δαI(t)-(λα+κα+μα)Q(t),0CDtαR1(t)=λαQ(t)-μαR(t),0CDtαP1(t)=ραS(t)-μαP(t), 4

where the initial conditions are S10=0, E10=0, I10=0, Q10=0, R10=0 and P10=0. (0, 0, 0, 0, 0, 0) is the unique solution of the above system. According to the fractional-order comparison theorem [25], one can deduce that the solutions of system (2) satisfy S(t)0, E(t)0, I(t)0, Q(t)0, R(t)0 and P(t)0. By adding six equations of system (2), one can deduce

0CDtαN(t)=Λ-μαN(t)-καQ(t)Λ-μαN(t).

By applying the fractional-order comparison theorem, one has

N(t)(-Λμα+N(0))Eα(-μαtα)+Λμα.

When N(0)Λμα, since Eα(-μαtα)0, we have

S(t)+E(t)+I(t)+Q(t)+R(t)+P(t)Λμα.

Thus, we can draw the result of Theorem 1.

System (2) has an obvious disease-free equilibrium point M0=(S0,0,0,0,0,P0), where

S0=Λμα+ρα,P0=ραΛμα(μα+ρα).

In order to obtain the endemic equilibrium point of system (2), we set:

Λ-β1αf1(S(t))g1(I(t))-β2αf2(S(t))g2(E(t))-μαS(t)-ραS(t)=0,β1αf1(S(t))g1(I(t))+β2αf2(S(t))g2(E(t))-εαE(t)-μαE(t)=0,εαE(t)-(δα+μα)I(t)=0,δαI(t)-(λα+κα+μα)Q(t)=0,λαQ(t)-μαR(t)=0,ραS(t)-μαP(t)=0, 5

which implies

S=Λ-(εα+μα)Eμα+ρα,I=εαEσα+μα.

Combining the above equations and the second equation of (5), one has

(μα+εα)E=β1αf1Λ-(μα+εα)Eμα+ραg1εαEμα+δα+β2αf2Λ-(μα+εα)Eμα+ραg2(E). 6

Define

φ(E)=β1αf1Λ-(μα+εα)Eμα+ραg1εαEμα+δα+β2αf2Λ-(μα+εα)Eμα+ραg2(E)-(μα+εα)E. 7

Note that φ(0)=0 and φ(Λμα+εα)=-Λ<0. In order to show that φ(E)=0 has at least one positive root in the interval (0,Λμα+εα], we need to prove that φ(0)>0. Hence,

φ(E)=-(μα+εα)β1αμα+ραf1Λ-(μα+εα)Eμα+ραg1εαEμα+δα+εαβ1αμα+δαf1Λ-(μα+εα)Eμα+ραg1εαEμα+δα-(μα+εα)β2αμα+ραf2Λ-(μα+εα)Eμα+ραg2(E)+β2αf2Λ-(μα+εα)Eμα+ραg2(E)-(μα+εα). 8

Therefore, we have

φ(0)=εαβ1αμα+δαf1(S0)g1(0)+β2αf2(S0)g2(0)-(μα+εα)=(μα+εα)(R0-1),

where the basic reproduction number is given by

R0=εαβ1α(μα+εα)(μα+δα)f1(S0)g1(0)+β2αμα+εαf2(S0)g2(0). 9

If R0>1, system (2) has at least one endemic equilibrium point M=(S,E,I,Q,R,P), where

S=Λ-(μα+εα)Eμα+ρα,I=εαEμα+δα,Q=δαIμα+λα+κα,R=λαQμα,P=ραSμα.

In the following section, we show that endemic equilibrium point M is unique. According to the above analysis and hypothesis (i)–(iii), one has

φ(E)=-(μα+εα)β1αμα+ραf1(S)g1(I)+εαβ1αμα+δαf1(S)g1(I)-(μα+εα)β2αμα+ραf2(S)g2(E)+β2αf2(S)g2(E)-(μα+εα)=-(μα+εα)β1αμα+ραf1(S)g1(I)-(μα+εα)β2αμα+ραf2(S)g2(E)+εαβ1αμα+δαf1(S)g1(I)-εαβ1αf1(S)g1(I)(μα+δα)I+β2αf2(S)g2(E)-β2αf2(S)g2(E)E=-(μα+εα)β1αμα+ραf1(S)g1(I)-(μα+εα)β2αμα+ραf2(S)g2(E)+εαβ1αμα+δαf1(S)[g1(I)-g1(I)I]+β2αf2(S)[g2(E)-g2(E)E]. 10

By hypothesis (iv), this implies φ(E)<0. If there is another endemic equilibrium point M, then φ(E)0 holds, which contradicts the previous discussion. Hence, system (2) has a unique endemic equilibrium point M when R0>1. Based on the above analysis, the following results can be obtained.

Theorem 2

System (2) has a unique disease-free equilibrium point M0, if R0<1. System (2) has a unique endemic equilibrium point M, if R0>1.

Stability analysis

In this section, the local asymptotic stability of disease-free equilibrium point M0 and endemic equilibrium point M for system (2) is discussed.

Theorem 3

With regard to system (2), the disease-free equilibrium point M0 is locally asymptotic stability, if R0<1; the disease-free equilibrium point M0 is unstable, if R0>1.

Proof

The Jacobian matrix of system (2) at the disease-free equilibrium point M0 is

JM0=J11J12J21J22,

where

J11=-μα-ρα-h2-h10h2-μα-εαh10εα-μα-δα,
J12=000000000,J21=00δα000ρα00,
J22=-μα-κα-λα00λα-μα000-μα,

where h1=β1αf1(S0)g1(0) and h1=β2αf2(S0)g2(0). The corresponding characteristic equation is

H(s)=|sE-JM0|=(s+μα)2(s+μα+ρα)(s+μα+λα+κα)H1(s), 11

where

H1(s)=s2+(δα+2μα+εα-β2αf2(S0)g2(0))s+(μα+δα)(μα+εα)-β2α(μα+δα)f2(S0)g2(0)-εαβ1αf1(S0)g1(0). 12

The characteristic equation H(s)=0 has four obvious negative characteristic roots, which are denoted by s1=s2=-μα, s3=-μα-ρα and s4=-μα-λα-κα, respectively. The discriminant of H1(s) in quadratic form is

Δ=[δα+2μα+εα-β2αf2(S0)g2(0)]2-4[(μα+δα)(μα+εα)-β2α(μα+δα)f2(S0)g2(0)-εαβ1αf1(S0)g1(0)]=(δα-εα+β2αf2(S0)g2(0))2+4εαβ1αf1(S0)g1(0)>0.

This implies that the other two eigenvalues s5 and s6 of characteristic equation H(s)=0 are real roots. Hence,

s5+s6=-(δα+2μα+εα-β2αf2(S0)g2(0)),s5s6=(μα+δα)(μα+εα)(1-R0).

If R0<1, then one can obtain s5+s6<0 and s5s6>0, which imply s5<0 and s6<0. If R0>1, one has s5s6<0, which imply s5>0 or s6>0. It follows from Lemma 1 that the proof is completed.

Further, we will show the locally asymptotic stability of the endemic equilibrium point M of system (2). Similarly, the corresponding Jacobian matrix of system (2) at M is

JM=J1J2J3J4,

where

J1=-l3-l4-μα-ρα-l2-l1l3+l4l2-εα-μαl10εα-δα-μα,
J2=000000000,J3=00δα000ρα00,
J4=-μα-λα-κα00λα-μα000-μα,

with

l1=β1αf1(S)g1(I),l2=β2αf2(S)g2(E),l3=β1αf1(S)g1(I),l4=β2αf2(S)g2(E).

Hence, the corresponding characteristic equation is

L(s)=|sE-JM|=(s+μα)2(s+μα+λα+κα)L1(s), 13

where

L1(s)=s3+a1s2+a2s+a3,

with

a1=εα+3μα+δα+ρα-l2+l3+l4,a2=(μα+εα-l2)(2μα+δα+ρα)+(μα+δα)(μα+ρα)+(2μα+δα+εα)(l3+l4)-εαl1,a3=(μα+δα)(μα+ρα)(μα+εα-l2)+(μα+δα)(μα+εα)(l3+l4)-εα(μα+ρα)l1.

By the corresponding results in [26, 27], let

D1(L1(s))=1a1a2a3001a1a2a332la1a200032a1a200032a1a2=18a1a2a3+a12a22-4a13a3-4a23-27a32. 14

Then, the following result can be obtained.

Theorem 4

With regard to system (2), assume that R0>1,

  • (i)

    If D1(L1(s))>0, a1>0, a3>0 and a1a2>a3, then the endemic equilibrium point M is locally asymptotically stable.

  • (ii)

    If D1(L1(s))<0, a1>0, a2>0 and a3>0, then the endemic equilibrium point M is locally asymptotically stable for α(0,23).

  • (iii)

    If D1(L1(s))<0, a1<0 and a2<0, then the endemic equilibrium point M is unstable for α(23,1).

  • (iv)

    If D1(L1(s))<0, a1>0, a2>0 and a1a2=a3, then for α(0,1), the endemic equilibrium point M is locally asymptotically stable.

Proof

Based on the previous discussion, the characteristic equation L(s)=0 has three obvious negative roots s1=s2=-μα and s3=-μα-λα-κα. In order to investigate the stability of equilibrium point M, we only need to discuss the range of the root of L1(s)=0, denoted by s4, s5 and s6.

(i) By the results in [27], if D1(L1(s))>0, then s4, s5 and s6 are real roots. Further, by Routh–Hurwitz criterion, the necessary and sufficient conditions for si(i=4,5,6) to lie in the left half plane are

a1>0,a3>0,a1a2>a3.

That is to say, under the above conditions, the roots of L1(s)=0 satisfy

|arg(si)|>π2>απ2(i=4,5,6).

Therefore, M is locally asymptotically stable and (i) holds

(ii) By the results in [27], if D1(L1(s))<0, then L1(s)=0 has a real root and a pair of conjugate complex roots, denoted by s4, m+ni and m-ni, respectively. Thus, one has

L1(s)=s3+a1s2+a2s+a3=(s-s4)(s-m-ni)(s-m+ni).

By calculation,

a1=-s4-2m,a2=2s4m+m2+n2,a3=-s4(m2+n2).

The conditions a1>0, a2>0 and a3>0 imply that

-s1>2m,m2+n2>-2s4m,s4<0.

Further, one has

m2+m2tan2θ>-2s4m>4m2.

That is to say tan2θ>3, which implies

θ=|arg(s)|>π3.

Therefore, in order to ensure the establishment of |arg(s)|>απ2, we must have α<23. Thus, (ii) holds.

The proof of conclusions (iii) and (iv) is similar to that of conclusion (ii); hence, we omit it.

Numerical simulations

Data sources

The data used in this paper are from the Johns Hopkins University Center for Systems Science and Engineering (https://github.com/CSSEGISandData/COVID-19). the Johns Hopkins University publishes data of accumulated and newly confirmed cases, recovered cases and death cases of COVID-19 from January 22, 2020.

Analysis of the SEIQRP model

In the following discussion, the standard incidence rate [28] is used to describe the transmission of COVID-19 and is given by

β1αf1(S)g1(I)=β1αSIN,β2αf2(S)g2(E)=β2αSEN,

where N represents the total population of the region at the initial time. Hence,

0CDtαS(t)=Λ-β1αS(t)I(t)N-β2αS(t)E(t)N-μαS(t)-ραS(t),0CDtαE(t)=β1αS(t)I(t)N+β2αS(t)E(t)N-εαE(t)-μαE(t),0CDtαI(t)=εαE(t)-(δα+μα)I(t),0CDtαQ(t)=δαI(t)-(λα+κα+μα)Q(t),0CDtαR(t)=λαQ(t)-μαR(t),0CDtαP(t)=ραS(t)-μαP(t). 15

The effectiveness of model (15) in describing the spread of COVID-19 is illustrated by selecting the confirmed, cured and death cases in the USA. According to the real data provided by the Johns Hopkins University, the outbreak in the USA has not been brought under full control. The data of confirmed cases, recovered cases and death cases are selected from January 22, 2020, to April 20, 2020. Assuming that the confirmed individuals will be isolated, then

Isolated=Confirmed-Recovered-Death. 16

This hypothesis is in line with the actual situation. Hence, we can obtain the real data of isolated cases. Let eQ=Qr-Qm, eR=Rr-Rm and eD=Dr-Dm, where Qr, Rr and Dr denote the real data of isolated cases, recovered cases and death cases, respectively, Qm, Rm and Dm represent the measured value of isolated cases, recovered cases and death cases, respectively. Here, we consider a minimization objective function as:

Jmin=i=1T(eQ2(i)+eR2(i)+eD2(i)),

where T is the number of real data of isolated cases available. The internal function lsqcurvefit of MATLAB is used to minimize the above objective function value. Similar methods are adopted for subsequent parameter identification. Through the fractional predictor–corrector method [29, 30] and the least squares fitting [31], we can identify the parameters of the model (15) through the real data, which is given in Table 1. For more detail with respect to fractional predictor–corrector algorithm, one can refer to [29, 30].

Table 1.

Summary table of the parameter identification for model (15) after using least squares fitting to real data from January 22, 2020, to April 20, 2020

Notation Parameter identification
α 0.7937
Λ 433994
β1 0.2209
β2 0.7392
μ 5.2129×10-7
ρ 0.1224
ε 0.2467
δ 0.1031
λ 0.0024
κ 1.2932×10-5

Based on the parameters in Table 1, we can make a simple prediction of isolated cases and recovered cases in the USA, which is shown in Fig. 2. We need to emphasize that the peak here represents the number of isolated cases rather than the cumulative number of confirmed cases.

Fig. 2.

Fig. 2

Number of isolated cases predicted and recovered cases predicted by the model (15) for the USA

Using the parameters in Table 1, it can be calculated by using (9) that R0=0.0289<1. By the conclusion in Theorem 3, the disease-free equilibrium point is local asymptotic stable. The SEIQRP model has a basic guiding significance for predicting and fitting spreading dynamics of COVID-19. However, the prediction of this model for COVID-19 is relatively rough; we still need to improve model (15) according to actual mitigation policies and research objectives. According to the analysis in Sect. 2.1, we choose the SEIQRPD model to predict the trend of the epidemic in the USA under reasonable assumptions.

The SEIQRPD model for the prediction of COVID-19

Similarly, the standard incidence rate is used to describe the transmission of COVID-19, and the fractional SEIQRPD model can be expressed as

0CDtαS(t)=-β1α(t)S(t)I(t)N-β2αS(t)E(t)N-ραS(t),0CDtαE(t)=β1α(t)S(t)I(t)N+β2αS(t)E(t)N-εαE(t),0CDtαI(t)=εαE(t)-δαI(t),0CDtαQ(t)=δαI(t)-(λα(t)+κα(t))Q(t),0CDtαR(t)=λα(t)Q(t),0CDtαP(t)=ραS(t),0CDtαD(t)=κα(t)Q(t). 17

It should be emphasized that the numerical properties of model (17) are mainly studied. Different from the analysis of model (2) and model (15), the fractional-order α of model (3) and model (17) is not required to be within (0, 1). Therefore, in the parameter identification of model (17), the search range of parameter α is considered as (0, 2). When α=1, the fractional-order SEIQRPD model is similar to the integer-order model used in [24]. According to the data provided by Johns Hopkins University, by April 20, 2020, the outbreak in China has been basically controlled. In many provinces of China, the number of new cases per day is increasing in single digits. This means that the data in China contain more information about the spreading dynamics of COVID-19. Therefore, the data in Hubei, Guangdong, Hunan and Zhejiang are selected to research the fitting effect of the model (17). According to the real data of these four regions in China, the parameters of the model (17) are identified, and the results are shown in Table 2. The model (17) successfully capture the trend of the outbreak, which is shown in Fig. 3.

Table 2.

Summary of parameter identification of model (17) (data used from January 22, 2020, to April 20, 2020)

Notation Hubei Guangdong Hunan Zhejiang
α 1.0771 1.0315 1.0133 1.0688
σ1 0.6295 1.2773 0.7255 1.9997
σ2 5.59×10-4 0.4525 0.1139 1.2108
β2 0.2863 0.2675 0.3352 0.3226
ρ 0.1494 0.0581 0.0464 0.0052
ε 0.2781 0.3558 0.4649 0.4986
δ 0.2598 0.3838 0.6756 0.4097
λ1 0.5118 0.7336 0.987 0.9997
λ2 0.0033 0.0036 0.0038 0.0032
κ1 0.0031 0.0013 2.94×10-4 1.51×10-4
κ2 4.18×10-4 0.0911 4.09×10-14 4.12×10-14

Fig. 3.

Fig. 3

Fitting effect of the improved model (17) on the outbreak in Hubei, Guangdong, Hunan and Zhejiang

According to Definition 1, when α=1, the fractional-order system (17) becomes integer-order system. Tables 5 and 6, respectively, show the prediction results of cumulative confirmed cases and isolated cases by the integer-order system corresponding to model (17). By comparing Table 3 with Tables 5 and 4 with Table 6, we can find that the fractional system (17) has better prediction effect than its corresponding integer system.

Table 5.

Summary of real and predicted data for the cumulative confirmed cases in the USA from April 6, 2020, to April 20, 2020, when α=1

Data Cumulative confirmed cases Relative error (%)
Reported Predicted
6, April 366,667 372,660 1.63
7, April 397,505 408,046 2.65
8, April 429,052 444,475 3.59
9, April 462,780 481,813 4.11
10, April 496,535 519,923 4.71
11, April 526,396 558,665 6.13
12, April 555,313 597,901 7.67
13, April 580,619 637,494 9.8
14, April 607,670 677,308 11.46
15, April 636,350 717,213 12.71
16, April 667,592 757,082 13.4
17, April 699,706 796793 13.88
18, April 732,197 836,231 14.21
19, April 759,086 875,287 15.31
20, April 784,326 913,861 16.52
Average 9.19

Table 6.

Summary of real and predicted data for the isolated cases in the USA from April 6, 2020, to April 20, 2020, when α=1

Data Isolated cases Relative error (%)
Reported Predicted
6, April 336,303 341,929 1.67
7, April 362,948 372,162 2.54
8, April 390,798 402,864 3.09
9, April 420,826 433,871 3.1
10, April 449,159 465,022 3.53
11, April 474,664 496,156 4.53
12, April 500,306 527,114 5.36
13, April 513,609 557,741 8.59
14, April 534,076 587,888 10.08
15, April 555,929 617,412 11.06
16, April 579,973 646,178 11.42
17, April 604,388 674,059 11.53
18, April 628,693 700,937 11.49
19, April 648,088 726,706 12.13
20, April 669,903 751,268 12.15
Average 7.48

Table 3.

Summary of real and predicted data for the cumulative confirmed cases in the USA from April 6, 2020, to April 20, 2020

Data Cumulative confirmed cases Relative error (%)
Reported Predicted
6, April 366,667 370,997 1.18
7, April 397,505 404,778 1.83
8, April 429,052 438,986 2.32
9, April 462,780 473,397 2.29
10, April 496,535 507,790 2.27
11, April 526,396 541,952 2.96
12, April 555,313 575,680 3.67
13, April 580,619 608,786 4.85
14, April 607,670 641,099 5.5
15, April 636,350 672,464 5.68
16, April 667,592 702,748 5.27
17, April 699,706 731,837 4.59
18, April 732,197 759,637 3.75
19, April 759,086 786,076 3.56
20, April 784,326 811,103 3.41
Average 3.54

Table 4.

Summary of real and predicted data for the isolated cases in the USA from April 6, 2020, to April 20, 2020

Data Isolated cases Relative error (%)
Reported Predicted
6, April 336,303 340,153 1.14
7, April 362,948 368,742 1.6
8, April 390,798 397,193 1.64
9, April 420,826 425,264 1.05
10, April 449,159 452,719 0.79
11, April 474,664 479,335 0.98
12, April 500,306 504,905 0.92
13, April 513,609 529,239 3.04
14, April 534,076 552,168 3.39
15, April 555,929 573,547 3.17
16, April 579,973 593,252 2.29
17, April 604,388 611,186 1.12
18, April 628,693 627,274 0.23
19, April 648,088 641,467 1.02
20, April 669,903 653,736 2.41
Average 1.65

At present, we can obtain the epidemic data of the USA from January 22, 2020, to April 20, 2020. We need to preprocess the data to remove the data smaller than 0.5% of the current highest number of confirmed cases. In order to test the prediction ability of the SEIQRPD model (17) for the development process of the epidemic in the USA, we select the data before April 5 to identify the parameters of the SEIQRPD model (17). Furthermore, in order to illustrate the ability of the SEIQRPD model (17) in predicting the outbreak, we compared the real data and fitted data after April 5, which is given in Tables 3, 4 and Fig. 4. According to the results in Tables 3 and 4, it can be found that the real values of current isolated cases and cumulative confirmed cases fall within the range of 94–106% of the predicted values. This shows that the SEIQRPD model (17) can effectively predict the data in the next two weeks.

Fig. 4.

Fig. 4

Based on the data of the USA from January 22 to April 5, 2020, to verify the accuracy of the forecast for the next 15 days

The data before April 20, 2020, are selected to identify the parameters of the improved model (17), and the results are shown in Table 7. The cumulative number of confirmed cases and the number of isolated cases after two weeks are predicted, which is given in Table 8.

Table 7.

Summary table of the parameter identification for model (17) (data used from January 22, 2020, to April 20, 2020)

Notation Parameter identification
α 0.9806
σ1 0.9339
σ2 0.1958
β2 0.2654
ρ 4.8484×10-6
ε 0.2881
δ 0.4115
λ1 0.0076
λ2 0.2096
κ1 0.0079
κ2 0.0345

Table 8.

Summary of predicted data for the USA from April 21, 2020, to May 5, 2020

Data Cumulative confirmed cases Isolated cases
Predicted Predicted
21, April 804,417 684,573
22, April 829,428 702,336
23, April 853,882 719,428
24, April 877,785 735,862
25, April 901,144 751,654
26, April 923,967 766,819
27, April 946,265 781,374
28, April 968,047 795,334
29, April 989,322 808,715
30, April 1,010,102 821,533
1, May 1,030,397 833,805
2, May 1,050,217 845,546
3, May 1,069,574 856,771
4, May 1,088,477 867,495
5, May 1,106,937 877,733

The isolated cases in the USA will peak on June 18, 2020, with the peak of 1.0431×106, which is shown in Fig. 5. We must point out that the peak forecast here is only based on the data before April 20, 2020. It is only a rough estimate from the point of view of numerical simulation. The peak value will be greatly influenced by the policy and environment. The change of policy and environment makes the prediction of peak value very difficult. For the peak prediction, we will do further research in the future work.

Fig. 5.

Fig. 5

Based on the data of the USA from January 22 to April 20, 2020, to verify the accuracy of the forecast for the next 15 days

Conclusion

We first propose the fractional SEIQRP model with generalized incidence rates. Some qualitative properties of the SEIQRP model are discussed. In order to predict COVID-19 effectively, we propose an improved SEIQRPD model according to the actual mitigation situation. According to the data of the USA before April 5, 2020, the trend of the outbreak in the USA from April 6 to April 20 is successfully predicted as compared to the real records. Then, using the data before April 20, 2020, we forecast the trend of the outbreak in the USA in the next two weeks and estimate the peak of isolated cases and the date of the peak.

The improved SEIQRP model proposed in this paper successfully captures the trend of COVID-19. The long-term prediction needs to adjust the model appropriately according to the change of policy and medical level. We will discuss in the future work.

Acknowledgements

The plots in this paper were plotted using the plot code adapted from [31].

Compliance with ethical standards

Conflict of interest

We declare that we have no conflict of interest.

Footnotes

This work is supported by the National Nature Science Foundation of China under Grant 61772063, Beijing Natural Science Foundation under Grant Z180005.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Lin QY, Zhao S, Gao DZ, Luo YJ, Yang S, Musa SS, Wang MH, Cai YL, Wang WM, Yang L, He DH. A conceptual model for the coronavirus disease 2019 (COVID-19) outbreak in Wuhan, China with individual reaction and governmental action. Int. J. Infect. Dis. 2020;93:211–216. doi: 10.1016/j.ijid.2020.02.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ricardo A. Analysis of a fractional SEIR model with treatment. Appl. Math. Lett. 2018;84:56–62. doi: 10.1016/j.aml.2018.04.015. [DOI] [Google Scholar]
  • 3.Yang Y, Xu LG. Stability of a fractional order SEIR model with general incidence. Appl. Math. Lett. 2020;105:106303. doi: 10.1016/j.aml.2020.106303. [DOI] [Google Scholar]
  • 4.Peng, L., Yang, W., Zhang, D., Zhuge, C., Hong, L.: Epidemic analysis of COVID-19 in China by dynamical modeling. Cold Spring Harbor Laboratory (2020). arXiv:2002.06563
  • 5.Amjad, S.S., Iqbal, N.S., Kottakkaran, S.N.: A mathematical model of COVID-19 using fractional derivative: outbreak in India with dynamics of transmission and control (2020). 10.20944/preprints202004.0140.v1. [DOI] [PMC free article] [PubMed]
  • 6.Chen, Y., Cheng, J., Jiang, X., Xu, X.: The reconstruction and prediction algorithm of the fractional TDD for the local outbreak of COVID-19 (2020). arXiv:2002.10302
  • 7.Cheng ZJ, Shan J. 2019”Cnovel coronavirus: where we are and what we know. Infection. 2020 doi: 10.1007/s15010-020-01401-y. [DOI] [Google Scholar]
  • 8.Lu, Z.Z., Yu, Y.G., Chen, Y.Q., Ren, G.J., Xu, C.H., Yin, Z.: A fractional-order SEIHDR model for COVID-19 with inter-city networked coupling effects. Nonlinear Dynamics (Special Issue on ‘Nonlinear dynamics of COVID-19 pandemic: modeling, control, and future perspectives‘) (2020). arXiv:2004.12308 [DOI] [PMC free article] [PubMed]
  • 9.Yang ZF, Zeng ZQ, Wang K, Wong S, Liang WH, Zanin M, Liu P, Cao XD, Gao ZQ, Mai ZT, Liang JY, Liu XQ, Li SY, Li YM, Ye F, Guan WJ, Yang YF, Li F, Luo SM, Xie YQ, Liu B, Wang ZL, Zhang SB, Wang YN, Zhong NS, He JX. Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions. J. Thorac. Dis. 2020;12(2):165–174. doi: 10.21037/jtd.2020.02.64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Zhao S, Lin Q, Ran J, Musa SS, Yang GP, Wang WM, Lou YJ, Gao DZ, Yang L, He DH, Wang MH. Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: a data-driven analysis in the early phase of the outbreak. Int. J. Infect. Dis. 2020;92:214–217. doi: 10.1016/j.ijid.2020.01.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Peng, L.R., Yang, W.Y., Zhang, D.Y., Zhuge, C.J., Hong, L.: Epidemic analysis of COVID-19 in China by dynamical modeling (2020). arXiv:2002.06563
  • 12.Sun HG, Zhang Y, Baleanu D, Chen W, Chen YQ. A new collection of real world applications of fractional calculus in science and engineering. Commun. Nonlinear Sci. Numer. Simul. 2018;59(5):213–231. doi: 10.1016/j.cnsns.2018.04.019. [DOI] [Google Scholar]
  • 13.Cao, K.C., Chen, Y.Q.: Fractional order crowd dynamics: cyber-human systems modeling and control. (Invited book project. Volume 4 of the De Gryuter Monograph Series ‘Fractional Calculus in Applied Sciences and Engineering‘). ISBN 978-3-11-047398-8
  • 14.West BJ. Fractional Calculus View of Complexity: Tomorrow’s Science. Boca Raton: CRC Press; 2015. [Google Scholar]
  • 15.Podlubny I. Fractional Differential Equations. New York: Academic Press; 1999. [Google Scholar]
  • 16.Li Y, Chen YQ, Podlubny I. Mittag–Leffler stability of fractional order nonlinear dynamic systems. Automatica. 2009;45(8):1965–1969. doi: 10.1016/j.automatica.2009.04.003. [DOI] [Google Scholar]
  • 17.Diethelm K. The Analysis of Fractional Differential Equations: An Application-Oriented Exposition Using Differential Operators of Caputo Type. New York: Springer; 2010. [Google Scholar]
  • 18.Kuniya T. Hopf bifurcation in an age-structured SIR epidemic model. Appl. Math. Lett. 2019;92:22–28. doi: 10.1016/j.aml.2018.12.010. [DOI] [PubMed] [Google Scholar]
  • 19.Zhang XB, Huo H, Xiang HF, Xiang H, Meng XY. An SIRS epidemic model with pulse vaccination and non-monotonic incidence rate. Nonlinear Anal. Hybrid Syst. 2013;8:13–21. doi: 10.1016/j.nahs.2012.08.001. [DOI] [Google Scholar]
  • 20.Cai Y, Kang Y, Wang W. A stochastic SIRS epidemic model with nonlinear incidence rate. Appl. Math. Comput. 2017;305:221–240. [Google Scholar]
  • 21.Diethelm K. A fractional calculus based model for the simulation of an outbreak of dengue fever. Nonlinear Dyn. 2013;71:613–619. doi: 10.1007/s11071-012-0475-2. [DOI] [Google Scholar]
  • 22.Almeida R. Analysis of a fractional SEIR model with treatment. Appl. Math. Lett. 2018;84:56–62. doi: 10.1016/j.aml.2018.04.015. [DOI] [Google Scholar]
  • 23.Yang Y, Xu L. Stability of a fractional order SEIR model with general incidence. Appl. Math. Lett. 2020;105:106303. doi: 10.1016/j.aml.2020.106303. [DOI] [Google Scholar]
  • 24.Jalilian Y, Jalilian R. Existence of solution for delay fractional differential equations. Mediterr. J. Math. 2013;10(4):1731–1747. doi: 10.1007/s00009-013-0281-1. [DOI] [Google Scholar]
  • 25.Wang H, Yu YG, Wen GG, Zhang S, Yu JZ. Global stability analysis of fractional-order Hopfield neural networks with time delay. Neurocomputing. 2015;154:15–23. doi: 10.1016/j.neucom.2014.12.031. [DOI] [Google Scholar]
  • 26.Ahmed E, Elgazzar AS. On fractional order differential equations model for nonlocal epidemics. Physica A. 2007;379(2):607–614. doi: 10.1016/j.physa.2007.01.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Rocca A, West BJ. Fractional calculus and the evolution of fractal phenomena. Physica A. 1999;265(3–4):535–546. doi: 10.1016/S0378-4371(98)00550-0. [DOI] [Google Scholar]
  • 28.Hu Z, Liu S, Wang H. Backward bifurcation of an epidemic model with standard incidence rate and treatment rate. Nonlinear Anal. Real World Appl. 2008;9(5):2302–2312. doi: 10.1016/j.nonrwa.2007.08.009. [DOI] [Google Scholar]
  • 29.Bhalekar S, Gejji V. A predictor–corrector scheme for solving nonlinear delay differential equations of fractional order. J. Fract. Calc. Appl. 2011;1(5):1–9. [Google Scholar]
  • 30.Diethelm K, Ford NJ, Freed AD. A predictor–corrector approach for the numerical solution of fractional differential equations. Nonlinear Dyn. 2002;29:3–22. doi: 10.1023/A:1016592219341. [DOI] [Google Scholar]
  • 31.Cheynet, E.: Generalized SEIR epidemic model (fitting and computation). https://www.github.com/ECheynet/SEIR. GitHub. Accessed 26 Mar 2020

Articles from Nonlinear Dynamics are provided here courtesy of Nature Publishing Group

RESOURCES