Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Aug 24;140:110243. doi: 10.1016/j.chaos.2020.110243

Analytical solution of SEIR model describing the free spread of the COVID-19 pandemic

Nicola Piovella 1
PMCID: PMC7445013  PMID: 32863617

Abstract

We analytically study the SEIR (Susceptible Exposed Infectious Removed) epidemic model. The aim is to provide simple analytical expressions for the peak and asymptotic values and their characteristic times of the populations affected by the COVID-19 pandemic.

Keywords: COVID-19, SEIR, Nonlinear dynamics

1. Introduction

The COVID-19 outbreak has motivated a large number of numerical studies using epidemiology models [1], [2]. A commonly used model is the Susceptible-Exposed-Infected-Removed (SEIR) model [3]. This model is formulated as a system of nonlinear ordinary differential equations, for which no exact analytic solution has yet been found. For this reason, most of the recent works focus on the numerical analysis of statistical ensembles of initial data for these equations. However, due to the uncertainty and often unreliability of the clinical data, the prediction about the real evolution of the epidemic is rather difficult, if not impossible [4], [5]. On the other hand, the SEIR epidemic model provides a deterministic evolution for some given initial state. Therefore, the aim of this work is to provide simple expressions of the main characteristics of the population of individuals that have been in contact with the disease, as of instance the peak of the infected population and the time after which it occurs, the final number of individuals who have contracted the disease and the temporal shape of the infectious population’s curves. These analytical expressions can become useful through their application to the COVID-19, to obtain fundamental parameters as the reproduction number r and the epidemic starting time.

The paper is organized as follow, In Section 2 we recall the SEIR model; in Section 3 we study the linear regime with the exponential growing and decaying evolution, depending on the reproduction number r; in Section 4 we investigate the nonlinear regime in the free spread evolution with r > 1. We approximate the exact model of equations by a reduced model where the decaying mode is adiabatically eliminated. This reduced model allows to obtain analytical results which have been seen to be in good agreement with the exact numerical solution. Section 5 summarizes the results and draws the conclusions.

2. The SEIR model

We used the susceptible-exposed-infected-removed (SEIR) compartment model [3], [6], [7], [8] to characterize the early spreading of COVID-19, where each individual could be in one of the following states: susceptible (S), exposed (E, being infected but without infectiousness), infected (I, with infectiousness), recovered (R) and dead (D). At later times a susceptible individual in the state S would turn to be an individual in the exposed state E with a rate r/τI, where r is the reproduction number (i.e. the average number of infected people generated by each infected person during the desease) and τI=1/γ2 is the average time in the infected state I. An exposed individual in the state E becomes infected, i.e. in the state I in an average time τE=1/γ1. Then the infected individual is removed from the total population with the rate γ 2 either by recovering (R) or dying (D) with a mean case fatality proportion p. The dynamical process of SEIR is described by the following set of equations:

S˙=rγ2(SN)I, (1)
E˙=rγ2(SN)Iγ1E, (2)
I˙=γ1Eγ2I, (3)
R˙=(1p)γ2I, (4)
D˙=pγ2I. (5)

Here S(t), E(t), I(t), R(t) and D(t) respectively represent the number of individuals in the susceptible, exposed, infectious, recovered and death states at time t and N is the total number of individuals in the system such that N(t)=S(t)+E(t)+I(t)+R(t). Finally, the cumulative population C is

C=E+I+R+D, (6)

equal to the total population of individuals who have contracted the infection.

3. Linear regime

If E(t), I(t), R(t) ≪ N(t), then the susceptible population S can be approximated by the total population N (i.e. S ~ N) and the equations for the exposed and infected population are linear:

E˙=rγ2Iγ1E (7)
I˙=γ1Eγ2I (8)

3.1. General solution of the linear equations

Introducing the Laplace transforms

E˜(λ)=0E(t)eλtdtI˜(λ)=0I(t)eλtdt

with Reλ > 0, Eqs. (7) and (8) becomes

(λ+γ1rγ2γ1λ+γ2)(E˜I˜)=(E(0)I(0)) (9)

where E(0) and I(0) are the initial conditions. The eigenvalues λ are solution of

|λ+γ1rγ2γ1λ+γ2|=0 (10)

giving

det(λ)=λ2+(γ1+γ2)λ+γ1γ2(1r)=0 (11)

with solutions

λ±=γ1+γ22±12Δ (12)

where

Δ=(γ1+γ2)2+4γ1γ2(r1)=(γ1γ2)2+4rγ1γ2 (13)

Since Δ > 0 the eigenvalues are real. Depending on r, we distinguish three cases:

  • (a)

    If r > 1 then Δ>γ1+γ2, so that λ+>0 and λ<0. The solution grows exponentially (explosive regime);

  • (b)

    If r < 1 then Δ<γ1+γ2, so that both λ+<0 and λ<0. The solution decays exponentially (relaxation regime);

  • (c)

    If r=1 then Δ=γ1+γ2, so that λ+=0 and λ=(γ1+γ2). The solution remains partially constant (marginally stable regime).

For the cases (a) and (b) the solution is

E(t)=1Δ{ΔE(0)cosh(Δt/2)+[(γ2γ1)E(0)+2rγ2I(0)]sinh(Δt/2)}e(γ1+γ2)t/2 (14)
I(t)=1Δ{ΔI(0)cosh(Δt/2)+[(γ1γ2)I(0)+2γ1E(0)]sinh(Δt/2)}e(γ1+γ2)t/2 (15)

whereas in the case (c) (r=1) the solution is

E(t)=12[E(0)+I(0)]+12[E(0)I(0)]e(γ1+γ2)t/2 (16)
I(t)=12[E(0)+I(0)]12[E(0)I(0)]e(γ1+γ2)t/2. (17)

3.2. Analysis

The only parameter which can be controlled by confinement measures is the reproduction number r. In the following we assume that for COVID-19 the characteristic times are τE=3.69 days and τI=3.48 days [9]. We consider the time evolution of the population E and I for r > 1, r=1 and r < 1, corresponding to the explosive, marginally stable and relaxation regimes, respectively.

3.2.1. Explosive regime

For r > 1 and λ+t1,

E(t)=12{E(0)+1Δ[(γ2γ1)E(0)+2rγ2I(0)]}eλ+t (18)
I(t)=12{I(0)+1Δ[(γ1γ2)I(0)+2γ1E(0)]}eλ+t (19)

with λ+>0.

3.2.2. Marginally stable regime

When r=1, in the asymptotic limit t ≫ τE, τI, E and I are constant,

E=I=12[E(0)+I(0)] (20)

and the death population grows linearly in time

D(t)=D(0)+pγ2γ1+γ2[I(0)E(0)]+12pγ2[E(0)+I(0)]t (21)

where D(0), E(0) and I(0) are the values taken at time when r starts to be r=1.

3.2.3. Relaxation regime

When r < 1, λ+ is negative and E and I tend to zero, whereas D tends to the following constant value,

D()=D(0)+p2γ2(1r){[γ1+γ2(1+2r)]I(0)+(γ2γ1)E(0)} (22)

where D(0), E(0) and I(0) are the values taken at time when r starts to be r < 1. Fig. 1 shows a typical temporal evolution of I(t) and D(t) starting with r > 1, then subsequently changed to r=1 and later on to a value r < 1. The regime is linear (i.e. with E, I ≪ N), the initial values are E(0)=10 and I(0)=0 and p=0.01. The red dashed line is for r=3 (explosive regime). The green dashed-dotted line is for r changed from r=3 to r=1 at t=20 (marginally stable regime) and the blue solid line is for r=3 until t=20, then r=1 between t=20 and t=30 and finally r=0.8 for t > 30 (relaxation regime). Notice the asymmetry of the curve of I(t) due to the different growing and decaying rates.

Fig. 1.

Fig. 1

Evolution of I(t) and D(t) with r=3 (red dashed line), r=1 after t=20 days (green dashed-dotted line) and r=0.8 after t=30 days (blue solid line). Initial conditions: E(0)=10. I(0)=0,N(0)=6.e7; p=0.01.

4. Nonlinear regime

In the following, we investigate the nonlinear regime with a constant reproduction number r > 1. This corresponds to a free spread of the infection, with an initial exponential growth of the exposed population E, and so also of I and D. The exponential growth stops when susceptible population S becomes sensibly less then the total number N of the individuals. This regime is similar to the saturation in a single-mode laser, where steady-state is reached when the gain of emitted photons equals the losses by the cavity [10]. Notice that

E+I+S+R+D=N0 (23)

is a constant of motion and N(t)=N0D(t). However, if p ≪ 1 we always have D ≪ N 0, so that with a good approximation we can approximate N by N 0. Introducing the removed population Q=R+D, we can eliminate S=N0(E+I+Q) using the constant of motion and obtain

E˙=rγ2(1E+I+QN0)Iγ1E (24)
I˙=γ1Eγ2I (25)
Q˙=γ2I (26)

We normalize the variables by N 0 defining x=E/N0, y=I/N0 and z=C/N0 where C=E+I+Q is the cumulative population, i.e. the total number of individuals who have contracted the infection. Then the equations become

x˙=rγ2(1z)yγ1x (27)
y˙=γ1xγ2y (28)
z˙=rγ2(1z)y (29)

These equations have a single steady-state solution (i.e. x˙=y˙=z˙=0) with x=y=0 (end of the epidemics) and z=z0 with 0 < z 0 < 1. This solution is stable if r0=r(1z0)<1. We see that the stability condition implies

z0>11r (30)

In Fig. 2 we plot E/N, I/N and C/N for r=1.5, p=0.01, τE=3.69 days, τI=3.48 days and initial conditions E(0)=10, I(0)=0, N(0)=6·107. We observe that C/N tends to a steady-state value of about 0.6, whereas the peak of I/N is about 0.03: it means that for these parameters the 60% of the total population has contracted the infection and the peak the infected population is about 3% of the total population. Note that these results are independent on p and depend only on τE, τI and r.

Fig. 2.

Fig. 2

Simulation with r=1.5 and p=0.01. Initial conditions: E(0)=10,I(0)=0,N=6·107. I/N (a) and C/N (b) vs. time from the numerical solution (solid black line) and from the analytic expressions, Eqs. (63) and (62) (dashed blue line). The time t is in units of days and τE=3.69 days, τI=3.48 days.

4.1. Reduced model

In this section we find an approximated analytic solution of Eqs. (27)(29) in the free spread evolution with r > 1. The idea is to adiabatically eliminate the decaying mode with negative eigenvalue λ. To this aim, it is convenient to write Eqs. (27)(29) in the basis of the eigenvalues λ ±. Writing again the linear Eqs. (7) and (8) in the form

ddt(xy)=(γ1rγ2γ1γ2)(xy) (31)

the normalized eigenvectors associated to the eigenvalues λ ± of Eq. (12) are

u±=1D±(γ2+λ±γ1) (32)

where

D±=γ12+(γ2+λ±)2 (33)

Hence, in the new basis

(xy)=(γ2+λ+D+γ2+λDγ1D+γ1D)(x¯y¯) (34)

and the inverse is

(x¯y¯)=(γ1Dγ2+λDγ1D+γ2+λ+D+)(xy) (35)

In the new basis Eqs. (27)(29) take the form:

x¯˙=λ+x¯rγ1γ2Δ(x¯+D+Dy¯)z (36)
y¯˙=λy¯+rγ1γ2Δ(DD+x¯+y¯)z (37)
z˙=rγ1γ2(x¯D++y¯D)(1z) (38)

Notice that as expected in the linear regime the dynamics of x¯ and y¯ are uncoupled. Now we consider the free spread regime with r > 1 such that λ+ is positive and λ is negative. If r1 is small, then |λ|λ+ and we can adiabatically eliminate the ’slave’ variable y¯. Neglecting y¯˙ in (37) we obtain

D+Dy¯rγ1γ2Δx¯zλ+(rγ1γ2/Δ)z (39)

which when inserted in Eqs. (36) and (38) yields

x¯˙=1λ+(rγ1γ2/Δ)z[λ+λ+rγ1γ2Δ(λ+λ)z]x¯ (40)
z˙=rγ1γ2D+(λλ+(rγ1γ2/Δ)z)x¯(1z) (41)

Since λ+λ=Δ and γ1γ2r=λ+λ[r/(1r)],

x¯˙=λ+1βz[1zk]x¯ (42)
z˙=λ+λkD+(1z)1βzx¯ (43)
D+Dy¯=βz1βzx¯ (44)

where k=(r1)/r and β=λ+/kΔ. Finally, the original variables are

y=γ1D+[x¯1βz] (45)
x=γ2γ1[1+λ+γ1(1zk)]y (46)

4.2. Analytical solution

Eqs. (42) and (43) may provide some analytical result. Rescaling the time as

τ=λ+kt (47)

and defining

s=λD+x¯ (48)

Eqs. (42) and (43) take the form:

dsdτ=(kz1βz)s (49)
dzdτ=(1z1βz)s (50)

In the limit β → 0 they have the form of Lotka-Volterra equations [11]. From them, dividing member by member, it results

dsdz=kz1z (51)

which when integrated yields

s=z+1rln|1z| (52)

where we assumed s → 0 when z → 0. On the other hand, s → 0 when z → z (see Fig. 3 ), where z is the solution of the transcendental equation

rz+ln|1z|=0 (53)

The same transcendental Eq. (53) for z has been obtained for the SIR compartmental model [12], [13]. Here we have demonstrated its validity also for the SEIR model.

Fig. 3.

Fig. 3

Plot of s vs. z for r=1.5,2,2.5,3.0, from Eq. (52).

We see from Fig. 3 that s=0 for z=0 and z=z. The maximum value of s occurs when z=k=11/r so that

smax=11r1rlnr (54)

These simple equations provide two analytic expressions for the asymptotic value of C/N and for the peak of I/N.

Let’s now find an approximated solution of z as a function of the scaled time τ. Using Eq. (52) in Eq. (50) we obtain a differential equation for z:

dzdτ=1z1βz(z+1rln|1z|) (55)

From the numerical analysis and assuming βz ≪ 1, we find that z(τ) is well approximated by the following function:

z(τ)=z2{1+tanh[k(ττd)/2]}=zek(ττd)1+ek(ττd) (56)

where τd depends on the initial conditions. From (49) it follows for βz ≪ 1

dsdτ=[kz(τ)]s={kz2z2tanh[k(ττd)/2]}s (57)

This equation can be integrated to give

s(τ)=s(0){cosh[kτd/2]cosh[k(ττd)/2]}z/ke(kz/2)τ (58)

Since d ≫ 1 and, from Eqs. (52) and (56), s(0)kzexp(kτd), we can write Eq. (58) in the following form:

s(τ)=kz[sech[k(ττd)/2]2]z/ke(kz/2)(ττd) (59)

The time τmax at which s(τ) is maximum can be evaluated from the condition z(τmax)=k which, using Eq. (56), yields

τmax=τd+1kln[kzk] (60)

where τd=(1/k)ln[kz/s(0)]. For instance, for r=1.5 and s(0)=105, we obtain z=0.5828, smax=0.063 and τmax=31.25.

5. Results and conclusions

We have obtained analytical expressions for the asymptotic value of the cumulative population fraction C/N and the peak of the infectious population fraction I/N in the case of free spread evolution of COVID-19. Furthermore, we have obtained approximated expressions of these quantities as a function of time and the times at which the peak and the end of the epidemics is expected. We summarize here below these results:

  • (a)

    The asymptotic value of the cumulative population fraction is C/N=z, where z is the solution of the transcendental Eq. (53). A comparison between the exact solution obtained by integrating Eqs. (1)(5) and the solution of Eq. (53) is shown in Fig. 4 (a). Notice that this value depends only on the reproduction number r.

  • (b)
    The peak value of the infectious population fraction is, from Eqs. (45), (48) and (54),
    IpeakN=4γ1Δ(γ1+γ2+Δ)2[11r1rlnr] (61)
    The agreement of this expression with the exact result shown in Fig. 4(b) is better for values of r closer to the threshold r=1.
  • (c)
    We have obtained an approximated temporal profile of C(t)/N,
    C(t)N=z(t)=z2{1+tanh[λ+(ttd)/2]} (62)
    where td=(1/λ+)ln[kz/s0] and s0=(λ/D+D)[γ1x0(γ2+λ)y0], where x 0 and y 0 are the initial values of x and y. From this expression we have obtained the expression of I(t)/N as a function of time:
    I(t)N=(γ1λ)s(t)1βz(t) (63)
    where β=λ+/kΔ and
    s(t)=kz[sech[λ+(ttd)/2]2]z/keλ+(1z/2k)(ttd) (64)
    The good agreement of Eqs. (62) and (63) with the exact numerical solution of Eqs. (1)(5) is shown in Fig. 2.
  • (d)
    The time at which the peak of I/N is reached is
    tpeak=1λ+ln[k2zs0(zk)] (65)
    Fig. 5 shows t peak (in units of days) as a function of r for an initial value of E(0)=10, I(0)=0 and N(0)=6·107.

Fig. 4.

Fig. 4

(a): Plot of C/N vs. r, from the numerical solution of Eqs. (1)-(5) (dashed line) and from the analytical result of Eq. (53) (continuous line). The dotted red line is the threshold value k=11/r. (b) Peak value of I/N vs. r from the numerical solution of Eqs. (1)-(5) (dashed line) and from Eq. (61) (continuous line).

Fig. 5.

Fig. 5

Plot of peak time tpeak (in units of days) vs. r for initial values of E(0)=10 and I(0)=0,N(0)=6·107 and τE=3.69 days, τI=3.48 days.

These analytic expressions can be useful for deriving the uncertainty in the estimates of COVID-19 caused by the fluctuations of the values of the control parameters, as for instance the reproduction number r. In fact, the results of Ref. [4] suggest that uncertainties in both parameters and initial conditions rapidly propagate in the model and can result in different outcomes of the epidemics. For instance, Fig. 4a and b show the dependence of the fraction of the final cumulative fraction, C /N, and the daily infections peak, I peak/N, as a function of r. We observe that the sensitivity of C /N on r variations is larger when r is close to unity (with approximately C/N2(r1)) whereas it decreases for increasing values of r. On the other hand, I peak/N grows almost linearly with r (approximately as Ipeak/N0.07(r1)), so that its sensitivity to r variations is almost constant. Finally, the uncertainty of the peak time t peak (see Fig. 5) on r variations is very large for r close to unity and it reduces strongly at larger r.

In conclusions, we have obtained analytical expressions for the peak and asymptotic values of COVID-19 pandemic curves in the free spread as a function of the reproduction number and the two average times in the exposed and infected states. The results have been obtained by reducing the exact nonlinear model by adiabatically eliminating the decaying mode of the linear regime. This allows to reduce the SEIR model of a set of two equations similar to the Lotka-Volterra equations, from which exact and approximated solutions can be obtained. The analytical results have been compared with the exact numerical solution, showing good agreement. Particular interesting is the asymptotic fraction of the removed (recovered+deaths) population fraction, which depends only on the reproduction number r. Finally, the infected population curve is an almost symmetric function described by an hyperbolic secant function.

CRediT authorship contribution statement

Nicola Piovella: Conceptualization, Formal analysis, Writing - original draft.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  • 1.Rodriguez-Morales A.J., Cardona-Ospina J.A., Gutiérrez-Ocampo E., Villamizar-Peña R., Holguin-Rivera Y., Escalera-Antezana J.P. Clinical, laboratory and imaging features of COVID-19: a systematic review and meta-analysis. Travel Med Infect Dis. 2020:101623. doi: 10.1016/j.tmaid.2020.101623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Yang Z., Zeng Z., Wang K., Wong S.-S., Liang W., Zanin M. Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions. J Thorac Dis. 2020;12(3):165. doi: 10.21037/jtd.2020.02.64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kermack W.O., McKendrick A.G. A contribution to the mathematical theory of epidemics. Proc R Soc London SerA. 1927;115(772):700–721. [Google Scholar]
  • 4.Faranda D., Alberti T. 2020. Modelling the second wave of COVID-19 infections in France and Italy via a stochastic SEIR model. arXiv:200605081. [DOI] [PubMed]
  • 5.Bertozzi A.L., Franco E., Mohler G., Short M.B., Sledge D. 2020. The challenges of modeling and forecasting the spread of COVID-19. arXiv:200404741. [DOI] [PMC free article] [PubMed]
  • 6.Anderson R.M., May R.M. Oxford university press; 1992. Infectious diseases of humans: dynamics and control. [Google Scholar]
  • 7.Pastor-Satorras R., Castellano C., Van Mieghem P., Vespignani A. Epidemic processes in complex networks. Rev Mod Phys. 2015;87(3):925. [Google Scholar]
  • 8.Zhou T., Liu Q., Yang Z., Liao J., Yang K., Bai W. Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019-nCoV. J Evid Based Med. 2020;13(1):3–7. doi: 10.1111/jebm.12376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Li R., Pei S., Chen B., Song Y., Zhang T., Yang W. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2) Science. 2020;368(6490):489–493. doi: 10.1126/science.abb3221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Haken H. Light and matter Ic/Licht und materie Ic. Springer; 1970. Laser theory; pp. 1–304. [Google Scholar]
  • 11.Brauer F., Castillo-Chavez C., Castillo-Chavez C. vol. 2. Springer; 2012. Mathematical models in population biology and epidemiology. [Google Scholar]
  • 12.Harko T., Lobo F.S., Mak M. Exact analytical solutions of the susceptible-infected-recovered (SIR) epidemic model and of the SIR model with equal death and birth rates. Appl Math Comput. 2014;236:184–194. [Google Scholar]
  • 13.Miller J.C. A note on the derivation of epidemic final sizes. Bull Math Biol. 2012;74(9):2125–2141. doi: 10.1007/s11538-012-9749-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Chaos, Solitons, and Fractals are provided here courtesy of Elsevier

RESOURCES