Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2022 Jun 17:163–189. doi: 10.1016/B978-0-32-390504-6.00015-2

Analytical solutions and parameter estimation of the SIR epidemic model

Dimiter Prodanov a,b
Editors: Praveen Agarwalc,d,e, Juan J Nietof, Delfim FM Torresg
PMCID: PMC9212253

Abstract

The SIR (Susceptible-Infected-Removed) model is a simple mathematical model of epidemic outbreaks, yet for decades it evaded the efforts of the community to derive an explicit solution. The present manuscript surveys new analytical results about the SIR model. Recently, it was proven that the explicit solution of the model requires the introduction of a new transcendental special function, related to the Wright's Omega function and Lambert W function. Asymptotic methods and iterative numerical routines suitable for parametric estimation of the SIR model are discussed as well. The methods are exemplified with influenza and COVID-19 datasets.

Keywords: SIR model, special functions, Lambert W function, Wright Omega function

10.1. Introduction

The origin of the mathematical modeling of epidemics can be traced back to XVIIth century with the work of Daniel Bernoulli, who was trained as a physician, and whose results justified the practice of inoculation against smallpox. This represented a major milestone of public health. It is interesting to note that some subsequent major steps in the epidemiology of compartmental models were achieved not by pure mathematicians, but by public health physicians or biologists, such as Sir R.A. Ross, W.H. Hamer, A.G. McKendrick, and W.O. Kermack [1]. As a notable example of such progress, the SIR (Susceptible-Infected-Removed) epidemiological model was published in 1927 by Kermack and McKendrick in 1927 to study the plague and cholera epidemics in London and Bombay [2]. Even to date, the SIR model remains a cornerstone of mathematical epidemiology [3], [4]. It should be noted that the full model derived by Kermack and McKendrick was formulated in terms of convolution integrals, while the more popular form used in the present literature is actually only its simplified case. The full SIR model included dependence on the age of infection, that is, the time since an individual becomes infected. In this sense, the full model can be used to provide a unified approach to compartmental epidemic models. It is also worth mentioning that the SIR model in discrete time is equivalent to a renewal equation with a geometric generation-interval distribution with probability parameter γΔt [5]. This comes from the fact that the limiting distribution of the geometric probability distribution is the exponential one.

Outside epidemiology, the SIR model is also extensively used in modeling of online social networks, viral marketing, diffusion of ideas, spread of computer viruses, financial network contagion, etc. (see recent survey by Rodrigues and the references therein [6]).

The SIR model can be extended in two directions – either by adding a final state, e.g. “deceased” individuals – D; or by adding one or more intermediate nonobservable populations – e.g. “exposed” E individuals. Distinct possibilities include the SEIR and SEIS models, with an exposed period between getting infected and becoming infective, and SIRS models, with temporary immunity conveyed upon recovery from the initial infection.

The analytical solution of the SIR model was formulated for the S-variable in a parametric form in [7]. Recently, Barlow and Weinstein have introduced numerical solutions based on asymptotic rational approximants [8]. Prodanov established the solution in terms of numerical inversion of the parametric solution by Newton iteration for the I-variable [9]. The present chapter focuses on some recent theoretical developments around the SIR model from the perspective of real analysis and theory of special functions.

10.2. The SIR model

The full SIR model can be simplified under the assumptions that the rate of infection is constant and there is no demographic variation. The simplified SIR model is a deterministic model formulated in terms of ordinary differential equations (ODEs) in terms of 3 populations of individuals. The S population consists of all individuals susceptible to the infection of concern. The I population comprises the infected individuals. These persons have the disease and can transmit it to the susceptible individuals. The model can be schematically represented in the input-output diagram presented in Fig. 10.1 .

Figure 10.1.

Figure 10.1

The SIR model input-output diagram.

The standard assumptions behind the SIR model are:

  • 1.

    A typical member of the population S gets infected by a typical member of the I population with certain probability per unit time.

  • 2.

    Infected individuals recover at rate γI per unit time.

  • 3.

    The R population cannot become re-infected and the recovered individuals cannot transmit the disease to others.

  • 4.

    There is no entry into or departure from the overall population, except possibly through death from the disease.

It is also assumed that new cases are generated through homogeneous mixing, yielding the mass-action incidence term βIS, or the standard-incidence term βIS/N [3, Ch. 11]. Note that the dimensionality of β in the two cases is different. In the former case it is [T]1[N]1, while in the latter it is [T]1.

Model parameters can be interpreted as follows. A disease carrier infects on average β individuals per day, for an average time of 1/γ days. The β parameter is called disease transmission rate, while γrecovery rate. The average number of infections arising from an infected individual is then given by

R0=βγ,

called the basic reproduction number. Other useful quantities are the running reproduction number

R(t)=S(t)βNγ

and the herd immunity threshold p=11/R0. Typical initial conditions, modeling an outbreak, are S(0)=S0, I(0)=I0, R(0)=0 [2]. The simplified SIR model comprises a set of three ODEs:

S˙(t)=βS(t)NI(t) (10.1)
I˙(t)=βS(t)NI(t)γI(t) (10.2)
R˙(t)=γI(t) (10.3)

The model assumes a constant overall population N=S+I+R.

The temporal course of the I-variable has a characteristic exponential rise, turnover, and final slower exponential decline towards 0. The S-variable also has three phases – a slow decrease followed by rapid quasi-linear decline followed by slow exponential decline towards S, while the R-variable has a sigmoid shape. A phase diagram of the model is plotted in Fig. 10.2 .

Figure 10.2.

Figure 10.2

Phase plot of the (I, S) plane.

A – Case g=2, the trajectories starting at (4.6316, 1.9) and (6.16(6), 1.2) are indicated with dots. This corresponds to the phase plot given in [10, Fig. 1A] for α = 0.1, β = 0.2. B – Case g=12, the trajectories, starting at (13.81(81), 1.1) and (15.3(3), 0.6) are indicated with dots. This corresponds to the phase plot given in [10, Fig. 1B] for α = 0.1, β = 1.2.

The model can be re-parametrized using normalized variables as

s˙=si (10.4)
i˙=sigi,g=γβ=1R0 (10.5)
r˙=gi, (10.6)

subject to normalization s+i+r=1 and time rescaling τ=βt.

It is, in general, difficult to estimate the contact rate β, which depends on the particular disease and on social and behavioral factors [3]. On the other hand, different nonmedical interventions, such as wearing of personal protective equipment, contact restriction or quarantine, could offer some degree of control over β. The above formulation offers the advantage that the dynamics imposed by the contact rate β(t), viewed as a function of time, can be absorbed into the model by time rescaling

τ(t)=tβ(u)du

which for a causal step function obviously recovers τ=βt. Note that τ(t) becomes a dimensionless variable.

10.3. Second-order systems equivalent to SIR

The SIR model can be formulated also as several equivalent second-order nonlinear systems for the different variables. The readers are directed to the original results in [7], [10].

10.3.1. A second order differential equation for the i-variable

Recently, Kudryashov et al. gave an equivalent second order differential equation for the i-variable [10].

Proposition 10.1

The SIR system is reducible to the nonlinear differential equation for the i-variable:

i¨=gi2ii˙+i˙2i (10.7)

or the system

It˙=gI2IIt+It2I (10.8)
I˙=It (10.9)

Proof

From the conservation law s˙=i˙r˙=igi˙. Then from Eq. (10.4) s=g+i˙/i. Differentiating Eq. (10.5) and substituting Eq. (10.4)

i¨=gi˙+is˙+i˙s=gi˙i2s+si˙=gi˙+i˙(g+i˙i)i2(g+i˙i)=gi2+i˙2iii˙

 □

The advantage of this formulation is that the phase space manifold is parametrized only by a single parameter. Its phase portrait for values of g=2 and g=12 is shown in Fig. 10.3 for reference. The system admits elementary solutions i=i0egt, r=r0+i0(1egt) which correspond to s=0. This can be verified by direct substitution into Eq. (10.7).

Figure 10.3.

Figure 10.3

Phase plane portrait of the differential systems equations (10.8)(10.9).

A, B – (St,S) plot; C, D – (It,I) A – Case g=2, the trajectories starting at (4.6316, −8.8) and (6.1667, −7.4) are indicated with dots. C – Case g=2, the trajectories starting at (1.9, 5.0) and (1.2, 5.0) are indicated with dots. B – Case g=12, the trajectories, starting at (13.8182, −15.2) and (15.3333, −9.2) are indicated with dots. D – Case g=12, the trajectories, starting at (1.1, 2) and (0.6, 2.0) are indicated with dots. This corresponds to the phase plot given in [10, Fig. 1B] for α = 0.1, β = 1.2.

10.3.2. A second order differential equation for the s-variable

An equivalent second-order system for the s-variable was given in [7].

Proposition 10.2

The SIR system is reducible to the nonlinear differential equation for the s-variable

s¨=s˙2s+(sg)s˙

or the system

St˙=St2S+(Sg)St (10.10)
S˙=St (10.11)

Proof

Differentiating Eq. (10.4) gives

didt=s¨+is˙s=s˙2s2s¨s

and substituting i from Eq. (10.4). Substitution in Eq. (10.5) gives

s˙2s2s¨s=gs˙ss˙

from where the result follows. □

The following proposition is also formulated in [7].

Proposition 10.3

The SIR system is reducible to the nonlinear differential equation for the r-variable

r¨=gr˙(erg1)

or the system

Rt˙=gRteRggRt (10.12)
R˙=Rt (10.13)

Proof

Differentiating Eq. (10.6) yields r¨=g(sigi)=gsigr˙=gs˙gr˙. On the other hand, by Eq. (10.16) we have r=glogs+c. We use the fixed-point condition s0=s(0)=g so that s=ger/g+r0/g. Since the system is autonomous we can translate the origin to −∞, where r0=0 from where the result follows. □

10.4. Indeterminate analytical solution

This section develops the analytical solution in an implicit form. Since there is a first integral by construction the system can be reduced to any two of the three equations [3, Ch. 2]:

dids=1+gs (10.14)
didr=sg1 (10.15)
drds=gs (10.16)

From this formulation

Re=Ns0g=S0βγ1

must hold for the infection to propagate. Re is called the effective reproductive number, while the basic reproduction number is R0=ReN [11]. In order to solve the model we will consider the two equations separately. Direct integration of Eq. (10.14) gives

i=s+glogs+c (10.17)

where the indeterminate constant c can be determined from the initial conditions.

10.4.1. The s-variable

The s variable can be determined by substitution in Eq. (10.4), resulting in the autonomous system

s˙=s(s+glogs+c) (10.18)

which can be solved implicitly as

dss(sglogsc)=τ (10.19)

The s variable can be represented explicitly in terms of the Lambert W function.

s=gW±(eicgg) (10.20)

where the signs denote the two different real-valued branches of the function. Note, that both branches are of interest since the argument of the Lambert W function is negative.

10.4.2. The i-variable

Based on Eq. (10.20), Eq. (10.5) can be reduced to the first-order autonomous system

i˙=gi(W±(eicgg)+1) (10.21)

valid for two disjoined domains on the real line. This autonomous ODE can be solved for the rescaled time variable τ as

dii(W±(eicgg)+1)=gτ (10.22)

10.4.3. The r-variable

Finally, the r variable can also be conveniently expressed in terms of i. For this purpose we solve the differential equation

drdi=gsg=11+W±(eicgg)

Therefore,

r=c1glog(gW±(eicgg))=c1glogs

by Proposition 10.5. On the other hand,

glog(gW(eicgg))=g(log(eicg)W(eicgg))=icgW(eicgg)=s+ic

So that

r=gW(eicgg)i+c+c1 (10.23)

Since (s0,0,0) is a stable point it follows that

c+c1=gW(ecgg) (10.24)

Therefore,

r=gW(eicgg)gW(ecgg)i (10.25)

Proposition 10.4

i(t) attains a global maximum im=cgloggg .

Proof

We use a parametrization for which i(0)=im. Then

i˙(τ)=gi(W±(eiimg1)+1)

It follows that i˙(0)=0, i(0)=im so im is an extremum. In the most elementary way since W(z) should be real-valued then

eiimg11/eiimg0

Hence, iim. □

10.5. Inverse parametric solution

The implicit solution Eq. (10.22) can be computed as the definite integral

gτ(i)=c+glogggidyy(W±(eycgg)+1) (10.26)

which requires computation of the Lambert W function on every integration step. However, this does not seem to be efficient. Alternatively the solution can be computed more efficiently by change of variables by Proposition 10.6:

gτ(i)=gggW±(eicgg)dyy(glogyy+c) (10.27)

On the other hand, from Eq. (10.19) it follows that

τ(s)=gsdyy(glogyy+c) (10.28)

where the domain of s is [gW+(ec/g/g),gW(ec/g/g)].

The τ-variable can also be expressed as function of r. We differentiate Eq. (10.23) by s. From where

gt=drdsdt=dr1s(sglogsc)|s=e(c1r)/g=erc1gdr(r(c1+c))erc1g+1

where c1=gW(ec/g/g)c by Eq. (10.24). Therefore,

gτ(r)=qcgloggreygdy(yq)eyg+q,q=gW(ec/g/g) (10.29)

for the c-parametrization and the domain of r is

[0,gW+(ec/g/g)gW(ec/g/g)]

Therefore, the following theorem can be formulated.

Theorem 10.1 Inverse parametric solution —

The inverse parametric solution of the SIR model is given by the integrals

τ(s)=gsdyy(glogyy+c)gτ(i)=gggW±(eicgg)dyy(glogyy+c)=c+glogggidyy(W±(eycgg)+1)gτ(r)=qcgloggreygdy(yq)eyg+q,q=gW(ec/g/g)

The solutions are plotted in Fig. 10.4 . A similar solution for τ(r) was obtained also in [12]. Interestingly, the authors derive an infinite series for τ(r) in terms of upper incomplete Gamma functions.

Figure 10.4.

Figure 10.4

Parametric plots of the SIR model.

A – i-variable from equation im-parametrization with parameters g= 1, im = 6.5; B – s and r-variables with the same parameters. C – i-variable from equation im-parametrization with parameters g=2.0, im = 6.5; D – s and r-variables with the same parameters. The negative branch is on the negative axis.

10.5.1. Peak value parametrization

The upper terminal of integration can be determined by the requirement for the real-valuedness of i. This value of i is denoted as im; that is

W±(eimcgg)=1

Therefore,

im=c+gloggg (10.30)

The peak-value parametrization is supported by Proposition 10.4.

If we consider formally the phase space (z×y=gz(W±(ezimg1)+1)) the following argument allows for the correct branch identification. For i W(ezimg1) so y<0; while W+(ezimg1)0+ so y>0. Therefore, if we move the origin as t(0)=im then conveniently

imidzz(W+(ezimg1)+1)=gτ,τ>0 (10.31)
imidzz(W(ezimg1)+1)=gτ,τ0 (10.32)

Furthermore, the recovered population under this parametrization is

r=gW±(eiimg1)gW(eimg1)i (10.33)

under the same choice of origin.

10.5.2. Initial value parametrization

As customarily accepted, the SIR model can be recast as an initial value problem. In this case, the indeterminate constant c can be eliminated using the initial condition

i0=s0+glogs0+c

Therefore,

i=i0+s0s+glogs/s0=1s+glogs1i0 (10.34)

For this case, the following autonomous differential equation can be formulated:

i˙=gi(W±(1i0gei1g)+1) (10.35)

This can be solved implicitly by separation of variables as

i0idzz(W(1i0gez1g)+1)=gτ,ττm (10.36)
i0idzz(W+(1i0gez1g)+1)=gτ,τ>τm (10.37)

It is noteworthy that the time to the peak of infections τm can be calculated as [9]:

τm=0logg/s0dus0eugu(s0+i0)

The result follows by considering the autonomous system (10.14) and fixing the upper terminal of integration s=g. However, by Proposition 10.7 this definite integral can be evaluated only numerically.

10.6. Analysis of the incidence variable

The incidence i-function of the SIR model appears to be an interesting object of study on its own. It was demonstrated that the function is nonelementary in the sense of the definition given below [9].

Definition 10.1

An elementary function is defined as a function built from a finite number of combinations and compositions of algebraic, exponential, and logarithm functions under algebraic operations (+, −, ., /)

Allowing for the underlying field to be complex numbers – C, trigonometric functions become elementary as well.

Definition 10.2 Liouvillian function —

We say that f(x) is a Liouvillian function if it lies in some Liouvillian extension of (C(x),) for some constant field C.

As a first point we establish the nonelementary character of the integral in Eq. (10.22). The necessary introduction to the theory of differential fields is given in Appendix 10.B. From the work of Liouville it is known that a function of the form F(x)=f(x)eq(x), where f, q are elementary functions, has an elementary antiderivative of the form [13]

F(x)dx=feqdx=heq

for some elementary function h(x) [14]. Therefore, differentiating we obtain

feq=heq+hqeq

so that if eq0 h+hq=f holds. The claim can be strengthened to demand that h be algebraic for algebraic f and q (see Theorem 10.1).

Theorem 10.2

The integrals

I±(ξ)=dξξ(W±(eξcgg)+1)

are not Liouvillian.

Proof

We use im parametrization. Let c=im+gglogg. The proof proceeds by change of variables – first ξ=glogyyg+g+im; followed by z=log((glogy+im+g)/g).

I=dξξ(W±(eξimg1)+1)=y1y(glog(y)gy+im+g)(W(yey)+1)dy=dyy(glog(y)gy+im+g)=1gez+img+1eezez+img+1dz

since W±(yey)=y. The last integral has the form

AezeezAezdz

which allows for the application of the Liouville theorem in the form of Corollary 10.1. We can identify

fezdx=hez,f(z)=AeezAez,A=eimg+1

so that

AeezAez=h(z)+h(z)

for some unknown algebraic h(z). Since the left-hand side of the equation is transcendental in z so is the right-hand side, which is a contradiction. Therefore, the integrand has no elementary antiderivative. □

The result of the last paragraph leaves the question about the form of i(τ) somehow wanting. Here we demonstrate a functional equation for the i variable exhibiting its non-Liouvillian character.

Ritt established that if an elementary function has an elementary inverse it is a sequential composition of algebraic and transcendental functions [15]. Prelle and Singer proved in Corollary 3 that if the autonomous system y=f(y) has an elementary first integral then

g(y)=dyf(y)

is also elementary [16]. This presents a direct way of proving that the incidence function is nonelementary by virtue of Theorem 10.2 but leaves open the question about its non-Liouvillian character. In order to clarify the form of i(τ) we make use of the integral I(ξ). On the first place, the following identity holds for its kernel:

1ξ(W±(eξcgg)+1)=1ξW±(eξcgg)ξ(W±(eξcgg)+1)

Therefore,

I±(ξ)=logξW±(eξcgg)dξξ(W±(eξcgg)+1)==logξ+dz(gz1)W±(eglogzzgg)(glogzz+c)(W±(eglogzzgg)+1)|z=W±(eξcgg)=logξ+dzglogzz+c|z=W±(eξcgg)=logξ+L(W±(eξcgg))

where we have used the defining identity for the Lambert W function and defined the auxiliary function up to a constant as the indefinite integral

L(z)=dzglogzz+c (10.38)

which is a Liouvillian extension nonelementary integral by Proposition 10.8. Exponentiating and using the notation L(z) it can be seen that

exp(I±(ξ))=ξeL(W±(eξcgg))=egτ

However, ξ=i(τ); therefore

i(τ)=egτL(W±(ei(τ)cgg)) (10.39)

or

i(τ)=egτL(Ω±(i(τ)cglogg)),Ω±(z)=Ω(z±jπ) (10.40)

from which we can infer the non-Liouvillian character of i(τ) from the form of the equation. This can be done as follows. For any given τ the RHS is a composition of a Liouvillian function with a non-Liouvillian one, i.e. the Wright Ω function being not a Liouvillian one [17]. It can be seen that the only scenario, where i(τ) is elementary is when the second term, containing the L function vanishes. This is exactly the scenario, when s0=0 as indicated by Eq. (10.7), for in this case i˙=gi and i is exponential in τ. To remove the ambiguity in the definition of L(z) we observe that z=s/g. Therefore,

L(z):=1zduglogu/gu+g+imlogim

so that L(1)=logim ensuring the identity at τ=0. Furthermore, it can be claimed that

Theorem 10.3

The incidence functioni(t), defined by the differential equation(10.21), is not Liouvillian.

10.7. Asymptotic analysis of the SIR model

In their seminal publication, Kermack and McKendrick develop asymptotics for r under the assumption that r/g is small. Recently, Barlow and Weinstein [8] derived Padé approximation scheme for the s-variable. The approximant for the s-variable by construction, matches the correct t behavior and whose expansion about t = 0 is exact to nth-order.

In the previous publication [9], a double exponential form of the asymptotic for the i-variable was given, however without performing comprehensive study.

With the hindsight of the functional equation (10.39) a different scheme is more appropriate. On the first place, the functional equation immediately reveals the long-time asymptotic behavior of i.

limti(τ)=limτegτL(Ω+(i(τ)cglogg))=0

since i(τ) is monotonously decreasing for τm>0. Therefore, it follows that

i(τ)egτL(Ω+(cglogg))=kegτ

for large τ.

Since the solution is Lipshitz everywhere in R one can make use of the Banach Fixed Point Theorem. Notably, one can use the nonlinear approximation scheme of Daftardar-Gejji-Jafari for solving the equivalent integral equations [18]. If we treat the equations of the SIR model as independent we can formally solve Eq. (10.4) as

s(τ)=s0ei(t)dτ

On the other hand, Eq. (10.5) can be transformed formally as

i˙+gi=siegτd(egτi)=sidτd(egτt)egτi=sdτ

Therefore, formally,

i(τ)=k2egτ+s(τ)dτ

Starting from the 0th order approximation is i(0)i0, it follows that

s(0)s0ei0τ

This approximation is valid around the fixed point sg, which we can take as an initial condition s0=g since then di/ds=0. However, this corresponds to the peak-value parametrization so i0=im, i(0)=0. From this, the 1st order approximation for the i-variable becomes

i(1)=imegim(1eimt)gt (10.41)

A second iteration of the loop results in a nonelementary γ integral. Let

J:=egim(1eimτ)gτdτ

Then, by change of variables

J=ygim1egimgyimdy|y=eimτ=Γ(gim,gimeimτ)(gim)gim

where Γ is the upper incomplete Euler's gamma function. Therefore,

r(1)=gJ=k3Γ(gim,gimeimτ)(gim)gim

Matching the long-term limit results in the equation

r(1)=g(W+(eimg1)W(eimg1))Γ(gim,geimtim)Γ(gim) (10.42)

which can be used also for fitting purposes. Then

s(2)=k2eJ=k2exp(Γ(gim,gimeimτ)(gim)gim)

Matching the initial condition gives

s(2)=gexp((Γ(a,a)Γ(a,aeimt))(ea)a),a=gim (10.43)

Finally, following the same procedure for the i-variable we obtain

i(2)=imexp(geq0τeΓ(a,aeimz)(ea)adzgτ),q=Γ(a,a)(ea)a (10.44)

where a=gim. The asymptotics of the i-variable are shown in Fig. 10.5 .

Figure 10.5.

Figure 10.5

Asymptotic solutions i(τ).

Asymptotic solutions compared to parametric plots of (τ(i),i) parametrized by im = 3.7283 and g = 2.0. Legends: LB denotes the left branch involving W(x), RB denotes the right branch involving W+(x), Eq. (10.27). A denotes the asymptotic solution. A – the asymptotic is computed from Eq. (10.41); B – the asymptotic is computed from Eq. (10.44). Plots were produced using the quad_qags Maxima numerical integration command.

10.8. Numerical approximation

The i-function can be efficiently approximated by the Newton's method [9]. The Newton iteration scheme is given as follows for the c-parametrization:

in+1=in+gin(W±(eincgg)+1)(tggW±(eincgg)dyy(glogyy+c))

(see Theorem 10.1). This form has the advantage of requiring only 1 Lambert W function evaluation per iteration.

A point of attention here is the choice of the initial value for the iteration scheme. According to the presented analysis, a suitable elementary initializing function is i1(t) given by Eq. (10.41).

10.9. Cast study I: application to influenza A

The influenza data are tabulated in [4]. The task here is to fit the influenza incidence in a boarding school. The fitting equation is given by

ItNi(tT|g,im)

where It is the observed incidence. The fitting functionality was implemented MATLAB®. Quadratures were estimated by the default MATLAB integration algorithms. The parametric fitting was conducted using least-squares constrained optimization algorithm. A least square fit obtained R2=53.1163 for the asymptotic fit and R2=62.9415 for the full model. Fitted parameters are presented in Table 10.1 . The plots are presented in Fig. 10.6 . The fitting produced peak estimate of 296.2197 for the asymptotic equation and 303.9163 for the full model in excellent agreement with the raw data (see Table 10.1).

Table 10.1.

Incidence parameters.

g R0 T [days] im fit
0.5493 1.8206 6.4849 296.2197 asymptotic
0.4710 2.1233 6.3609 303.9163 sir

Figure 10.6.

Figure 10.6

SIR model fitting of influenza outbreak.

A – estimate of the i variable; B – estimate of the r variable compared with the cumulative number of cases.

10.10. Cast study II: application to COVID-19

The second case study uses COVID datasets were downloaded from the European Centre for Disease Prevention and Control (ECDC) website: https://opendata.ecdc.europa.eu/covid19/casedistribution/csv. The data were imported in the SQLite https://www.sqlite.org database, filtered by country and transferred to MATLAB for parametric fitting using native routines. Quadratures were estimated by the default MATLAB integration algorithms. Estimated parameter values were stored in the same database. The processing is described in [9], For numerical stability reasons the time variable was rescaled by a factor of 10. The estimate represented in Fig. 10.8 has been published in [9]. The asymptotic approach is exemplified with data from Italy (Fig. 10.7 ).

Figure 10.8.

Figure 10.8

Case fatality model for Belgium.

A – The parametric fit of the case fatality data; B – Cumulative deaths compared to the estimate from the r-variable. The origin corresponds to 1st January 2020.

Figure 10.7.

Figure 10.7

Parametric fitting of the first and second waves in Italy.

A, B – case fatality fitting; C, D – incidence fitting; ‘asymp’ refers to parametric fit using the asymptotic formula Eq. (10.41), ‘sir’ refers to fitting the i-variable computed by numerical inversion using the parameters estimated by Eq. (10.41). Note the pronounced weekly variation of the reported numbers.

10.11. Discussion and conclusions

The present chapter demonstrates that the SIR model could be solved completely. Recent developments have exhibited the solutions of the model in terms of special functions. As anticipated in previous literature, these functions are nonelementary and in particular the incidence i-function is non-Liouvillian. On the other hand, the asymptotic solutions of the model could be readily established in terms of iterated exponents or incomplete gamma functions.

In conclusion, the SIR model has proven to be valuable in modeling epidemic outbreaks. In many situations it has shown excellent agreement with the historical data [3], [4] as well as with the data coming from the present COVID-19 pandemics [9], [8], [10]. On the other hand, it should be remembered that any model is only a cartoon of reality and it is always important to challenge model assumptions with the available facts on the ground. A distinction should be always made between the mathematical consistency or even beauty of a certain model and its empirical applicability In particular, predictions based on empirically false models may be harmful, and it is essential to distinguish between assumptions that simplify but do not alter the predicted effects substantially, and wrong assumptions, which are not supported by the underlying ecology or social dynamics.

Appendix 10.A. The Lambert W function and related integrals

The Lambert W function can be defined implicitly by the equation

W(z)eW(z)=z,zC

We observe that by Lemma 10.1 W(z) is transcendental. Furthermore, the Lambert function obeys the differential equation for x1

W(x)=eW(x)1+W(x)

The W function is nonelementary and in particular it is non-Liouvillian [17]. Its indefinite integral is:

W(x)dx=xW(x)+xW(x)x

The Lambert W is a multivalued function. Properties of the W function are given in [19].

Proposition 10.5

dy1+W(eycgg)=glog(gW(eycgg))+C

Proof

We differentiate

g(logW(eycgg))=eycgW(eycgg)gW(eycgg)(1+W(eycgg))=11+W(geicg)

 □

Proposition 10.6

gloggg+cidξξ(W±(eξcgg)+1)=gggW±(eicgg)dyy(glogyy+c)

Proof

We use the change of variables ξc=glogyy and then simplification by the defining identity of the Lambert W function.

gloggg+cidξξ(W(eξcgg)+1)=ABgy1(glogyy+c)(W(eglogyy+cgcgg)+1)dy=ABgyy(gylogyy2+cy)g+gylogyy2+cydy=gABdyy(glogyy+c)

where

glogAA+c=gloggg+c,glogBB+c=i

Therefore, A=g and B=gW(eicgg). □

The proof of Theorem 10.2 establishes also the character of two related integrals:

Proposition 10.7

The integral

I=dyy+cey

is not elementary.

Proof

By change of variables y=ex

I=dyy(logyy+c)=dxexxc

 □

Proposition 10.8

L(z) is a nonelementary integral.

Proof

This can be established by an argument using change of variables as

L(z)=dzglogzz+c=eudueu+gu+c|z=eu

further applying the same reasoning as in the proof of Theorem 10.2. □

Appendix 10.B. Differential fields

Definition 10.3

Denote by C(x,ci,θi) the complex-valued ring, generated by the finite set of rational functions {θi}in and constants {ci}in.

Definition 10.4

An element θ is called algebraic if P(x,θ)=0 for some polynomial

P(x,t)=tm+am1tm1++a0,

where ai can be also rational functions of x, or else it is called transcendental.

Lemma 10.1 Composition lemma —

Denote by a and t the algebraic or transcendental elementary functions, respectively. The following composition rules hold

aa=a,ta=t,at=t

Proof

The case aa when a(x) is a polynomial is trivial. Suppose that a and b are both algebraic:

P(x,a)=0,Q(x,b)=0

Without loss of generality suppose that ai are polynomial. Formally, b=f¯k(x) for any branch k with f¯ algebraic since it is a root of a polynomial, where the bar denotes the inverse function in order to avoid confusion with exponentiation. Therefore,

ab=bm+am1bm1++a0=f¯km(x)+am1f¯km1(x)++a0

is algebraic since it is computed by a finite sequence of algebraic operations.

Suppose that t=exp(x). Then exp(a) is not algebraic, hence it is transcendental. P(x,ex)=exm+am1exmx++a0 is exponential.

Suppose that t=log(x). Then log(a) is not algebraic, hence it is transcendental. P(x,log(x))=logmx+am1logm1x++a0 is not algebraic, hence it is transcendental. □

In what follows is assumed that the differential field is of characteristic zero and has an algebraically closed field of constants. An element y of a differential field is said to be an exponential of an element A if y=Ay, an exponential of an integral of an element A if y=Ay; logarithm of an element A if y=A/A, and an integral of an element A if y=A.

The next definition is due to [17].

Definition 10.5

Let (k,d/dx) be a differential field of characteristic 0. A differential extension (K,d/dx) of k is called Liouvillian over k if there are θ1,,θnK, such that K=C(x,θ1,,θn) and for all i, at least one of the following

  • 1.

    θi is algebraic over k(θ1,,θn1)

  • 2.

    θik(θ1,,θn1)

  • 3.

    θi/θik(θ1,,θn1)

holds. The constant subfield C(K) of K is defined to be the set of c in K, such that c=0.

The next theorem is due to [14].

Theorem 10.4

If K is an elementary field, then it is closed under differentiation.

An elementary integrability theorem due to Conrad [14].

Theorem 10.5 Rational Liouville criterion —

Forf,gC(x)with f and g nonconstant the functionf(x)eg(x)can be integrated in elementary terms if and only if there exists a rational functionhC(x)such thath+gh=f.

The last result can be extended to algebraic functions as follows.

Corollary 10.1 Algebraic Liouville criterion —

Forf(x),g(x)algebraic and nonconstant, the functionf(x)eg(x)can be integrated in elementary terms if and only if there exists an algebraic functionh(x), for whichh+gh=f.

Proof

Suppose that f and g are arbitrary elementary algebraic functions. Denote the primitive of f by capital F in the juxtaposition f÷F. The integral can be integrated by parts

I=f(x)eg(x)dx=eg(x)dF=F(x)eg(x)F(x)(eg(x))dx

Therefore,

(f(x)+F(x)g(x))eg(x)dx=F(x)eg(x)

We observe that g(x) is elementary by Theorem 10.4. The L.H.S. has the form feg and since f(x)+F(x)g(x) is elementary we can identify

hF,f1f+Fg=h+hg

so that (h+hg)eg=(heg) and the claim follows. □

References

  • 1.Brauer F. Mathematical epidemiology: past, present, and future. Infectious Disease Modelling. 2017 may;2(2):113–127. doi: 10.1016/j.idm.2017.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kermack W.O., McKendrick A.G. A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character. 1927 aug;115(772):700–721. [Google Scholar]
  • 3.Brauer F., van den Driessche P., Wu J., editors. Mathematical Epidemiology. Springer Berlin Heidelberg; 2008. [Google Scholar]
  • 4.Martcheva M. Springer US; 2015. An Introduction to Mathematical Epidemiology. [Google Scholar]
  • 5.Champredon D., Dushoff J., Earn D.JD. Equivalence of the Erlang-distributed SEIR epidemic model and the renewal equation. SIAM Journal on Applied Mathematics. 2018 jan;78(6):3258–3278. [Google Scholar]
  • 6.Rodrigues H.S. Application of SIR epidemiological model: newtrends. International Journal of Applied Mathematics and Informatics. 2016;10:92–97. [Google Scholar]
  • 7.Harko T., Lobo F.S.N., Mak M.K. Exact analytical solutions of the Susceptible-Infected-Recovered (SIR) epidemic model and of the SIR model with equal death and birth rates. Applied and Computational Mathematics. 2014 jun;236:184–194. [Google Scholar]
  • 8.Barlow N.S., Weinstein S.J. Accurate closed-form solution of the SIR epidemic model. Physica D. Nonlinear Phenomena. 2020 jul;408 doi: 10.1016/j.physd.2020.132540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Prodanov D. Entropy; Basel, Switzerland: 2020 Dec. Analytical Parameter Estimation of the SIR Epidemic Model. Applications to the COVID-19 Pandemic; p. 23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kudryashov N.A., Chmykhov M.A., Vigdorowitsch M. Analytical features of the SIR model and their applications to COVID-19. Applied Mathematical Modelling. 2021 feb;90:466–473. doi: 10.1016/j.apm.2020.08.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Weiss H. The SIR model and the foundations of Public Health. Materials Matemàtics. 2013;3:1–17. [Google Scholar]
  • 12.Sadurní E., Luna-Acosta G. Exactly solvable SIR models, their extensions and their application to sensitive pandemic forecasting. Nonlinear Dynamics. 2021 feb;103(3):2955–2971. doi: 10.1007/s11071-021-06248-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Rosenlicht M. On the explicit solvability of certain transcendental equations. Publications Mathématiques de l'Institut des Hautes Études Scientifiques. 1969;36:15–22. http://www.numdam.org/item/PMIHES_1969__36__15_0 Available from. [Google Scholar]
  • 14.Conard B. Clay Mathematics Institute; 2005. Impossibility theorems for elementary integration. 2005 Academy Colloquium Series. [Google Scholar]
  • 15.Ritt J.F. Elementary functions and their inverses. Transactions of the American Mathematical Society. 1925 jan;27(1):68. [Google Scholar]
  • 16.Prelle M.J., Singer M.F. Elementary first integrals of differential equations. Transactions of the American Mathematical Society. 1983 jan;279(1):215. [Google Scholar]
  • 17.Bronstein M., Corless R.M., Davenport J.H., Jeffrey D.J. Algebraic properties of the Lambert W function from a result of Rosenlicht and of Liouville. Integral Transforms and Special Functions. 2008 oct;19(10):709–712. [Google Scholar]
  • 18.Daftardar-Gejji V., Jafari H. An iterative method for solving nonlinear functional equations. Journal of Mathematical Analysis and Applications. 2006 apr;316(2):753–763. [Google Scholar]
  • 19.Corless R.M., Gonnet G.H., Hare D.E.G., Jeffrey D.J., Knuth D.E. On the Lambert W function. Advances in Computational Mathematics. 1996 dec;5(1):329–359. [Google Scholar]

Articles from Mathematical Analysis of Infectious Diseases are provided here courtesy of Elsevier

RESOURCES