Abstract
We study a simple realistic model for describing the diffusion of an infectious disease on a population of individuals. The dynamics is governed by a single functional delay differential equation, which, in the case of a large population, can be solved exactly, even in the presence of a time-dependent infection rate. This delay model has a higher degree of accuracy than that of the so-called SIR model, commonly used in epidemiology, which, instead, is formulated in terms of ordinary differential equations. We apply this model to describe the outbreak of the new infectious disease, Covid-19, in Italy, taking into account the containment measures implemented by the government in order to mitigate the spreading of the virus and the social costs for the population.
Subject terms: Statistical physics, Biological physics, Applied mathematics, Infectious diseases
Introduction
Within a very few months, a viral infection called Covid-19 (Coronavirus disease 19), which originated in China, broke through the borders of all countries and rapidly spread all over the globalized world. Italy is one of the hardest-hit countries and is suffering the dramatic consequences of this disease. The outbreak of the new coronavirus that causes the infection seems to be out of our control. In the absence of a therapy and a vaccine, social distancing measures and a strict lockdown appear to be the most effective means to contain the growth of the infection. We should remember that there are places in the world where infectious diseases, including those already defeated in the so-called more developed countries, can still have very severe consequences for the local populations.
Even if we cannot answer the question of why a virus starts spreading and what its origin is, we can still ask how it diffuses. The aim of this work is, therefore, to provide a simple, handy model for epidemic spreading that depends only on the two parameters which generally characterize an infectious disease: the infection rate and the infectiousness (or recovery) time. Both quantities can be taken from observation; therefore, we do not need further fitting parameters, which could lead to artificial predictions. We will show that the model we present has the same, or even higher, predictive power than one of the most widely used techniques in epidemiology, the SIR model [1–3]. The latter requires the presence of a recovery rate related to the number of recovered persons, without considering that the new cases of recovery (and fatality) come from cases infected at an earlier time. The model we propose is instead based on the fact that the closed cases come from the infected ones after an average recovery delay time. Therefore, contrary to the SIR model, which is formulated in terms of a set of ordinary differential equations, it is described by a single functional retarded differential equation, which keeps the predictions under better control. In this work we derive the exact analytical solution of this model in the limit of a large population, also in the presence of a time-dependent infection rate, which is the case when containment measures are implemented in order to reduce the spreading of the infection. Moreover, the definition of the so-called basic reproduction number (a parameter determining whether an infectious disease can spread or not) comes out naturally in our delay model. Delay models have already been used in epidemiology in many cases [5–10]. We consider the case where the infection period is constant and provide, for the first time, an analytical result for the spreading of the disease in the early stage of the infection.
Finally, we apply this technique to give a quantitative description of the diffusion of Covid-19 in Italy, showing the current scenario based on the actual situation and what would have happened without the containment measures. It is generally quite difficult to give a reliable forecast of the fate of an epidemic, because it depends heavily on individual and social behaviors, on the effectiveness of the containment measures already implemented or still to be taken by the government, and on future political decisions. At present, even if the situation in Italy is improving, it seems that more effort is needed in order to change course and rapidly stop the spreading of the disease. Further measures might be useful, for instance (i) running more diagnostic tests, at least on all the doctors and medical workers who are in contact with many patients, (ii) improving food distribution to avoid crowding in food shops and to ensure subsistence goods for those in need, and (iii) providing medical devices such as surgical masks to the whole population.
As a last remark, we recall that the outbreak of Covid-19 has been declared a pandemic by the World Health Organization. Many countries are already overwhelmed by this infection and by the risk it poses to public health; therefore, in a networked world, we all have to behave and operate with an improved spirit of cooperation. The bitter lesson imparted by this tough situation is that we cannot save ourselves alone.
Results
The model
Let us introduce the model, assuming that the full population is constant, uniform, homogeneously mixed, and counts N persons who can be divided in three parts, susceptible, infected and recovered persons, whose numbers, at a given time t, are S(t), I(t) and R(t), respectively.
Let us define $I_0$ as the number of initially infected persons at time $t=0$, and introduce $P(t)$, the probability of remaining infectious at a later time $t$ after becoming infectious. $P(t)$ is a monotonically decreasing function with $P(0)=1$ and $P(\infty)=0$. The number of the first infectious persons, therefore, decreases according to $I_0 P(t)$, while other susceptible persons become infected after coming in contact with those already infected, with a rate of infection $\alpha$, which counts the number of contacts per unit of time, times the probability for an infected person to transmit the infection. The probability of new infections at a given time $x$ is, therefore, proportional to the ratio $S(x)/N$ of persons who are still susceptible and to the number of infected persons who are still infectious, $I(x)P(t-x)$. At a later time $t$, the number of infected persons is, therefore, given by
$$I(t) = I_0\,P(t) + \frac{\alpha}{N}\int_0^t S(x)\,I(x)\,P(t-x)\,dx \tag{1}$$
Equivalently, writing $\dot P(t) \equiv dP(t)/dt$, Eq. (1) can be written as
$$\frac{dI(t)}{dt} = \frac{\alpha}{N}\,S(t)\,I(t) + I_0\,\dot P(t) + \frac{\alpha}{N}\int_0^t S(x)\,I(x)\,\dot P(t-x)\,dx \tag{2}$$
Since $P(t)$ is a non-increasing function, $\dot P(t)$ is negative; therefore the last two terms of Eq. (2) reduce the increase of infection due to the first term. For that reason we can identify those terms as minus the variation of the removed cases
$$\frac{dR(t)}{dt} = -I_0\,\dot P(t) - \frac{\alpha}{N}\int_0^t S(x)\,I(x)\,\dot P(t-x)\,dx \tag{3}$$
It is convenient, for the benefit of future discussion, to introduce the total number of infected persons, counting both those who are still infected at time t, I(t), and those who recovered or died, R(t),
$$F(t) = I(t) + R(t) \tag{4}$$
From Eqs. (2) and (3), since $S(t) = N - F(t)$, we have that F(t) fulfills the following equation
$$\frac{dF(t)}{dt} = \alpha\left(1 - \frac{F(t)}{N}\right)\bigl[F(t) - R(t)\bigr] \tag{5}$$
which is valid for any choice of $P(t)$.
Standard SIR model
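If we now choose $P(t) = e^{-\gamma t}$, inserting it in Eqs. (2)–(3) we recover the celebrated SIR model [1–3]
$$\frac{dS(t)}{dt} = -\frac{\alpha}{N}\,S(t)\,I(t) \tag{6}$$
$$\frac{dI(t)}{dt} = \frac{\alpha}{N}\,S(t)\,I(t) - \gamma\,I(t) \tag{7}$$
$$\frac{dR(t)}{dt} = \gamma\,I(t) \tag{8}$$
where $\gamma$ is the recovery rate. In order to make a comparison with what follows, let us solve these equations when the population N is very large, and as long as $F(t) \ll N$, such that $S(t) \simeq N$. In this situation we have
$$\frac{dI(t)}{dt} = (\alpha - \gamma)\,I(t), \qquad I(t) = I_0\,e^{(\alpha-\gamma)t} \tag{9}$$
and solving $dF(t)/dt = \alpha I(t)$, with the initial condition $F(0) = I_0$, we get that the growth of the total number of infections, at the early stage, has the following form
$$F(t) = I_0\,\frac{\alpha\,e^{(\alpha-\gamma)t} - \gamma}{\alpha - \gamma} \tag{10}$$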
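As a quick numerical check of the early-stage SIR growth law, the sketch below integrates the linearised equations dI/dt = (α − γ) I and dF/dt = α I with a forward Euler scheme and compares the result with the closed form F(t) = I_0 (α e^{(α−γ)t} − γ)/(α − γ), which follows from those two equations with F(0) = I_0. The function names and the parameter values are illustrative assumptions and are not taken from the paper.

```python
import math

def sir_early_closed_form(t, alpha, gamma, i0):
    """Early-stage SIR total cases, valid while F(t) << N and alpha != gamma."""
    growth = alpha - gamma
    return i0 * (alpha * math.exp(growth * t) - gamma) / growth

def sir_early_euler(t_end, alpha, gamma, i0, dt=1e-4):
    """Forward-Euler integration of dI/dt = (alpha - gamma) I and dF/dt = alpha I."""
    infected, total = i0, i0
    steps = int(round(t_end / dt))
    for _ in range(steps):
        d_inf = (alpha - gamma) * infected   # uses the value at the start of the step
        total += dt * alpha * infected
        infected += dt * d_inf
    return total

# Illustrative values (per day), chosen only for this check
alpha, gamma, i0 = 0.3, 0.1, 1.0
exact = sir_early_closed_form(10.0, alpha, gamma, i0)
approx = sir_early_euler(10.0, alpha, gamma, i0)
print(f"closed form: {exact:.4f}, Euler: {approx:.4f}")
```

The two numbers should agree to within the discretisation error of the Euler scheme, which shrinks as `dt` is reduced.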
Delay model
If, instead, we choose $P(t) = \theta(T - t)$, a step function, namely $P(t) = 1$ for $t < T$ and $P(t) = 0$ for $t > T$, inserting it in Eqs. (2) and (3), with $\dot P(t) = -\delta(t - T)$, we get
$$\frac{dS(t)}{dt} = -\frac{\alpha}{N}\,S(t)\,I(t) \tag{11}$$
$$\frac{dI(t)}{dt} = \frac{\alpha}{N}\,S(t)\,I(t) - \frac{\alpha}{N}\,S(t-T)\,I(t-T) - I_0\,\delta(t-T) \tag{12}$$
$$\frac{dR(t)}{dt} = \frac{\alpha}{N}\,S(t-T)\,I(t-T) + I_0\,\delta(t-T) \tag{13}$$
From the equations above it is easy to see that $dR(t)/dt = dF(t-T)/dt$, therefore $R(t) = F(t-T) + C$, with C a constant value. We recall that, contrary to I(t), both F(t) and R(t) are cumulative quantities, namely they are monotonically increasing functions. Requiring that F(t) saturates at $F(\infty) = R(\infty)$, the constant value has to be $C = 0$, therefore
$$R(t) = F(t - T) \tag{14}$$
This equation describes the realistic fact that the total number of cases at some time t becomes that of removed cases at the later time $t + T$, namely after an infectious period T. This seems to be the case also for the new coronavirus, judging from some reported data for Covid-19 in Italy, shown in Fig. 3 (see also Ref. [4]). Equation (14) allows us to write Eq. (5) in terms of the function F(t) only. Eq. (5), for the delay model, therefore reads
$$\frac{dF(t)}{dt} = \alpha\left(1 - \frac{F(t)}{N}\right)\bigl[F(t) - F(t-T)\bigr] \tag{15}$$
where $F(t) = 0$ for $t < 0$. This delay differential equation is known to be linked to non-Markovian dynamics [11]. If we consider the case where the population N is very large, and as long as $F(t) \ll N$, we can neglect the logistic term, $F(t)/N$, so as to have
$$\frac{dF(t)}{dt} = \alpha\,\bigl[F(t) - F(t-T)\bigr] \tag{16}$$
We expect that this functional retarded differential equation, Eq. (16), describes accurately the spreading of the epidemic, at least at the early stage of the infection.
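The delay model described above can be explored numerically with very little code. The following is a minimal sketch, assuming the dynamics stated in the text: the growth rate of F is α times the susceptible fraction 1 − F/N times the number of persons still infected, F(t) − F(t − T), with F(t) = 0 for t < 0 and F(0) = I_0. It uses a plain forward Euler scheme with a history buffer; the function names, the step size and the parameter values are illustrative assumptions, not the author's implementation.

```python
def solve_delay_model(alpha, period, population, i0, t_end, steps_per_period=1000):
    """Forward-Euler solution of dF/dt = alpha (1 - F/N) [F(t) - F(t - T)].

    History: F(t) = 0 for t < 0 and F(0) = i0.
    Returns (times, totals), where totals[k] approximates F(times[k]).
    """
    dt = period / steps_per_period
    n_steps = int(round(t_end / dt))
    totals = [float(i0)]
    for k in range(n_steps):
        # F(t - T) is read from the stored history; it is zero before t = T
        delayed = totals[k - steps_per_period] if k >= steps_per_period else 0.0
        current = totals[k]
        rate = alpha * (1.0 - current / population) * (current - delayed)
        totals.append(current + dt * rate)
    times = [k * dt for k in range(n_steps + 1)]
    return times, totals

def still_infected(totals, steps_per_period):
    """I(t) = F(t) - F(t - T), using the relation R(t) = F(t - T)."""
    return [f - (totals[k - steps_per_period] if k >= steps_per_period else 0.0)
            for k, f in enumerate(totals)]

# Illustrative parameters: alpha = 0.3 per day, T = 10 days (alpha * T = 3)
alpha, period, population, i0 = 0.3, 10.0, 1.0e6, 10.0
times, totals = solve_delay_model(alpha, period, population, i0, t_end=200.0)
infected = still_infected(totals, 1000)
print(f"final size: {totals[-1] / population:.3f} of the population")
print(f"peak of I(t): {max(infected):.0f}")
```

A fixed-step Euler scheme is chosen here only for transparency; a higher-order method of steps would reach the same accuracy with fewer steps.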
Basic reproduction number
Let us rewrite Eq. (16), for , in the following form
$$\frac{dF(t)}{dt} = \frac{\alpha}{\alpha_c}\,\frac{F(t) - F(t-T)}{T} \tag{17}$$
where we introduce $\alpha_c = 1/T$ and naturally identify $\alpha/\alpha_c$ as the so-called basic reproduction number
$$R_0 = \frac{\alpha}{\alpha_c} = \alpha\,T \tag{18}$$
which is a widely used parameter for predicting whether the infectious disease will spread into a population or die out, and represents the average number of cases originated by a single infectious case during the infectiousness period. Eq. (17) implies that the first derivative of F(t) is equal to its increment in a time interval T, divided by T, namely F(t) is linear in t, if the rate is equal to the critical value $\alpha = \alpha_c$ ($R_0 = 1$). For $\alpha > \alpha_c$ ($R_0 > 1$), the function F(t) increases faster than linearly, while for $\alpha < \alpha_c$ ($R_0 < 1$), F(t) grows slower than linearly (see Fig. 1). If we let $\alpha$ vary in time, when $\alpha = \alpha_c$ ($R_0 = 1$) the function F(t) has an inflection point, where it changes from being concave to convex or vice versa. Making a comparison with the SIR model, where $R_0 = \alpha/\gamma$, one can identify $\gamma$, the recovery rate, with the inverse of the recovery time, $1/T$. Notice that $R_0$ is well defined as long as $F(t) \ll N$, namely in the early stage of the infection. In general terms one has to define the generalized reproduction number $R_t = R_0\,(1 - F(t)/N)$ so that Eq. (15) can be written in the same form as Eq. (17) with $R_t$ instead of $R_0$.
Analytical solution
In this section we will provide the exact solution of Eq. (16). Writing the time t as , where is the integer part of t/T, the solution of Eq. (16) is given by
19 |
where the functions fulfill the following iterative equation
20 |
with for any and , so that, for , we recover . The full exact solution is, therefore, obtained by solving a cascade of n local integrals. The proof of Eqs. (19) and (20) is given in Methods.
At time , from Eq. (20), performing the chain of integrals, and putting the results in Eq. (19), we get the following exact result
21 |
For instance, for and , namely up to twice the infectiousness period, the total number of cases is simply . Surprisingly we find that Eq. (21) depends only on , which is the basic reproduction number . It is easy to check from Eq. (21) that, while for large , F(nT) is dominated by an exponential behavior, for , F(nT) becomes linear in n. From Eqs. (19) and (20) we can also write the following equation
22 |
By iteration one gets simply
23 |
where fulfills the following recursive equation, with initial value ,
24 |
The final exact result for any time is, therefore,
25 |
where . For practical reasons, in order to avoid indeterminate forms, for and , in Eq. (25) one can add an infinitesimal term , so to have . Once we have the total number of infections F(t) at any time, we get also the number of removed cases, , and we can easily calculate, from Eq. (25), the number of persons who are still infected, at a given time t, which, by definition and from Eq. (16), is given by
26 |
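For readers who want to experiment with the large-population solution, the sketch below evaluates a closed form obtained by solving dF/dt = α[F(t) − F(t − T)] interval by interval (method of steps), with F(t) = 0 for t < 0 and F(0) = I_0. This expression, F(t) = I_0 Σ_{k=0}^{n} [−α(t − kT)]^k e^{α(t − kT)}/k! with n the integer part of t/T, is derived here independently and may be arranged differently from the formulas of this section; the function name and the test values are assumptions. The k = 0 term is treated separately so that the indeterminate form 0^0 never appears.

```python
import math

def closed_form_total(t, alpha, period, i0):
    """Method-of-steps solution of dF/dt = alpha [F(t) - F(t - T)].

    Assumes F(t) = 0 for t < 0 and F(0) = i0 (large-population limit).
    """
    if t < 0:
        return 0.0
    n = int(t // period)
    total = 0.0
    for k in range(n + 1):
        shifted = t - k * period
        # k = 0 is handled separately so that the 0**0 case never arises
        term = 1.0 if k == 0 else (-alpha * shifted) ** k / math.factorial(k)
        total += term * math.exp(alpha * shifted)
    return i0 * total

# F(nT) in units of i0 for alpha * T = 1: the increments per period settle to a constant,
# i.e. the growth becomes linear in n
values = [closed_form_total(n * 1.0, 1.0, 1.0, 1.0) for n in range(6)]
increments = [b - a for a, b in zip(values, values[1:])]
print(values)
print(increments)
```

At the sampling times t = nT the result depends on α and T only through the product αT, which can be checked by calling the function with different pairs (α, T) that share the same product.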
Comparison between the delay model and the standard SIR model
As we have seen, one assumption the standard SIR model is based on is that the time in which individuals remain infectious is described by an exponential distribution, which is however biologically rather unrealistic. In reality, infectious periods are fairly closely centered about the mean duration of an infection. A constant infectious period is therefore a more realistic assumption. The conventional SIR model being formulated in terms of ordinary differential equations, requires the presence of an effective recovery (and fatality) rate which might not correspond to the actual rate since the new cases of recovery (and fatality) come from infected cases occurring a few days earlier. For that reason, instead of writing the problem in terms of ordinary differential equations one has to do it in terms of functional differential equations, as for the delay model. Even if the recovery rate of the SIR model is chosen to be equal to the inverse of the average infectious period, the dynamics obtained by solving Eqs. (6–8) does not correspond to the dynamics obtained by solving Eqs. (11–13). As shown in Fig. 2, even with the same initial conditions and the same , the growth and the expected peak of the spreading of the infectious disease are quite different between the two models, even if the asymptotic final values are the same. For the SIR model predicts a much lower peak of I(t) with respect to that expected from the delay model, which is much sharper and occurs much earlier. In other words, the outbreak of an epidemic disease might be underestimated by the standard SIR model. We notice also that the analytic expression for F(t) in Eq. (25) describes fairly well the increase of the infection, at least in its early stage.
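The qualitative difference between the two models can be reproduced with a short numerical experiment. The sketch below integrates the standard SIR equations and the delay model with the same infection rate, the same initial condition, and a SIR recovery rate set to the inverse of the infectious period, then compares the height and the time of the peak of I(t). It assumes the delay dynamics dF/dt = α(1 − F/N)[F(t) − F(t − T)] with I(t) = F(t) − F(t − T); the forward Euler scheme, the function names and the parameter values are illustrative choices, not those used for the figures.

```python
def peak_sir(alpha, gamma, population, i0, t_end, dt):
    """Forward-Euler SIR run; returns (peak of I, time of the peak)."""
    s, i = population - i0, float(i0)
    best, best_t = i, 0.0
    for k in range(int(round(t_end / dt))):
        new_inf = alpha * s * i / population
        # both updates use the values at the start of the step
        s, i = s - dt * new_inf, i + dt * (new_inf - gamma * i)
        if i > best:
            best, best_t = i, (k + 1) * dt
    return best, best_t

def peak_delay(alpha, period, population, i0, t_end, dt):
    """Forward-Euler delay model with I(t) = F(t) - F(t - T); returns (peak of I, time)."""
    lag = int(round(period / dt))
    totals = [float(i0)]
    best, best_t = float(i0), 0.0
    for k in range(int(round(t_end / dt))):
        delayed = totals[k - lag] if k >= lag else 0.0
        infected = totals[k] - delayed
        if infected > best:
            best, best_t = infected, k * dt
        growth = alpha * (1.0 - totals[k] / population) * infected
        totals.append(totals[k] + dt * growth)
    return best, best_t

# Illustrative parameters: alpha = 0.3 per day, T = 10 days, gamma = 1 / T
alpha, period, population, i0 = 0.3, 10.0, 1.0e6, 10.0
sir_peak, sir_time = peak_sir(alpha, 1.0 / period, population, i0, t_end=300.0, dt=0.01)
delay_peak, delay_time = peak_delay(alpha, period, population, i0, t_end=300.0, dt=0.01)
print(f"SIR:   peak {sir_peak:.0f} at day {sir_time:.1f}")
print(f"delay: peak {delay_peak:.0f} at day {delay_time:.1f}")
```

With these values the delay model is expected to show a higher and earlier peak than the SIR model, in line with the comparison described above.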
Time-dependent infection rate: analytical solution
Let us now consider the possibility of having a time-dependent infection rate $\alpha(t)$ in the dynamical equation for the total number of infected persons
$$\frac{dF(t)}{dt} = \alpha(t)\,\bigl[F(t) - F(t-T)\bigr] \tag{27}$$
Also in this more general case the exact solution, valid for any profile of , can be written in the same form of Eq. (19), namely, , where now the functions are given by
28 |
For instance, , and so on. For constant α, Eq. (28) reduces to Eq. (20). See “Methods” for more details about the derivation. The solution F(t) has therefore to fulfill the following recursive equation, after splitting the time in n intervals T with the residual time
29 |
This general result implies that if we know the time dependence of the infection rate, or if we can tailor its evolution through, for instance, containment measures, we obtain the exact analytical expression of F(t), the total number of infected persons, as a function of time, as long as F(t) is much smaller than N.
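The effect of a time-dependent infection rate can also be illustrated numerically. The sketch below integrates dF/dt = α(t)[F(t) − F(t − T)] in the large-population limit, with F(t) = 0 for t < 0 and F(0) = I_0, for a rate that drops smoothly from one value to another. The tanh-shaped profile, the function names and all parameter values are hypothetical choices made for illustration only; they are not the profile or the parameters used in this paper.

```python
import math

def smooth_step_rate(t, rate_before, rate_after, t_switch, width):
    """Hypothetical smooth step: goes from rate_before to rate_after around t_switch."""
    drop = rate_before - rate_after
    return rate_after + 0.5 * drop * (1.0 - math.tanh((t - t_switch) / width))

def solve_linear_delay(rate_fn, period, i0, t_end, steps_per_period=1000):
    """Forward Euler for dF/dt = alpha(t) [F(t) - F(t - T)], F(0) = i0, F(t < 0) = 0."""
    dt = period / steps_per_period
    totals = [float(i0)]
    for k in range(int(round(t_end / dt))):
        delayed = totals[k - steps_per_period] if k >= steps_per_period else 0.0
        totals.append(totals[k] + dt * rate_fn(k * dt) * (totals[k] - delayed))
    return totals

# Illustrative scenario: T = 10 days, alpha * T drops from 3 to 0.5 around day 20
period, i0 = 10.0, 1.0
contained = solve_linear_delay(
    lambda t: smooth_step_rate(t, 0.3, 0.05, 20.0, 3.0), period, i0, 150.0)
unchecked = solve_linear_delay(lambda t: 0.3, period, i0, 150.0)
print(f"with containment:    F(150) = {contained[-1]:.1f}")
print(f"without containment: F(150) = {unchecked[-1]:.3e}")
```

Once the product α(t)T stays below 1, the total number of cases flattens towards a finite value, whereas with a constant rate above the critical value it keeps growing.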
Covid-19 in Italy
Let us consider the delay model in its general form, Eq. (15), where the infection rate varies in time
$$\frac{dF(t)}{dt} = \alpha(t)\left(1 - \frac{F(t)}{N}\right)\bigl[F(t) - F(t-T)\bigr] \tag{30}$$
as the effect of some containment measures taken in order to reduce the impact of an infection on the population. As an example, let us suppose that $\alpha$ is modified by social distancing measures, lockdown and the shutdown of many work activities, as is happening in Italy (and in many other countries) to mitigate the spreading of the new coronavirus, after two main decrees issued by the Italian Prime Minister ordering the lockdown of the whole national territory, on March 11th (lockdown and shutdown of many stores) and March 22nd, 2020 (shutdown of many factories and strengthening of social distancing measures), and after some other measures taken shortly before for local regions (e.g. the decree of March 8th for the lockdown of Lombardy and other areas). As a result, we can imagine that $\alpha$ decreases smoothly after those dates, taking into account the time needed by individuals to adapt to the new social behaviors and the period needed to complete the last activities before the closure of the factories. Let us suppose, therefore, that $\alpha(t)$ changes in time according to a smooth step function as in Eq. (31),
31 |
where and are the times where the steps are located, and make the function to be smooth, is the initial observed infection rate which causes the starting exponential growth of the epidemic disease, the intermediate rate, which fits with the data, supposed to be reached after the first decree of lockdown, and the supposed asymptotic infection rate after the second decree of lockdown. Fixing the average of recovery and fatality time T, the reproduction number is also a function of time, therefore we define
32 |
with a profile shown in Fig. 4. More precisely , but as we will see, because of the containment measures, at any time.
Solving Eq. (30), or, analogously, using the recursive relation in Eq. (29), with the time-dependent rate given by Eq. (31) and the parameters reported in Fig. 4, we obtain the solution F(t), which slowly goes to saturation over time, in perfect agreement with the data for the total number of confirmed infected cases, as shown in Fig. 5, where the blue line is the expected curve and the red points are the official data. The dotted gray lines in Fig. 5 represent F(t) if the containment measures had not been taken. As one can see from Figs. 4–5, only when the reproduction number becomes smaller than 1 does the curve flatten, allowing the epidemic spreading to stop and preventing a large part of the population from getting infected. For a reproduction number equal to 1, F(t) would increase linearly and I(t) would become almost constant, meaning that the number of new infections would be equal to the number of closed cases. A reliable forecast has to take into account the fact that the official data on infectious cases are obtained by counting mostly the symptomatic cases, probably missing other infectious cases that could transmit the virus even with mild or no symptoms. Moreover, the data on both the total number of infected persons and the recovered ones could be affected by the procedure, the turnaround times and the number of diagnostic tests. However, since our model relies on the infectiousness time, it does not need a fit of the data for recovered persons, which may be affected by systematic errors. This uncertainty in the data for closed cases would compromise the result for the SIR model. On the contrary, our theoretical prediction based on the delay model agrees fairly well with the data set for total infected cases, as shown in Fig. 5.
As a final remark, we recall that most of the confirmed infected cases in Italy are counted after the appearance of symptoms, and the persons who exhibit severe ones are mostly hospitalized and afterwards counted as infected persons. Some of them, unfortunately, die approximately 4 days later (therefore approximately 9 days after the onset of the first symptoms, as reported by the Istituto Superiore di Sanità [14]). We observe that, splitting the closed cases between real recovered persons, , and dead persons, ,
33 |
and since the confirmation of recovery needs extra diagnostic tests which are not widely performed yet, the most reliable data are those related to dead persons , which are found to be linked to the total number of confirmed infected cases, F(t), in the following way
34 |
with a proportionality factor of about 1/7 and a delay time of about 4 days, as shown in Fig. 6. The number of victims follows the number of total confirmed cases and is equal to 1/7 of its value four days earlier.
The fatality of the sick persons, those who exhibit some symptoms, is therefore quite high, about 1/7.
Discussion
We present a simple but realistic model for describing epidemic spreading, based on the fact that the closed cases come from infected ones at an earlier time. This observation allows us to formulate the problem in terms of a single functional differential equation depending on two well-defined, clinically relevant parameters: the infection rate and the infectiousness time. We provide the exact analytical solution for such an equation, in the limit of a large population, finding how it depends on the basic reproduction number , see Eqs. (21) and (25). Contrary to the result of the conventional SIR model, the total number of cases has a combined polynomial and exponential growth. We also derive the analytic solution in the presence of a generic time-dependent infection rate, which is the case when some measures are taken to weaken the spreading of the epidemic. We then apply our model to study the spreading of Covid-19 in Italy, allowing the infection rate to vary in time as a result of the containment measures implemented by the government in order to mitigate the consequences of the infection on the population. We find perfect agreement between the official data and the expected theoretical results. In general terms, the reproduction number should be suppressed well below 1 in order to rapidly recover the initial condition. By a rough estimation, in order to have a decline of the infection as fast as its growth, containment measures or possible therapies should be effective enough to reduce the basic reproduction number and reach the final value such that , starting from an initial value . In the case of Covid-19 in Italy, the initial value for the basic reproduction number was , while the current one (April) seems to settle at , implying a rather slow decline of the infection. Finally, we discussed the fatality rate, showing that the number of victims is a fixed fraction of the total number of cases a few days earlier.
Before we conclude, a final comment is in order. The confirmed cases are mostly symptomatic or mildly symptomatic. There are also asymptomatic cases which may contribute to the spreading of the infection. However, by scaling arguments, the infection rates of the symptomatic and asymptomatic cases are expected to be equal, otherwise either symptomatic or asymptomatic cases might become irrelevant. Under the hypothesis that the infectiousness time does not depend on the strength of the symptoms, the ratio between the total number of asymptomatic and symptomatic cases should be constant, although it could be very large. As a result, the total number of infected persons should be equal to the number of symptomatic cases times an overall prefactor greater than one. The conclusion is, therefore, that, as far as the time evolution of the infection is concerned, which is the aim of this work, the study of only symptomatic cases is still relevant and highly meaningful.
Methods
Solution of the retarded differential equation
For , the solution of Eq. (16) is . Let us consider with infinitesimal dt, from Eq. (16)
35 |
Using this result we can calculate
36 |
Analogously, from that, we can proceed calculating
37 |
and going on by adding infinitesimal time steps, we find iteratively that
38 |
39 |
with and defining
40 |
In particular, for , we have an expression for F(2T) in terms of the function at early time, . We can now start again with the iteration
41 |
One can proceed in the same way as before getting
42 |
which can be written as
43 |
where
44 |
We can notice that at any step T we can perform the same calculation since we can factorize the function F as
45 |
where, therefore, and
46 |
In the continuum limit, and , keeping finite the time interval , reminding that
47 |
we finally obtain the result reported in Eq. (20).
In the presence of time dependent infection rate, splitting again the time in n intervals T and the residual time in m infinitesimal intervals dt, we define
48 |
Proceeding iteratively as done for the constant rate case, but now keeping track of the different values of ,
49 |
after several steps, similar to those done previously, we find that Eq. (46) can be generalized in the following way
50 |
whose continuum limit is given in Eq. (28).
Author contributions
L.D. conceived the work, performed the calculations and wrote the manuscript.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1. Anderson RM, Anderson B, May RM. Infectious Diseases of Humans: Dynamics and Control. Oxford: Oxford University Press; 1992.
- 2. Keeling MJ, Rohani P. Modeling Infectious Diseases in Humans and Animals. Princeton: Princeton University Press; 2011.
- 3. Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proc. R. Soc. A. 1927;115:700–721.
- 4. Symptoms of Novel Coronavirus (2019-nCoV), CDC (Center for Disease Control and Prevention), https://www.cdc.gov, 10/02/2020, https://www.cdc.gov/coronavirus/2019-ncov/about/symptoms.html.
- 5. Diekmann O, Heesterbeek JAP. Mathematical Epidemiology of Infectious Diseases in Model Building, Analysis and Interpretation. New York: Wiley; 2000.
- 6. Arino J, van den Driessche P. Delay Differential Equations and Applications in Time Delay in epidemic models, 539–578. New York: Springer; 2006.
- 7. Zhang F, Li Z, Zhang F. Global stability of an SIR epidemic model with constant infectious period. Appl. Math. Comput. 2008;199:285–291.
- 8. Beretta E, Breda D. An SEIR epidemic model with constant latency time and infectious period. Math. Biosci. Eng. 2011;8:931–952. doi: 10.3934/mbe.2011.8.931.
- 9. Ruschel S, Pereira T, Yanchuk S, Young LS. An SIQ delay differential equations model for disease control via isolation. J. Math. Biol. 2019;79:249–279. doi: 10.1007/s00285-019-01356-1.
- 10. Young LS, Ruschel S, Yanchuk S, Pereira T. Consequences of delays and imperfect implementation of isolation in epidemic control. Sci. Rep. 2019;9:3505. doi: 10.1038/s41598-019-39714-0.
- 11. Kiss IZ, Röst G, Vizi Z. Generalization of pairwise models to non-Markovian epidemics on networks. Phys. Rev. Lett. 2015;115:078701. doi: 10.1103/PhysRevLett.115.078701.
- 12. https://lab.gedidigital.it/gedi-visual/2020/coronavirus-i-contagi-in-italia/.
- 13. Gaeta G. Asymptomatic infectives and R0 for COVID. arXiv:2003.14098.
- 14. https://www.iss.it and https://www.salute.gov.it/nuovocoronavirus.