Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Jul 1;139:110064. doi: 10.1016/j.chaos.2020.110064

Modelling the downhill of the Sars-Cov-2 in Italy and a universal forecast of the epidemic in the world

Gabriele Martelloni a, Gianluca Martelloni a,
PMCID: PMC7328650  PMID: 32834614

Abstract

In a previous article [1] we have described the temporal evolution of the Sars-Cov-2 in Italy in the time window February 24-April 1. As we can see in [1] a generalized logistic equation captures both the peaks of the total infected and the deaths. In this article our goal is to study the missing peak, i.e. the currently infected one (or total currently positive). After the April 7, the large increase in the number of swabs meant that the logistical behavior of the infected curve no longer worked. So we decided to generalize the model, introducing new parameters. Moreover, we adopt a similar approach used in [1] (for the estimation of deaths) in order to evaluate the recoveries. In this way, introducing a simple conservation law, we define a model with 4 populations: total infected, currently positives, recoveries and deaths. Therefore, we propose an alternative method to a classical SIRD model for the evaluation of the Sars-Cov-2 epidemic. However, the method is general and thus applicable to other diseases. Finally we study the behavior of the ratio infected over swabs for Italy, Germany and USA, and we show as studying this parameter we recover the generalized Logistic model used in [1] for these three countries. We think that this trend could be useful for a future epidemic of this coronavirus.

Keywords: Sars-Cov-2, Italy, Logistic model, Non linear differential equations, Model calibration

1. Introduction

We briefly review the historical evolution of the Sars-Cov-2 in the Earth. In early December the Sars-Cov-2 appeared in Wuhan, China.

The disease caused by the new Coronavirus has a name: ”COVID-19” (where ”CO” stands for corona, ”VI” for virus, ”D” for disease and ”19” indicates the year in which it occurred). The Oms Director-General Tedros Adhanom Ghebreyesus announced it on February 11, 2020, during the extraordinary press conference dedicated to the virus.

The appearance of new pathogenic viruses for humans, previously circulating only in the animal world, is a widely known phenomenon (called spill over) and it is thought that it may also be at the basis of the origin of the new coronavirus (SARS- CoV-2). The scientific community is currently trying to identify the source of the infection.

On December 31, 2019, the Municipal Health Commission of Wuhan (China) reported to the Oms a cluster of cases of pneumonia of unknown etiology in the city of Wuhan, in the Chinese province of Hubei.

On January 9, 2020, the Chinese Center for Disease Prevention and Control (CDC) reported that a new coronavirus (initially called 2019- nCoV and now called SARS-CoV-2) has been identified as the causative agent and has been rendered publishes the genomic sequence.

Oms on March 11, 2020 declared that COVID-19 can be defined as a pandemic.

After notification of the epidemic by China, Italy immediately recommended postponing unnecessary flights to Wuhan and, subsequently, with the spread of the epidemic, to all of China. Consequently, the latter has canceled all flights from Wuhan.

This disease does not save Italy that has become a protected area with the DPCM signed on the evening of 9 March by the Prime Minister, Giuseppe Conte, who has extended the restrictive measures already applied for Lombardy and the 14 northern provinces most affected by the coronavirus infection to the whole national territory. The new action comes into force on March 10, and will take effect until April 3. Among the main innovations: it limits the movement of people, blocks sporting events, suspends teaching activities in schools and universities throughout the country until April 3.

With the new ordinance of 22 March 2020 issued by the Minister of Health and the Minister of the Interior, from 22 March people are prohibited from moving with public or private trasportation in a municipality other than that in which they are located, except for proven work needs, absolute urgency or for health reasons [2].

Many growth models have been very recently applied to study the evolution of the Covid-19 infection [3], [4], [5], [6], [7], [8], [9], [10]. In [1] we tried to analyze the time evolution of the Sars-Cov-2 in Italy, using a Logistic model[11] at the beginning of the study (compared to the Gompertz law [12]) and after with a generalization of that model.

The Logistic behaviour assumes that growth stops when maximum sustainable population density is reached through the carrying capacity K that depends on the environmental conditions. For example the ordinances of the Prime Minister G.Conte, the people’s hygiene habits are encoded in the carrying capacity K.

We observe as the generalized model of [1] works very well until the April 7. After this date the large increase in the number of swabs meant that the logistical behavior of the infected curve no longer worked. At first in Italy, pharyngeal swabs were initially made only on seriously ill people. This choice gave us the possibility to have a sample of the infected that we can describe with a single population model, after April 7, it becomes impossible. So we decided to use a different model to describe the new trend of the data and try to give different scenarios of the descent phase of the virus in Italy, in the time window February 24-May 5. In [1] we described two different peaks, the peak of the infected and the deaths one. In this paper we analyze the peak of the currently infected and the downhill of the propagation of the Sars-Cov-2. To do this we define a new model similar to a SIRD (see for example [7]), but without the population of supsceptibles, because there are no criteria on defining the susceptible ones. We consider three couple differential equations for Infected I(t), Deaths D(t) and Recovery R(t) with the following conservation law

P(t)=I(t)R(t)D(t), (1)

where P(t) represents the currently infected (or positive).

In the last part of this article we observe as the following ratio (infected I(ti) over swabs S(ti))

Inorm(ti)=I(ti)S(ti), (2)

is the most important parameter to describe the evolution of the Sars-Cov-2. Indeed, we can describe the trend of this quantity only with a generalized Logistic model with 4 parameters even with data after April 7. This behavior suggest us to use this model for a future epidemic of this virus. If we will able to perform a greater and constant number of swabs everyday, using this model, we may have better control over the contagion curve, and consequently over the number of deaths.

2. The new model and the description of the dowhill

Our idea is to use a model that adapts to the data of the problem. We explain better. Let’s consider the following data:

  • 1) daily data of March 14: 3497 new infected, 11,682 swabs;

  • 2) daily data of April 17: 3493 new infected, 65,705 swabs;

  • 3) probable Case Fatality Rate (CFR) of the virus is rougly 1% →  2.3 million of infected at April 17, with 172,434 detected;

  • 4) time delay between hospitalization and death td ≃ 4 days, parameter extrapolated also in [1];

  • 5) time delay between the onset of symptoms and healing tr1442 days, a very oscillating parameter;

  • 6) two different LockDown (LD) data, March 10, and March 22, with different restriction.

Some comments about these data: the points 1) and 2) describe perfectly that the sample of infected is not clean; at the beginning of the contagion the swabs are performed only on the severe infected, after 1 month the number of swabs are increased of a factor 6 and consequently also the midly infected are detected. Point 3) tells us that there is probably an incredible number of asymptomatics as a source of severe infected, we have no control about it. Points 4), 5) indicate that while the death data is under control, the healed data are very oscillating in time. Finally the points 6) tells us that contribution of asymptomatics, portrayed in [1], changes in time, indeed from April 6–7 (14–15 days after the second LD, i.e. an incubation time) the generalized Logistic description fails.

After these considerations we have decided to couple the following equations:

dIdt=r0Iα+Atβtt0, (3)
dIdt=r0Iαr0IδK+Atβtγ(tt0)t>t0, (4)
dDdt=KfdI(ttd)dt, (5)
dRdt=f(t)dI(ttr)dt, (6)

with a conservation law

P(t)=I(t)R(t)D(t), (7)

where P(t) represents the currently infected (or positive).

The parameters r 0 represents the rates of growth of epidemic, K is the carrying capacity for the classical logistic model, α is a constant in order to have a power low initial growth before LD, β is the exponent of the second term of Eq. 1 that represents the influence of asymptomatic; δ, a correction of the quadratic term of logistic, and γ are the constant parameters considering the influence of the government measures1 , Kf is a proportionality constant between deaths and total number of infected, while td and tr are the delays of deaths and recoveries respect to infected respectively; the constant A represents the contribution of asymptomatic people as introduced in [1] and finally t 0 is the time of LD start.

A brief consideration about the function f(t): the great variability of tr suggest us that only the parameter tr is not sufficient to describe correctly the function R(t), so we decided to introduce a coefficient time dependent.

We present two different scenarios, in Fig. 1 we consider a linear approximation f(t)=a+bt, while in Fig. 2 we consider a quadratic approximation f(t)=a+bt+ct2. This choice is not random. Indeed, considering the behaviour of the recovery time series in which a single recovery can heal with some delay in a window variable from few days to two months, the correct modeling could be a regressive linear function of type R(t)=i=1Nai*I(tti) (eventually introducing also no-linear term in the series), but in this way we introduce many degree of freedom how many are the coefficient ai of the regressive function. Therefore, we consider an approximation using the two functions f(t) considered above.

Fig. 1.

Fig. 1

The scenario with a linear growth for the recoveries: the black curve represents the deaths, the red one for the infected, the green one for the recovery and the pink one for the currently infected. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Fig. 2.

Fig. 2

The scenario with a quadratic growth for the recoveries: the black curve represents the deaths, the red one for the infected, the green one for the recovery and the pink one for the currently infected. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

We desumed the following values for the principal parameters by means of 100 stochastic simulation using direct method Gillespie algorithm adapted to non-autonomous differential equations:

r0=0.29±0.02, (8)
K=1.85*105±1.1*104, (9)
t0=30±1, (10)
δ=1.84±0.01, (11)
Kf=0.142±0.002, (12)
td=4±1, (13)
tr=8±1forlinearapproximation, (14)
tr=12±1forquadraticapproximation. (15)

Some comments about these values: with respect to [1] we observe that the peak of the severe infected is correctly estimated, i.e. t0=2426 March and also the peak of the deaths, i.e. t0=2830 March; also the time delay td remained the same, while tr approaches the experimental lower limit in quadratic approximation. With respect to the Logistic model r 0 is increased and the coefficient δ drops from the value 2 to the value 1.84, i.e. we are considering different models.

In Fig. 12 we observe as the peak of currently infected is close to April 20, and finally we give us our prevision for a linear growth for the recoveries

I(end)=247471,D(end)=35235, (16)

close to July 10; for a quadratic growth we have

I(end)=243766,D(end)=34682, (17)

close to June 20.

The estimated numbers I(end) and D(end) are very close, but it is not surprising: the Eqs. (3) and (4) for total infected head the model, while f(t) is present in Eq. (6) that is only a proportionality equation. Obviously a linear approximation for f(t) leads to a slower recovery curve and therefore a small increase of infected.

3. Recovering the logistic behaviour

Now we consider the following parameter:

Inorm(ti)=I(ti)S(ti), (18)

that represents the number of infected normalized with the number of swabs S(ti).

We study this quantity with generalized Logistic equation used in [1]:

dInorm(t)dt=r0(Inorm(t)αInorm(t)2K)+A, (19)

where α, r 0, K and A have the same meaning used in the previous section. Compared to the previous section we observe as studying the parameter I norm(ti) we can describe the contagion with a simple logistic equation and without the phenomenological terms introduced in eqs.(3)-(6).

In order to calibrate this model in the best way possible we use two algorithms, the first one based on simulated annealing [13] and the second one on optimized simplex [14]. We evaluate the function error defined as

F(p)=1/Ni=1Nwi(xiyi(p))2 (20)

where xi is the real data at day i, yi(p) is the correspondent output of the model depending of vector parameter p and wi is a generic weight that we can use or can be equal to one. For our purpose we adopt as weight the derivative of data or the data at time (day) i: the use of derivative allows to calibrate better on average the curve, while the use of the data as weight permit to calibrate better the data of the last part of the curve.

In Fig. 34 we describe the situation of Italy at May 5, while Fig. 56 describe the data of Italy from which we have subtracted the data of Lombardia.

Fig. 3.

Fig. 3

The scenario of Italy with the derivative weight.

Fig. 4.

Fig. 4

The scenario of Italy with the data weight.

Fig. 5.

Fig. 5

The scenario of Italy minus Lombardia with the data weight.

Fig. 6.

Fig. 6

The scenario of Italy minus Lombardia with the derivative weight.

We give you the following values for the parameters of Fig. 3

r0=0.175±0.015, (21)
K=31.2±0.6, (22)
α=0.476±0.08, (23)
A=0.010±0.003, (24)

for the parameters of Fig. 4

r0=0.178±0.015, (25)
K=38.5±0.6, (26)
α=0.398±0.08, (27)
A=0.011±0.003, (28)

for the parameters of Fig. 5

r0=0.143±0.015, (29)
K=22.3±0.4, (30)
α=0.444±0.08, (31)
A=0.011±0.003 (32)

and finally for the parameters of Fig. 6

r0=0.143±0.015, (33)
K=20.2±0.4, (34)
α=0.485±0.08, (35)
A=0.010±0.003. (36)

We observe as the quantity I norm(t) is probably the most important quantity studying the evolution of the virus! We explain better: the contribution of asymptomatic people is essentially the same in Lombardia and in the rest of Italy, while the coefficient r 0 is larger if we consider Italy compared to the scenario of Italy minus Lombardy; this consideration is extremely coherent with the data: the infected of Lombardia region represent the 37% of all the italian infected.

Moreover, the ratio infected over swabs is a very reliable parameter, we can describe correctly the italian situation only with 4 parameter and with a well-known model. We stress that in the future if a nation is ready to carry out a large and constant number of swabs every day, using this model, we can have a reliable forecast of the epidemic!

3.1. A comparison with Germany and USA

We consider also the scenario represented by Eq. (19) for Germany in Fig. 7 and for USA in Fig. 8 . For Germany we study the time evolution of the Sars-Cov-2 in the time window March 8-May 11, and we obtain the following parameters

r0=0.069±0.008, (37)
K=8.6±0.2, (38)
α=0.61±0.08, (39)
A=0.013±0.003. (40)

For USA we study the contagion in the time window March 10-May 11, and we have these values for the parameters

r0=0.082±0.008, (41)
K=29.9±0.4, (42)
α=0.71±0.09, (43)
A=0.012±0.003. (44)

Fig. 7.

Fig. 7

The scenario of Germany with the derivative weight.

Fig. 8.

Fig. 8

The scenario of USA with the data weight.

A brief comment about these parameters: the values r 0 are similar to each other and lower than the Italian value, with and without Lombardia data. This is completely coherent with the various scenarios: the apparent lethality (apparent CFR) is approximately 4.5% in Germany and 6% in the United States, while in Italy is 14%. The contribution of asymptomatic A seems to be similar for all countries.

However, the surprising result is that the trend of the virus is captured by the same differential equation, i.e. Eq. (19), for all considered countries. This could mean that the virus always has the same type of behavior, but at different speed. Let’s try to justify this idea: a different speed may depend on population density, work habits and the number of swabs at the beginning of the epidemic. About the last consideration we imagine to immediately carry out a large number of swabs: knowing as soon as possible the largest possible number of infected means limiting the contagion and therefore the propagation speed of virus.

4. Conclusions

We described the evolution of the Sars-Cov-2 in Italy in the time window February 24-May 5. To do this we have built a phenomenological growth model adapted on the data of Civil Protection. With respect to a classical SiR(D) model we did not consider the supsceptible population, because there are not medical evidences on which sample of the population can be ill. So we have considered three couple differential equations for Infected I(t), Deaths D(t) and Recovery R(t) with the a conservation law including the currently positive population P(t).

As the time delay between the onset of symptoms and healing tr days is a very oscillating parameter we introduced a sort of regressive function f(t) to modelling better this delay. So we described two scenarios of the end of epidemic:

  • I(end)=247471,D(end)=35235, close to July 10, for f(t) linearly approximated,

  • I(end)=243766,D(end)=34682, close to June 20, for f(t) in a quadratic approximation.

Obviously a linear approximation for f(t) leads to a slower recovery curve and therefore a small increase of infected.

In the second part of this manuscript we described the time evolution of the normalized data

Inorm(ti)=I(ti)S(ti), (45)

that represents the number of infected normalized with the number of swabs S(ti).

We have studied this parameter on four different scenarios:

  • Italy,

  • data of Italy minus data of Lombardia (about 37% of the Italian infected belong to the Lombardia region),

  • USA,

  • Germany.

So we have found that all the evolutions are governed by the same generalized logistic equation [1], suggesting an universal feature of the propagation of Sars-Cov-2 virus. In particular the value of the parameter r 0 is in descending order compatible with the respective Apparent CFR (ACFR)

  • for Italy r0=0.175 and ACFR=14%,

  • for Italy-Lombardia r0=0.143 and ACFR=11%,

  • for USA r0=0.082 and ACFR=6%,

  • Germany r0=0.069 and ACFR=4,5%.

Finally, we suggest that the data I norm(ti) is the most important parameter to control the propagation of the virus for a new inauspicious propagation of this virus in the world, because, knowing its universal feature, we can forward know the number of infected preparing a relevant number of swabs.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

We thank many colleagues for interesting discussions, in particular Andrea Marzolla and Domenico Seminara. We also thank Pierluigi Blanc, S.O.C. Infectious Diseases 1 Santa Maria Annunziata Hospital, for stimulating discussions on technical subjects on which we had no knowledge.

Footnotes

1

Remember that the measures of LD are not homogeneous in time, two different DPCM are present in the period March 10-May 4

References

  • 1.Martelloni G., Martelloni G. Analysis of the evolution of the sars-cov-2 in italy, the role of the asymptomatics and the success of logistic model. 2020. arXiv:2004.02224, 5 April. [DOI] [PMC free article] [PubMed]
  • 2.http://www.salute.gov.it/portale/nuovocoronavirus.
  • 3.Lai A., Bergna A., Acciarri C., Galli M., Zehender G. J. Med. Virol. 2020. Early phylogenetic estimate of the effective reproduction number of sars-cov-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Chen Y., Liu Q., Guo D. Emerging coronaviruses: genome structure, replication, and pathogenesis. J MedVirol. 2020;92:418. doi: 10.1002/jmv.25681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Castorina P., Iorio A., Lanteri D. Data analysis on coronavirus spreading by macroscopic growth laws. 2020. arXiv:2003.00507, 1 March.
  • 6.Fenga L. Covid19: An automatic, semiparametric estimation method for the population infected in italy. MedRxiv preprint doi: 10.1101/2020.03.14.20036103. [DOI] [PMC free article] [PubMed]
  • 7.Fanelli D., Piazza F. Analysis and forecast of COVID-19 spreading in China, Italy and France. Chaos, Solitons & Fractals, 134, 109761; 2020. [DOI] [PMC free article] [PubMed]
  • 8.Agosto A., Giudici P.. A poisson autoregressive model to understand COVID-19 contagion dynamics. Ssrn - abstract- id=3551626.
  • 9.Bialek S. February 12, - March 16, 2020. MMWR Morb Mortal Wkly Rep. ePub: 18 March. 2020. CDC COVID-19 Response team, severe outcomes among patients with coronavirus disease 2019 (COVID- 19) - united states. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Lanteri D., Carco’ D., Castorina P. How macroscopic laws describe complex dynamics: asymptomatic population and CoviD-19 spreading. arXiv:2003.12457.
  • 11.Verhulst P.F. Notice sur la loi que la population poursuit dans son accroissement. Correspondance Mathematique et Physique. 1838;10:113. [Google Scholar]
  • 12.Gompertz B. On the nature of the function expressive of the law of human mortality and a new mode of determining life contingencies. Phil Trans R Soc. 1825;115:513. [Google Scholar]
  • 13.Cardoso M.E., Salcedo R.L., de Azevedo S.E. The simplex-simulated annealing approach to continuos non-linear optimization. Comput Chem Eng. 1996;20:1065–1080. [Google Scholar]
  • 14.Marsili-Libelli S. Parameter estimation of ecological models. Ecol Modell. 1992;62:233–258. [Google Scholar]

Articles from Chaos, Solitons, and Fractals are provided here courtesy of Elsevier

RESOURCES