Abstract
The Susceptible-Exposed-Infected-Recovered (SEIR) epidemiological model is one of the standard models of disease spreading. Here we analyse an extended SEIR model that accounts for asymptomatic carriers, believed to play an important role in COVID-19 transmission. For this model we derive a number of analytic results for important quantities such as the peak number of infections, the time taken to reach the peak and the size of the final affected population. We also propose an accurate way of specifying initial conditions for the numerics (from insufficient data) using the fact that the early time exponential growth is well-described by the dominant eigenvector of the linearized equations. Secondly we explore the effect of different intervention strategies such as social distancing (SD) and testing-quarantining (TQ). The two intervention strategies (SD and TQ) try to reduce the disease reproductive number, to a target value but in distinct ways, which we implement in our model equations. We find that for the same TQ is more efficient in controlling the pandemic than SD. However, for TQ to be effective, it has to be based on contact tracing and our study quantifies the required ratio of tests-per-day to the number of new cases-per-day. Our analysis shows that the largest eigenvalue of the linearised dynamics provides a simple understanding of the disease progression, both pre- and post- intervention, and explains observed data for many countries. We apply our results to the COVID data for India to obtain heuristic projections for the course of the pandemic, and note that the predictions strongly depend on the assumed fraction of asymptomatic carriers.
Keywords: Covid-19, SEIR, Compartmentalised model, Mitigation, Testing-Quarantining
The COVID-19 pandemic, that started in Wuhan (China) around December 2019 [1], [2], has now affected almost every country in the world. The total number of confirmed cases on July 30, 2020 were close to 17.5 million with close to 686,000 deaths [3]. One of the serious concerns presently is that there is as yet no clear picture or consensus on the future evolution of the pandemic. It is also not clear as to what is the ideal intervention strategy that a government should implement, while also taking into account the economic and social factors. The role of mathematical models has been to provide guidance for policy makers [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18].
One of the standard epidemiological model is the SEIR model [19] which has four compartments of Susceptible (), Exposed (), Infected () and Recovered () individuals with being the total population of a region (the model can be applied at the level of a country or a state or a city and is expected to work better for well-mixed populations). The SEIR model is parameterized by the three parameters and that specify the rates of transitions from and respectively. In terms of the data that is typically measured and reported, corresponds to the total number of cases till the present date, while would be the number of new cases per day. The number of deaths across different countries is some fraction () of [20] while the number of hospital beds required at any time would be . An important parameter characterizing the disease growth is the reproductive number [21], [22] — when this has a value greater than 1, the disease grows exponentially. Typical values reported in the literature for COVID-19 are in the range [23]. For the SEIR model one has [22].
The two main intervention schemes for controlling the pandemic are social distancing (SD) and testing-quarantining (TQ). Lockdowns (LD) impose social distancing and effectively reduce contacts between the susceptible and infected populations, while testing-quarantining means that there is an extra channel to remove people from the infectious population. These two intervention schemes have to be incorporated in the model in distinctive ways [7], [8] — SD effectively changes the infectivity parameter while TQ changes the parameter . Intervention schemes attempt to reduce to a value less than 1. In the context of the SEIR model with it is clear that we can reduce it by either decreasing or by increasing . In this work we point out that for the same reduction in value, the effect on disease progression can be quite different for the two intervention strategies.
Here we analyze an extended version of the SEIR model which incorporates the fact that asymptomatic or mildly symptomatic individuals [7], [8], [9], [24] are believed to play a significant role in the transmission of COVID-19. Our extended model considers eight compartments of Susceptible (), Exposed (), asymptomatic Infected (), presymptomatic Infected (), and a further four compartments (), two corresponding to each of the two infectious compartments. These last four classes comprise of individuals who have either recovered (at home or in a hospital) or are still under treatment or have died — they do not contribute to spreading the infection. We do not include separate compartments for the number of hospitalized and dead since these extra details would not affect our main conclusions.
For this extended SEIR model we first provide, for the case with analytic expressions for peak infection numbers, time to reach peak values, and asymptotic values of total affected populations. These would provide useful guidance on disease progression. The linearized dynamics is accurate when the total affected population is small. We propose an accurate way of specifying initial conditions for the numerics (from insufficient data) using the fact that the early time exponential growth is well-described by the dominant eigenvector of the linearized equations. We next discuss the performance of two different intervention strategies (namely SD and TQ) in the disease dynamics and control. The aim of interventions is to reduce the reproductive number from it’s free value to a target value . We study both the cases, of strong interventions ( aimed at disease suppression,) and that of weak interventions ( aimed at disease mitigation). Apart from the reproductive number, an important parameter is the largest eigenvalue of the linear dynamics, which we denote as . For we have and this gives us the exponential growth rate (doubling time ). On the other hand, for the corresponding is less than 0 and this tells us that infections will decrease exponentially. We find that, for the same reduction of to a value less than 1, the corresponding magnitude can be very different for different intervention schemes. A larger magnitude of corresponding to a faster suppression of the pandemic, is obtained from TQ than that from SD. An important question for disease control is as to how much testing is required. In our work we relate the parameters of the model related to TQ to testing rates and point out that for TQ to be successful: (a) it has to be based on contact-tracing and (b) it is necessary that testing numbers are scaled up according to the number of new detected cases. Finally, we show that many of our results provide a qualitative understanding of COVID-19 data from several countries which have either achieved disease suppression or mitigation.
The rest of the paper is structured as follows. In Section 1 we define the extended SEIR model. A number of analytic results for the linearized model as well as the full nonlinear system are presented in Section 2. In Section 3, we discuss in detail how intervention strategies such as social distancing and testing-quarantining can be incorporated into the model, we comment on how real testing numbers enter the model parameters. We also present numerical studies comparing different intervention protocols and make qualitative comparisons of the predictions of the SEIR model with real data on the COVID-19 pandemic. In Section 4 we make some heuristic predictions in the Indian context. We summarize our results in Section 5.
1. Definition of the extended SEIR model
The extended SEIR model studied here is schematically described in Fig. 1 . It has eight variables () and ten parameters (), of which represents the fraction of asymptomatic carriers while are related to intervention strategies.
We consider a population of size that is divided into eight compartments:
-
1.
Susceptible individuals.
-
2.
Exposed but not yet contagious individuals.
-
3.
Asymptomatic, either develop no symptoms or mild symptoms.
-
4.
Presymptomatic, those who would eventually develop strong symptoms.
-
5.
Undetected asymptomatic individuals who have recovered.
-
6.
Asymptomatic individuals who are detected because of directed testing-quarantining, may have mild symptoms, and would have been placed under home isolation (few in India).
-
7.
Presymptomatic individuals who are detected at a late stage after they develop serious symptoms and report to hospitals. Here we assume that all individuals who develop significant symptoms are eventually detected.
-
8.
Presymptomatic individuals who are detected because of directed testing-quarantining.
We have the constraint that . A standard dynamics for the population classes is given by the following set of equations:
(1) |
(2) |
(3) |
(4) |
(5) |
(6) |
(7) |
(8) |
The parameters in the above equations correspond to
-
•
: fraction of asymptomatic carriers.
-
•
: infectivity of asymptomatic carriers.
-
•
: infectivity of presymptomatic carriers.
-
•
: transition rate from exposed to infectious.
-
•
: transition rate of asymptomatic carriers to recovery or hospitalization.
-
•
: transition rate of presymptomatics to recovery or hospitalization.
-
•
: detection probabilities of asymptomatic carriers and symptomatic carriers.
-
•
: intervention factor due to social distancing (time dependence will be specified later).
-
•
: intervention factor due to testing-quarantining (time dependence to be specified later). This is a rate and depends on testing-quarantining rates.
With our definitions, the total number of confirmed cases, and the number of daily recorded new cases would be
(9) |
Note that we include because these are people who are not detected through directed tests but eventually get detected (after days) when they get very sick and go to hospitals. On the other hand the class get detected because of directed testing, even before they get very sick.
In the next section we will discuss the case where the intervention parameters and are kept fixed, and present a number of analytic results. In Section 3 we will discuss the case where and are time-dependent.
2. Analytic results for model with constant parameters
2.1. Linear analysis of the dynamical equations
Since at early times and all the other populations one can perform a linearization of the above equations. This tells us about the early time growth of the pandemic, in particular the exponential growth rate. Let us define new variables to characterize the linear regime: . At early times when the dynamics is captured by linear equations
(10) |
(11) |
where . For the present we ignore the time dependence of the SD factor and the TQ factor . The matrix has 5 zero eigenvalues while the 3 non-vanishing ones are given by the roots of the following cubic equation for :
(12) |
This can be re-written in the form
(13) |
where and
(14) |
is the expected form for the reproductive number for the disease. One can intuitively see this as follows. The reproductive number is the average number of secondary infection from one infected individual at the initial phase of the outbreak. In our model, an infected individual may either be asymptomatic or presymptomatic with probabilities or respectively. On average, While an asymptomatic individual infects number of people, a presymptomatic carrier infects number of people. Consequently the expected number of secondary infection from an arbitrarily chosen infected individual will be with expression given by Eq. (14). Noting the fact that it follows that the condition for at least one positive eigenvalue is
(15) |
We denote the largest eigenvalue by and note that this is uniquely related to the reproductive number by Eq. (13). At early times the number of cases detected would grow as . For we expect that the largest eigenvalue is close to zero and, after neglecting the and terms in Eq. (13), we can read off the value as
(16) |
Initial conditions: We discuss here the fact that all initial conditions (which satisfy the condition ) quickly move along the direction of the dominant eigenvector and how this provides us a way to choose the correct initial conditions from the knowledge of one variable (e.g confirmed cases) at an early time. We denote the right and left eigenvectors corresponding to an eigenvalue by and respectively. The largest eigenvalue is denoted by with corresponding right and left eigenvectors and respectively. The vector can be written as [27]
(17) |
where the coefficients are determined by putting and then taking the inner product with the left eigenvector say . Doing so, we get which when substituted back in Eq. (17) gives
(18) |
where the second last line is true at sufficiently large times when only one eigenvalue dominates. This proves that the direction of the vector is independent of initial conditions. In particular, using the explicit form of the dominant eigenvector we find the following relation in the growing phase of the pandemic:
(19) |
Let us consider the initial condition so that (noting that )
(20) |
where . At a sufficiently large time (but still in the very early phase of the pandemic) we equate the observed confirmed number on some day to which therefore gives us the relation
(21) |
This then tells us that we should start with the following initial conditions, (now counting time from the day of the observation ):
(22) |
The crucial point is that the leading eigenvector fixes the direction of the growth and then knowledge of linear combination fixes all the other coordinates. Thus, independent of initial conditions, the vector describing all the system variables quickly points along the direction of the eigenvector corresponding to the largest eigenvalue [15], [28]. Hence if we know any one variable (or a linear combination of all the variables) at sufficiently large times in the growing phase, then the full vector is completely specified. This leads to an accurate way of specifying initial conditions for the numerics (from insufficient data) and will help in reducing the number of fitting parameters in modeling studies, thereby increasing their accuracy in predicting.
This fact also implies that different initial conditions (such as different seed infections) will only cause a temporal shift of the observed evolution. This also means that trajectories for different initial conditions are identical up to a time translation. If one uses identical parameters and intervention strategies, then all countries should follow the same trajectory provided they start with the same value for the normalized fraction of confirmed new cases . We illustrate this idea, for the extended SEIR dynamics, in Fig. 2 where we show a plot of for 5 different initial conditions. The inset shows a collapse of all the trajectories after an appropriate time translation of the different trajectories. Can we see a similar collapse of the real data for different countries (after normalizing by the respective populations and with appropriate time translation of the data)? In Fig. 3 we plot the data with this normalization and initial condition and see a rough collapse for several countries. The differences can be attributed to different parameter values and different control strategies in different countries. We notice in particular that three of the Asian countries (India, Pakistan, Indonesia) follow a distinctly different trajectory.
2.2. Final affected population
Let us define the asymptotic populations (i.e the populations at very long times) in the different compartments as and let . The total population that would eventually be affected by the disease (and either recover or die) is given by and would have developed immunity. A fraction (see below) would be undetected and uncounted.
It is possible to compute the final affected population from the dynamical equations in Eqs. (1)–(8). For the moment let us assume that and do not have any time dependence. We also assume that and . Then solving Eq. (1), we get
(23) |
where and are given after Eq. (13). Adding Eqs. (5) and (6) and then multiplying both sides by we get where . Similarly, we also get where . Plugging these two equations into Eq. (23) then gives
(24) |
Next we note that and . Hence, for the initial condition we find that the ratio at all times. Since at large times this means that the asymptotic values of and are given by
(25) |
Using this in Eq. (24), noting that and defining we then get the following simple equation that determines the asymptotic total affected population:
(26) |
where is the reproductive number as stated earlier. We note that Eq. (26) has a non-zero solution only when . In Fig. 4 we show the dependence of on obtained from a numerical solution of Eq. (26). For the simple SIR model the result of Eq. (26) is well known [25], here we show that this is valid for an extended model as well generally. This computation of the asymptotic population can be straightforwardly extended to a more general model where one can have arbitrary number of compartments for the infected and recovered populations.
The asymptotic population of all four compartments within R are thus given by
(27) |
2.3. Peak infections and the time to reach the peak
The peak infection numbers and the time at which the peak occurs are two important quantities that one would like to know during a pandemic. Here we provide heuristic analytic formulas for these, which are derived after making some simple physical assumptions.
We start with the evolution equations for the infected populations, given by:
(28) |
Let us assume that in comparison to the time scale of the pandemic, and all peak at roughly the same time say and let us indicate by the respective peak values. We then obtain
(29) |
Defining Eq. (29) gives
(30) |
(31) |
is an effective recovery rate. Substituting this in Eq. (29) yields
(32) |
Let us further assume that Eq. (32) is valid in general (for all time) and not just at the peak. Then, after defining we have
(33) |
Substituting Eq. (33) in the model Eqs. (1)–(8), and defining we obtain the standard SEIR equations,
(34) |
(35) |
(36) |
(37) |
Here, is given by Eq. (31) and where the reproductive number is given by Eq. (14). Assuming the initial conditions: we can solve Eqs. (34) and (37) to get:
(38) |
At the peak, our assumption of gives from Eq. (35), [using Eq. (30)]. The peak value is then given from Eq. (38) by . Finally, using Eq. (30) and the fact that gives
(39) |
For somewhat simpler models it is possible to obtain an analytical estimate of the peak size [16] and for the time, required to reach the infection peak [26]. In our case we estimate by noting that the linearized dynamics is approximately valid (see previous section) up to the time reaches its peak . Hence we write where is the infection number at some early time (but already in the exponentially growing regime). Hence we get
(40) |
with given by Eq. (39) and is a constant that depends on initial conditions and model parameters.
In Fig. 4, we provide a numerical verification of the results in Eqs. (39) and (40) by solving the extended SEIR equations numerically. One of the main assumptions required for the proof is the validity of Eq. (33). In Fig. (5 ) we check this assumption in a numerical example with a particular parameter set.
3. Interventions: social distancing and testing-quarantining
We next consider the effect of different intervetion strategies which are incorporated into the extended SEIR dynamical Eqs. (1)–(8), through the parameters and which we will now make time-dependent. We discuss here the choices of the intervention functions and . Note that is a dimensionless number quantifying the level of social contacts, while is a rate which, as we will see, is closely related to the testing rate.
Social distancing (SD): We multiply the constant factors by the time dependent function, the “lockdown” function that incorporates the effect of social distancing, i.e reducing contacts between people. A reasonable form is one where has the constant value before the beginning of any interventions, and then from time it changes to a value over a characteristic time scale . Thus we take a form
(41) |
The number indicates the lowering of social contacts.
Testing-quarantining (TQ): We expect that testing and quarantining will take out individuals from the infectious population and this is captured by the terms and in the dynamical equations. A reasonable choice for the TQ function is perhaps to take
(42) |
where one needs a final rate . In general the time at which the TQ begins to be implemented and the time required for it to be effective could be different from those used for SD.
A useful quantity to characterize the system with interventions is the time-dependent effective reproductive number given by
(43) |
At long times this goes to the targeted reproduction number
(44) |
The time scale for the intervention target to be achieved is given by and .
Relation of the TQ functionto the number of tests done per day: Let us suppose that the number of tests per person per day is given by . We show in Fig. 6 the data for the number of tests per 1000 people per day across a set of countries and see that this is around 0.05 for India which means that . If tests are done completely randomly, then the number of detected people (assuming that the tests are perfect) would be and so it is clear that we can identify . It is then clear that this would have no effect on the pandemic control. To have any effect we would need which means around 100 tests per 1000 people per day which is clearly not practical.
However, a better strategy is to do focused tests on the contacts of all those who have been detected on a given day. We now give an estimate of the rate if we followed this strategy. For simplicity of presentation of our argument we here assume and . From our extended SEIR model the number of detected cases per day is given by . In the growing phase we have, from Eq. (19), that and . Hence we get with . The total number of contacts of the individuals would be where is the mean number of contacts of a single infected person. If we perform tests per day on this pool, then the rate of detections will be given by
(45) |
Denoting and noting that we self-consistently solve the above equation to find
(46) |
Now it is clear that unless and are of the same order, TQ will not have much effect on the dynamics and the change in will be small. Setting then gives us the condition
(47) |
Note that in our model we identify as our control rate function that changes from the value 0 to a value over the time scales of a week or so. This means that we would need to change the testing rate in a controlled way such that the condition is maintained. Thus the number of tests/per day has to be proportional to number of new detections/per day and in fact the ratio has to be larger than the average number of contacts, that each infectious person makes. The number is expected to depend on the population density and also how well SD is being implemented. The table in Fig. 7 shows data for the ratio for a set of countries and also how this ratio has evolved over time. While the value of (around May 15) for India appears to be large, it may not be sufficient given that the population densities are much larger than in many other countries and implementation of SD may be less effective. If we assume 20 contacts a day and the number of days before isolation of the individual to be 5 we get the rough estimate of and then the ratio thus has to be at least . This is the minimum value of testing-to-detected ratio that has to be targeted at localities with high infection rates. The details of the arguments presented here are largely independent of the specifics of the particular SEIR model that we study.
3.1. Comparision of intervention strategies
The model details are given in Section 1. We recall that at any given time the total infectious population size is the cumulative affected population (recovered, in hospital or dead) is the reported total confirmed cases is and the reported new daily cases is .
A useful quantity to characterize the system with interventions is the targeted reproduction number [see Eqs. (43) and (44)]
(48) |
We classify intervention strategies by the targeted value. A strong intervention is one where and will achieve suppression of the disease while a weak intervention is one with and will only mitigate the effects of the disease.
Other than an important quantity to characterize the disease growth is the largest eigenvalue of the linearized dynamics (see Section 2.1). In the early phase of the pandemic, all populations other than grow exponentially with time as . As we will see, for the case of strong intervention, becomes negative and gives the exponential decay rate of the disease.
In our numerical study we choose, for the purpose of illustration, the following parameter set: and the rates all in units of day. For the specified choice of parameter values (free case with ) we get which is close to the value observed for the early time data for confirmed cases in India. The corresponding free value of is 3.7665. Note that is not uniquely fixed by (and vice versa) and different choices of parameters can give the same observed but different values of
Choosing these typical parameter values for COVID-19, we now compare the efficacy of strong and weak interventions implemented in four different ways: (1) 6WLD-NTQ: Six weeks lockdown (strong value of SD parameter) and no testing-quarantining, (2) ELD-NTQ: Extended lockdown and no testing-quarantining,(3)NSD-ETQ: No social distancing and extended testing-quarantining, (4) ESD-ETQ: Extended social distancing and extended testing-quarantining. The case with no social distancing and no testing-quarantining is indicated as NSD-NTQ.
We work with a population and initial conditions and . In all cases, we will assume that intervention strategies are switched on when the confirmed number of cases reaches 50 and after that the full intervention values are attained over a time scale of 5 days.
3.1.1. Strong intervention ()
In this case, the exponential growth stops around the time when crosses the value 1. After this time, the infection numbers will start decaying exponentially. Since the infection numbers are still small compared to the total population, one can work with the linearized theory and the magnitude of the largest eigenvalue (now negative) determines the exponential decay rate. For illustrating this case, we take:
Parameter set I [] — We choose three SD and TQ strengths as (i) SD: (ii) TQ: and (iii) SD-TQ: . This choice corresponds to changing the free value of to a target value for all the three different strategies. The largest eigenvalue changes from the free value to the values (i) (ii) (iii) respectively. The results of the numerical solution of the extended SEIR equations are presented Fig. 8 (a) and (b).
Main observations:
-
1.
A six week (or eight week) lockdown is insufficient to end the pandemic and will lead to a second wave. If the interventions are carried on indefinitely, the pandemic is suppressed and only affects a very small fraction of the population (less than ). We can understand all features of the dynamics from the linear theory. In Fig. 8(a,b), intervention is switched on after weeks and the peak in infections appears roughly after a period of 5 days. Thereafter however, the decay in the number of infections occurs slowly, the decay rate being given by the largest eigenvalue (now negative and smaller in magnitude than in the growth phase).
-
2.
We find that for the same targetdifferent intervention schemes (ELD-NTQ, NSD-ETQ, or ESD-ETQ) can give very different values of the decay rateand, in general we find that TQ is more effective than SD. We see that ELD-NTQ ends the pandemic in about 10 months while NSD-ETQ would take around 3.5 months. This can be understood from the fact that the corresponding values (post-intervention) are given by and respectively, i.e, they differ by a factor of about 3. With a mixed strategy where one allows almost three times more social contacts () than for LD case and that requires three times less testing () than for TQ case, we see that the disease is controlled in about 5 months. Hence this appears to be the most practical and effective strategy.
-
3.The expected time for the pandemic to die would be roughly given by
and so it is important that intervention schemes are implemented early and as strongly as possible.(49)
3.1.2. Weak intervention ()
In this case, a finite fraction of the population is eventually affected, but the intervention succeeds in reducing this from its original free value and in delaying considerably the date at which the infections peak. We take the following parameter set for this study:
Parameter set II [] — we choose three SD and TQ strengths as (i) SD: (ii) TQ: and (iii) SD-TQ: . This choice corresponds to changing the free value of to a fixed target value for all the three different strategies. The largest eigenvalue remains positive and changes from the free value to the values (i) (ii) (iii) respectively. The results of the numerical solution of the extended SEIR equations are presented Fig. 8(c) and (d).
Main observations:
-
1.
We find that in this case the peak infections, peak infection time and the final affected population can be obtained from the analytic expressions given by Eqs. (26), (39) and (40) in terms of the basic disease parameters, using their values after interventions are introduced. We assume that the intervention parameters change from the values to their full strength over a short time scale and thereafter remain constant.
-
2.
We find that the peak infection numbers are smallest for the case with ELD-NTQ and they occur at a later stage. These results can also be understood mathematically from the expressions in Eqs. (39) and (40) using the post-intervention values of and (from the linear theory).
-
3.
We note that while weak interventions can slow down and reduce the impact of the pandemic, they do not lead to development of herd immunity of the population (assuming that all the recovered people develop immunity). It is well known that herd immunity is attained when a fraction of the population has developed immunity. Thus herd immunity in the above example would require that i.e of the population be affected, while Eq. (26) with predicts that only about of the population is affected.
3.2. Observation of strong and weak intervention in COVID-19 data
In Fig. 9 we give some examples of data for number of new cases for nine countries where we see that some of the qualitative features seen in the model results in Fig. 8(a,b). In particular we see the fast exponential growth phase and then a much slower decay phase for the first six countries which have succeeded in controlling the disease with various levels of success. On the other hand we see that India, Brazil and Pakistan continue to show a positive and it is clear that intervention schemes need to be strengthened.
4. Difficulties in making predictions from the extended SEIR model: case study for India
In the following we make some heuristic predictions, based on the analytic results in Eqs. (26), (39) and (40) and the present observed data, for daily new cases in India (), in the state of Delhi () and in the city of Mumbai (). The analysis here is based on the assumption of a best case scenario where the value of achieved after various intervention schemes is maintained at a constant value.
Here we assume that intervention has effectively been through SD, with being neglected. We consider the following choice of parameter values which appears to be reasonable for getting a conservative estimate: i.e, we assume that asymptomatics are rd less infectious and recover times faster. This gives us [using Eq. (31)] and the effective reproductive number as . From this last relation we can write . Plugging this into the equation for the eigenvalues, Eq. (13), and replacing by the observed mean exponential growth rate (we choose the values of and which are representative of the values observed in India since around April 10), we see that we basically get an equation for in terms of and . For specific choices of and the observed values of before and after intervention will then give us the corresponding values of .
For our analysis we need to know the total infections on some day (we take this as April 11) and we estimate it in the following way. Suppose that the daily observed cases on this day was (assuming that only the symptomatics are detected). Then we have . From Eq. (32) we have and so the time to the peak can be estimated as . We use Eq. (39) to compute the peak number of infections and the peak daily cases (PDC) is then obtained as PDC. The total affected population fraction can be computed from Eq. (26), using only the knowledge of . If we assume the number of deaths is of all symptomatic cases this gives us an estimate for the total number of deaths as .
The observed daily new cases in India, Delhi and Mumbai on April 10 were around and respectively [30]. For a range of choice of the parameters with and of and two representative values of the post-intervention growth rates, we compute the corresponding values obtained for and . These and the estimates for PDC and are given in Table 1, Table 2, Table 3 for and in Table 4, Table 5, Table 6 for for India, Delhi and Mumbai. Note that while the peak numbers and total affected population and deaths simply scale with population size, the time to peak depends on the daily detected numbers on April 10, and this leads to the observed differences in the time to the peak for the three cases. We also note here that changing the initial conditions by about causes a change of few days in the peak time while the other quantities remain unchanged. The full numerical solution in Fig. 10 also shows that the complete suppression of the disease takes more than 6 months after the peak.
Table 1.
(free) | Peak daily cases (PDC) | Time of peak | Total affected | Total deaths | ||||
---|---|---|---|---|---|---|---|---|
0.5 | 0.2 | 0.67 | 2.28 | 1.33 | 2,456,630 | 158 (2nd week September) | 45% | 1,936,000 |
0.5 | 0.2 | 0.9 | 2.16 | 1.3 | 686,770 | 132 (3rd week August) | 42% | 550,700 |
0.4 | 0.143 | 0.67 | 2.82 | 1.45 | 2,956,600 | 161 (3rd week September) | 55% | 2,363,700 |
0.4 | 0.143 | 0.9 | 2.65 | 1.41 | 832,890. | 136 (4th week August) | 52% | 676,140 |
Table 2.
(free) | Peak daily cases (PDC) | Time of peak | Total affected | Total deaths | ||||
---|---|---|---|---|---|---|---|---|
0.5 | 0.2 | 0.67 | 2.28 | 1.33 | 35,904 | 114 (1st week August) | 45% | 28,296 |
0.5 | 0.2 | 0.9 | 2.16 | 1.3 | 10,037 | 89 (2nd week July) | 42.3% | 8048 |
0.4 | 0.143 | 0.67 | 2.82 | 1.45 | 43,212 | 118 (2nd week August) | 55% | 34,546 |
0.4 | 0.143 | 0.9 | 2.65 | 1.41 | 12,173 | 93 (2nd week July) | 52% | 9882 |
Table 3.
(free) | Peak daily cases (PDC) | Time of peak | Total affected | Total deaths | ||||
---|---|---|---|---|---|---|---|---|
0.5 | 0.2 | 0.67 | 2.28 | 1.33 | 24,566 | 97 (3rd week July) | 45% | 19,360 |
0.5 | 0.2 | 0.9 | 2.16 | 1.3 | 6867 | 71 (4th week June) | 42.5% | 5507 |
0.4 | 0.143 | 0.67 | 2.82 | 1.45 | 29,566 | 100 (3rd week July) | 55% | 23,637 |
0.4 | 0.143 | 0.9 | 2.65 | 1.41 | 8328 | 75 (4th week June) | 52% | 6761 |
Table 4.
(free) | Peak daily cases (PDC) | Time of peak | Total affected | Total deaths | ||||
---|---|---|---|---|---|---|---|---|
0.5 | 0.2 | 0.67 | 2.28 | 1.23 | 1,331,400 | 208 (1st week November) | 34.5% | 1,482,000 |
0.5 | 0.2 | 0.9 | 2.16 | 1.207 | 368,809 | 172 (1st week October) | 32% | 418,500 |
0.4 | 0.143 | 0.67 | 2.82 | 1.31 | 1,651,200 | 214 (1st week November) | 43% | 1,856,300 |
0.4 | 0.143 | 0.9 | 2.65 | 1.28 | 459,776 | 178 (1st week October) | 40.5% | 526,222 |
Table 5.
(free) | Peak daily cases (PDC) | Time of peak | Total affected | Total deaths | ||||
---|---|---|---|---|---|---|---|---|
0.5 | 0.2 | 0.67 | 2.28 | 1.227 | 19,458 | 146 (1st week September) | 34.5% | 21,661 |
0.5 | 0.2 | 0.9 | 2.16 | 1.207 | 5390 | 109 (4th week July) | 32% | 6117 |
0.4 | 0.143 | 0.67 | 2.82 | 1.31 | 24,132 | 152 (1st week September) | 43% | 27,131 |
0.4 | 0.143 | 0.9 | 2.65 | 1.28 | 6719 | 116 (1st week August) | 40.5% | 7691 |
Table 6.
(free) | Peak daily cases (PDC) | Time of peak | Total affected | Total deaths | ||||
---|---|---|---|---|---|---|---|---|
0.5 | 0.2 | 0.67 | 2.28 | 1.23 | 13,316 | 120 (1st week August) | 34.5% | 14,821 |
0.5 | 0.2 | 0.9 | 2.16 | 1.21 | 3688 | 84 (1st week July) | 32% | 4185 |
0.4 | 0.143 | 0.67 | 2.82 | 1.31 | 16,512 | 126 (2nd week August) | 43% | 18,563 |
0.4 | 0.143 | 0.9 | 2.65 | 1.28 | 4597 | 90 (2nd week July) | 40.5% | 5262 |
In Fig. 10 we show results of a numerical solution of the dynamical equations in presence of intervention (SD) for one of the parameter sets in Table 3 and find excellent agreement with our analytic formula in Eqs. (26), (31) and (39). We see that the predicted peak time is off by about . The numerics also shows that the complete suppression of the disease takes more than 6 months after the peak.
We point out that the mixed-population assumption of the SEIR model is expected to be more accurate for a smaller population and so the estimates for Delhi and Mumbai would be more reliable than the one for India. For a big and highly in-homogeneous country like India, smaller regions (states or cities) would have different values of and and also different initial conditions, hence the global values would not capture the local dynamics correctly. It is likely that the numbers in Table 1 are an over-estimate of the true future trajectory. For the state of Delhi and the city of Mumbai these would be more accurate, however we see that the uncertainty in the true value of and other parameters leads to a huge uncertainty in the predictions.
5. Discussion
Several earlier work have discussed, using determinsitic compartmentalized models, the effect of asymptomatic affected population and the effect of intervention measures on the COVID pandemic [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18]. Here we present a somewhat different choice of compartments and perform a careful quantitative comparison of different intervention strategies. A distinction from earlier studies is that we present a number of analytic results which we believe will be useful beyond their immediate application to epidemiological modeling, in more general studies of population dynamics.
To summarize our findings, a modified version of the SEIR model, incorporating asymptomatic individuals, was analyzed in detail. We have obtained a number of analytical results for the full nonlinear model which can be useful in making empirical estimates of various important quantities that provide information on disease progression. We believe that the derivation of these results can be extended to more sophisticated SIR type models including more compartments and more complex interactions. From the linearized dynamics we point out a simple but important property, namely that at early times the motion of the system quickly settles along the direction of the dominant eigenvector. This allows one to determine accurately initial conditions from sparse data. We provided numerical examples to illustrate these ideas and in addition have provided comparisons with real COVID-19 data. Looking at COVID-19 data in several countries, we find that the extended SEIR model captures some important qualitative features and hence could provide guidance in policy-making
We used the extended SEIR model for analyzing the effectiveness of different intervention protocols in controlling the growth of the COVID-19 pandemic. Non-clinical interventions can be either through social distancing or testing-quarantining. Our results indicate that a combination of both, implemented over an extended period may be the most effective and practical strategy. We have attempted to relate real testing rates to the parameters of the model and comment on what the minimum testing rates should be in order for testing-quarantining to be an effective control strategy.
Finally we have used our analytic formulas to make predictions for disease peak numbers and expected time to peak for India, the state of Delhi and the city of Mumbai, pointing out that these predictions would be highly unreliable for India (due to big inhomogeneity in disease progression across the country) and perhaps more reliable for the cases of Delhi and Mumbai. Our main conclusion here is that the lack of precise knowledge of the disease parameters (e.g the fraction of asymptomatic carriers) and changing control strategies lead to rather large uncertainties in the predictions. Nevertheless, we believe that they could perhaps be used to obtain reasonable bounds.
Credit author statement
All authors contributed equally to the project.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
We thank Jitendra Kethepalli and Kanaya Malakar for very helpful discussions and Ranjini Bandyopadhyay, Siddhartha Chatterjee, Joel Lebowitz, Srujana Merugu, Alpan Raval, Sankar Das Sarma and Sriram Shastry for a careful reading of the draft and making useful suggestions. We acknowledge support of the Department of Atomic Energy, Government of India, under project no. 12-R&D-TFR-5.10-1100.
References
- 1.Huang C., et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395:497–506. doi: 10.1016/S0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Coronavirus N.. (2019-nCoV) situation report dated 21 January 2020, world health organization. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200121-sitrep-1-2019-ncov.pdf.
- 3.https://www.worldometers.info/coronavirus/.
- 4.Kucharski A.J., et al. Early dynamics of transmission and control of COVID-19: a mathematical modelling study. Lancet Infect Dis. 2020;20:533–538. doi: 10.1016/S1473-3099(20)30144-4. [DOI] [PMC free article] [PubMed] [Google Scholar]; https://doi.org/10.1016/S1473-3099(20)30144-4
- 5.Prem K., Liu Y., Russell T.W., et al. The effect of control strategies to reduce social mixing on outcomes of the COVID-19 epidemic in Wuhan, China: a modelling study. lancet public health. 2020. (published online March 25.) 10.1016/S2468-2667(20)30073-6. [DOI] [PMC free article] [PubMed]
- 6.Ferguson N., et al. Report 9:impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand. 2020. 10.25561/77482. [DOI] [PMC free article] [PubMed]
- 7.Tang B., et al. An updated estimation of the risk of transmission of the novel coronavirus (2019-nCoV) Infect Dis Model. 2020;5:248. doi: 10.1016/j.idm.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Giordano G., et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in italy. Nat Med. 2020 doi: 10.1038/s41591-020-0883-7. [DOI] [PMC free article] [PubMed] [Google Scholar]; https://doi.org/10.1038/s41591-020-0883-7
- 9.Gatto M., et al. Spread and dynamics of the COVID-19 epidemic in Italy: effects of emergency containment measures. Proceedings of the National Academy of Sciences 202004978. 10.1073/pnas.2004978117. [DOI] [PMC free article] [PubMed]
- 10.Ray D., et al. Predictions, role of interventions and effects of a historic national lockdown in India’s response to the COVID-19 pandemic: data science call to arms. 2020;medRxiv, 10.1101/2020.04.15.20067256. [DOI] [PMC free article] [PubMed]
- 11.Sarkar K., Khajanchi S.. Modeling and forecasting of the COVID-19 pandemic in India. 2020. arXiv:2005.07071 [DOI] [PMC free article] [PubMed]
- 12.Shekatkar S., et al. INDSCI-SIM: a state-level epidemiological model for India. https://indscicov.in/indscisim.
- 13.Agrawal S., et al. COVID-19 epidemic: Unlocking the lockdown in India (working paper), IISc-TIFR technical report. https://covid19.iisc.ac.in/wp-content/uploads/2020/04/Report-1-20200419-UnlockingTheLockdownInIndia.pdf.
- 14.Venkateswaran J., Damani O.. Effectiveness of testing, tracing, social distancing and hygiene in tackling COVID-19 in India: a system dynamics model. 2020. arXiv:2004.08859
- 15.Frank T.D. COVID-19 order parameters and order parameter time constants of Italy and China: a modeling approach based on synergetics. J Biol Syst. 2020;28:589–608. [Google Scholar]
- 16.Piovella N. Analytical solution of SEIR model describing the free spread of the COVID-19 pandemic. Chaos Solitons Fractals. 2020;140:110243. doi: 10.1016/j.chaos.2020.110243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Meidan D., et al. Alternating quarantine for sustainable mitigation of COVID-19. 2020. arXiv.org:2004.01453 [DOI] [PMC free article] [PubMed]
- 18.Wong G.N. et al. Modeling COVID-19 dynamics in illinois under non-pharmaceutical interventions. 2020. arXiv:2006.02036
- 19.Li M.Y. Springer; 2018. An introduction to mathematical modeling of infectious diseases. [Google Scholar]
- 20.Coronavirus (COVID-19) testing. https://ourworldindata.org/coronavirus-testing.
- 21.Giesecke J. CRC Press; 2017. Modern infectious disease epidemiology. [Google Scholar]
- 22.Andersson H., Britton T. Springer; New York: 2000. Stochastic epidemic models and their statistical analysis. [Google Scholar]
- 23.Liu Y., Gayle A.A., Wilder-Smith A., Rocklöv J. The reproductive number of COVID-19 is higher compared to SARS coronavirus. J Travel Med. 2020;27(2) doi: 10.1093/jtm/taaa021. [DOI] [PMC free article] [PubMed] [Google Scholar]; https://doi.org/10.1093/jtm/taaa021
- 24.Leung K.Y., Trapman P., Britton T. Who is the infector? Epidemic models with symptomatic and asymptomatic cases. Math Biosci. 2018;301:190. doi: 10.1016/j.mbs.2018.04.002. [DOI] [PubMed] [Google Scholar]
- 25.Brauer F., Driessche P.V., Wu J. Springer; 2008. Lecture notes in mathematical epidemiology. [Google Scholar]
- 26.J. Li J., Lou Y. Characteristics of an epidemic outbreak with a large initial infection size. J Biol Dyn. 2016;10:366. doi: 10.1080/17513758.2016.1205223. [DOI] [PubMed] [Google Scholar]; https://doi.org/10.1080/17513758.2016.1205223
- 27.Lax P.D. Wiley; 2013. Linear algebra and its applications. [Google Scholar]
- 28.Frank T.D. COVID-19 interventions in some european countries induced bifurcations stabilizing low death states against high death states: an eigenvalue analysis based on the order parameter concept of synergetics. Chaos Solitons Fractals. 2020;140:110194. doi: 10.1016/j.chaos.2020.110194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Johns hopkins center for systems science and engineering coronavirus COVID-19 global cases. https://systems.jhu.edu/research/public-health/ncov/.
- 30.Data for India and Delhi are from the website. https://www.kaggle.com/sudalairajkumar/covid19-in-india/data#covid_19_india.csv/. The numbers for Mumbai are estimated from newspaper reports.