At the beginning of the second half of the 20th century, there was a widespread belief that science and in particular medicine had progressed so far that Nature could be brought under complete control. It seemed that healthcare and pharmacology were in the position to prevent or to cure almost all diseases. In the 1980s, for example, the pharmaceutical industry stopped the search for new antibiotic drugs that would be badly needed nowadays in the light of the universal capabilities of bacteria to develop resistance factors. At about the same time previously unknown or unnoticed virus transmitted infectious human diseases appeared: acquired immunodeficiency syndrome caused by human immunodeficiency virus (HIV), Ebola caused by Ebola virus (EBOV) and four related other strains of filoviridae, as well as severe acquired respiratory syndrome (SARS) brought about by SARS coronavirus. Caused by prions and not by a virus is been bovine spongiform encephalopathy (BSE). Nevertheless, it gave rise to an equally serious new epidemic. These and other cases as well as the consequences of the “antivaccination movement” 1, 2, for example, the recent reoccurrence of pertussis and measles, revived a need of reliable models in epidemiology. In particular, the recent Ebola epidemic starting in December 2013 in West Africa 3 initiated a new boom in theoretical work on infectious disease dynamics 4. In PLoS Currents Outbreaks I counted 27 articles between the first publication on the recent Ebola epidemics on May 02, 2014 until March 09, 2015. In December 2014, researchers became aware that the predictions made 3 months earlier, in Fall 2014, apparently overstated the numbers of cases and deaths. A recent theoretical paper aims at an analysis of the prediction errors and provides suggestions how to make better forecasts 5. In this essay, we shall be concerned with the predictive power of one frequently used model denoted as susceptible‐exposed‐infectious‐removed (SEIR) model, and try to analyze typical general problems of predictions from early stages of exponentially growing systems to the final outcomes of the processes. In the focus are the model inherent limitations of reliabilities and not the lack of information or external problems like insufficient data or the uncertainty about the effectiveness of intervention strategies or countermeasures.
Before considering epidemiological models, we illustrate the problem of predicting the longtime behavior of a process that starts with exponential growth and later on goes into saturation by means of a well‐known simple example: the logistic equation introduced by Pierre‐François Verhulst 6 in 1838. The equation describes growth of a multiplying population in a world of finite resources. The model is formulated in the language of ordinary differential equations and can be solved by the standard analytical techniques of calculus
(1) |
Herein, N(t) is the number of multiplying individuals or the population size as a function of time, N 0 = N(0) is the initial value, r is the growth or Malthus parameter, and C is called the carrying capacity and represents a parameter, which determines the maximum number of individuals that can be sustained by the ecosystem (Figure 1).
The initial phase of the process (t < 3) yields practically the same curves for all values of C within the line width. It is straightforward to calculate an approximate expression for N(t) at short times by means of series expansions that has the following two first terms:
(2) |
for small t
Equation (2) illustrates well the problem of parameter fitting to data from the initial phase: In cases where N 0 is small and C is large and N 0/C << 1, the growth parameter r can be determined from the slope of the linearized growth curve, N(t) ≈ N 0 + N 0 r t, as N 0/C is only a small correction. The determination of the second parameter C is tricky, in particular for small N 0/C because the first and the second coefficient of the series expansion are needed and the equations for the fit of C are commonly ill‐conditioned. Unfortunately small N 0 and large C, being tantamount to a few infected individuals—in extreme but not uncommon cases we have N 0 = 1—and a comparatively large population of susceptible individuals, are almost always the case in the spread of epidemics. For very good reasons, Aaron King et al. 5 emphasize the requirement of highly reliable data and sophisticated high‐level stochastic methods for parameter estimations and potential error predictions from the early phase of an exponentially growing system to a final situation when the process is under control. The problem is very general and not restricted to epidemiology. The politically reasonable but scientifically unacceptable flaw in the predictions of the famous Club of Rome 7 is precisely of the same kind: You cannot extrapolate from the initial phase of an exponentially growing process to the final situation unless additional information is or additional data are available.
The SEIR model of epidemiology is very popular and despite relative simplicity it has interesting dynamical features that are highly relevant for prediction. The model describes infection dynamics in a population of initially susceptible individuals and attributes four different states to the people: S means susceptible to infection, E implies exposed and infected but not yet showing the symptoms and not being infectious, I stands for being infected, showing the symptoms and being infectious, and eventually R stands for the number of individuals being out of the game by either being recovered and immune to further infection or being dead. For population dynamics, the SEIR model can be casted into equations of chemical kinetics:
(3a) |
(3b) |
(3c) |
The rate parameters commonly attributed to the three steps are denoted by β/C, k, and γ. As a matter of fact, the epidemiological mechanism (3) is nothing but a chemical reaction consisting of three consecutive steps, and it can be cast into the differential equation
(4) |
Herein, concentrations or numbers of individuals are denoted by [S] = s, [E] = e, [I] = i, [R] = r. Because of the conservation relation, s + i + e + r = C, the model has three independent variables. To take into account intrinsic stochastic effects, the SEIR model can be formulated as a renewal process 4 that is frequently used in theoretical epidemiology 8. To visualize fluctuations, we shall consider here the expectation values embedded in the one‐standard deviation band derived from numerical simulations of the chemical master equation of mechanism (3) 9. The results are shown in Figure 2: As expected the solution curves of the deterministic approach lie within the one‐standard deviation band but do not exactly coincide with the expectation values E(x), x ≡ s,e,i or r—exact coincidence can only be expected for first order or linear processes 10 and, indeed, the step (3a) is corresponding to a bimolecular second order reaction. The SEIR mechanism contains two‐step autocatalysis: S + I → E + I → 2 I resulting from (3a) and (3b). One special property of an autocatalytic system concerns the requirement of a non‐zero concentration of the autocatalyst as no reaction will take place otherwise. In the language of epidemiology this fact is trivial: There is no outbreak without infectious individuals. Autocatalysis gives also rise to a number of features, which are uncommon in conventional chemical kinetics. One property that is relevant for the problem of prediction will be discussed in detail in the next paragraph. It concerns the longtime result, which in chemistry is commonly a single state that does not depend on initial conditions. In case of autocatalysis, however, the final state can be a one‐dimensional (1D) manifold and the particular marginally stable stationary points depend on the initial values of the variables (Figure 3; see also a somewhat humoristic but mathematically serious analysis of the equilibria of epimemiological models and their stability see 11). In the SEIR and as well in the simpler susceptile‐infectious‐recoverd (SIR) model, the final outcome in the sense of the residual “susceptibles,” , is a function of the number of initially infected people expressed by I(0).
Next, we perform stationary state analysis of the SEIR model (4) and calculate the longtime solutions from the equations . The result is not a single stationary state but a 1D manifold of states defined by
(5a) |
The number of individuals, which are not infected by the disease is limited by . Stability analysis of this 1D line of stationary points by means of the evaluation of the eigenvalues of the Jacobian matrix at the steady states yields three eigenvalues, one is zero and two are real:
(5b) |
The zero eigenvalue expresses marginal stability and concerns migration along the line of stationary states. The other two eigenvalues are both negative in the range with and depending on the initial conditions the longtime solution will lie somewhere in this range. An example is shown in Figure 3, the larger the initial number of infected individuals is, the smaller is , the number of people that remained unaffected by the epidemic (The parameter values for the calculations are taken from the 1995 Ebola epidemic in Congo 12).
In this context, it is illustrative to mention a result of the SIR model in the simplest version, which does not consider exposed individuals—as said above, S + I → 2 I and I → R with the two rate parameters β and γ. For simplicity, we replace the numbers of individuals by normalized variables s ⇒ s/C, i ⇒ i/C, r ⇒ r/C such that s + i + r = 1. The analysis of the system reveals again a one‐dimensional manifold of stationary points with and Stability analysis yields stable points in the range and unstable points for , where . In Figure 4, it is shown how the initial ratio of susceptible to recovered or here better immune individuals determines the final number of people that were affected by the epidemic: The more susceptibles are initially present, the smaller is the number of individuals, which remain spared from the disease. Moreover, in the range no epidemic can occur and this has a straightforward consequence for vaccination: If the fraction of immune people, r/(r + s), is above 1 − , the population cannot be invaded by the proliferating disease but, of course, the nonvaccinated individual can still be infected. From γ being larger than β follows and the population is stable in the whole range, , and no outbreak can occur. The fraction of immune people is very hard to know a priori and this implies another uncertainty in modeling.
The parameter β determines the rate of infection and may be controlled by successful intervention strategies. In modeling this can be taken into account using the parameter as a function of time 12
(6) |
The parameter τ is the instant when the interventions start and q is the time constant of the implementation of the countermeasures. This approach is particularly appropriate for Ebola in developing countries since β 1 is there much smaller than β 0. The quantity, which is the key to modeling the course of an epidemic outbreak, is the basic reproduction number R 0, which is a measure of the average number of secondary infections caused by a single primary case in a pool of mostly susceptible individuals 12. For R 0 > 1, the epidemic is spreading whereas R 0 < 1 implies that the infections are under control. The primary goal of politics and healthcare is to drive β(t) below one. The basic reproduction number in a population before intervention is simply given by R 0 = β 0/γ, and the relation to the critical fraction of susceptible individuals is = γ/β 0 = . Although being a flexible and appropriate approximation for real situations, Eq. (6) provides an additional problem for the modelers: It contains three new parameters β 1, τ, and q that have to be derived from data and further information about the course of the epidemic.
Summarizing the current situation in modeling epidemics in view of the recent Ebola outbreak in West Africa we come to the main conclusions: (i) The epidemic provided a unique occasion for testing epidemiological models. (ii) Extrapolations from the phase of exponential growth to the final outcome—as rigorous and careful they might be 13—are very risky because they suffer not only from external uncertainties but also from intrinsic instabilities that are common in processes with autocatalytic steps. Elaborate statistics can substantially improve the situation and reduce the problem if enough and reliable data are available 5. Low robustness of the predictions results also from dynamical systems that are more complex than conventional chemical reaction networks. Simple models can, nevertheless, give important insights and help to understand complex relations.
In Table 1, finally we present some recent data in the light of the estimates of Ebola cases in early 2015 that were calculated to reach from several hundred thousand to even a million and more. Fortunately enough the severeness of the Ebola epidemic in West Africa so far has been overestimated by orders of magnitude. In addition, the dynamics of the epidemic was and is different in the three countries. At the beginning of March Liberia had reported no new case since 42 days but a few single cases were reported after that, whereas in Guinea and Sierra Leone the disease continues at a slowly down going rate.
Table 1.
Country | March 15, 2015 | Confirmed Last Weeka | March 29, 2015 | Confirmed Last Weeka | |
---|---|---|---|---|---|
Guinea | Cases | 3389 | 95 | 3492 | 57 |
Deaths | 2224 | 2314 | |||
Sierra Leone | Cases | 11,751 | 55 | 11,974 | 25 |
Deaths | 3691 | 3799 | |||
Liberia | Cases | 9526 | 0 | 9712 | 0 |
Deaths | 4264 | 4332 | |||
Total | Cases | 24, 666 | 150 | 25,178 | 79 |
Deaths | 10, 179 | 10,445 |
The numbers are the confirmed new Ebola cases with the last week before the date.
The recent Ebola outbreak is so well‐documented that it provides a unique opportunity for theoretical epidemiologists to test the existing and to develop new models. The take‐home lesson nevertheless is: whenever one cannot avoid extrapolating a process from the growth phase to saturation, special care in needed because the early data contain little information on the parameters that determine the late behavior. Complex dynamics may enlarge the uncertainties.
Footnotes
The SIR model is distiguished from the SEIR model by the omission of the state “exposed.” It contains one‐step autocatalysis, S + I → 2 I , and has many features in common with the SEIR model, in particular, the outcome of the SIR process depends also on the initial conditions (see next paragraph).
REFERENCES
- 1. Wolfe, R.M. ; Sharp, L.K. Anti‐vaccinationists past and present. Br Med J 2002, 335, 430–432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Poland, G.A. ; Jacobson, R.M. The age‐old struggle against the antivaccinationists. N Engl J Med 2011, 364, 97–99. [DOI] [PubMed] [Google Scholar]
- 3. Chowell, G. ; Nishiura, H. Transmission dynamics and control of Ebola virus disease (EVD): A review. BMC Med 2014, 12, e196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Nishiura, H. ; Chowell, G. Theoretical perspectives on the infectiousness of Ebola virus disease. Theor Biol Med Model 2015, 12, e1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. King, A.A. ; Domenech de Cellès, M. ; Magpantay, F.M.G. ; Rohani, P. Avoidable errors in the modelling of outbreaks of emerging pathogens, with special reference to Ebola. Proc Biol Sci 2015, 282, e20150347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Verhulst, P.‐F. Notice sur la loi que la population pursuit dans son accroisement. Corresp Math Phys 1838, 10, 113–121. [Google Scholar]
- 7. Meadows, D.H. ; Meadows, D.L. ; Randers, J. ; Behrens, W.W., III . The Limits of Growth; Universe Books: New York, 1972. [Google Scholar]
- 8. Diekmann, O. ; Heesterbeek, J.A.P. Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation; Wiley: Chichester, UK, 2000. [Google Scholar]
- 9. Gillespie, D.T. Stochastic simulation of chemical kinetics. Annu Rev Phys Chem 2007, 58, 35–55. [DOI] [PubMed] [Google Scholar]
- 10. van Kampen, N.G. Stochastic Processes in Physics and Chemisty, 3rd ed; Elsevier: Amsterdam, 2007; pp 122–127. [Google Scholar]
- 11. Munz P.; Hudea I.; Imad J.; Smith R.J. When zombies attack: Mathematical modelling of an outbreak of zombie infection. In: Infectious disease modelling research progress; Tchuenche J.M.; Chuyaka C., Eds.; Nova Science Publishers: Hauppauge, NY, 2009; pp 133–150. [Google Scholar]
- 12. Chowell, G. ; Hengartner, N.W. ; Castillo‐Chavez, C. ; Fenimore, P.W. ; Hyman, J.M. The basic reproductive number of Ebola and the effect of public health measures: The cases of Congo and Uganda. J Theor Biol 2004, 229, 119–126. [DOI] [PubMed] [Google Scholar]
- 13. Anderson, P.M. ; May, R.M. Infectious Diseases of Humans; Oxford University Press: Oxford, UK, 1991. [Google Scholar]
- 14. Hunt, A.G. Exponential growth in Ebola. Outbreak since May 14, 2014. Complexity 2014, 20, 8–11. [Google Scholar]
- 15.Public Health England. Ebola Epidemiological Update No. 27, March 20, 2015.
- 16. World Health Organization (WHO) . Ebola situation report, April 01, 2015.