Abstract
The increasing use of mathematical models for epidemic forecasting has highlighted the importance of designing models that capture the baseline transmission characteristics in order to generate reliable epidemic forecasts. Improved models for epidemic forecasting could be achieved by identifying signature features of epidemic growth, which could inform the design of models of disease spread and reveal important characteristics of the transmission process. In particular, it is often taken for granted that the early growth phase of different growth processes in nature follow early exponential growth dynamics. In the context of infectious disease spread, this assumption is often convenient to describe a transmission process with mass action kinetics using differential equations and generate analytic expressions and estimates of the reproduction number. In this article, we carry out a simulation study to illustrate the impact of incorrectly assuming an exponential-growth model to characterize the early phase (e.g., 3–5 disease generation intervals) of an infectious disease outbreak that follows near-exponential growth dynamics. Specifically, we assess the impact on: 1) goodness of fit, 2) bias on the growth parameter, and 3) the impact on short-term epidemic forecasts. Our findings indicate that devising transmission models and statistical approaches that more flexibly capture the profile of epidemic growth could lead to enhanced model fit, improved estimates of key transmission parameters, and more realistic epidemic forecasts.
1. Introduction
Standard models of disease transmission and control are largely based on systems of differential equations where infectious disease spreads following an initial exponential growth phase (Anderson and May, 1991, Diekmann and Heesterbeek, 2000, Kermack and McKendrick, 1937, Ross, 1911). Further, the application of disease transmission models for forecasting the likely short-term and long-term impact of an infectious disease outbreak on morbidity and mortality has received more attention with the increasing number of emerging infectious diseases affecting our globalized world. Hence, it is important to better understand how certain modeling assumptions in transmission models of disease spread and control affect estimates of key epidemiological parameters and predictions relating to epidemic impact (e.g., peak size, epidemic size, duration) (Chowell et al., 2016a, Fisman et al., 2013, Hsieh, 2010, Wang et al., 2012). Here we use epidemic simulations and the generalized-growth model (Viboud, Simonsen, & Chowell, 2016) to underscore the importance of capturing the baseline transmission characteristics of an infectious disease outbreak in order to reliably forecast and infer key transmission parameters. For instance, models that fail to provide a good approximation of the initial transmission dynamics of an infectious disease outbreak would not be able to pass an initial validation phase.
In this article, we carry out a simulation study to illustrate the impact of incorrectly assuming an exponential-growth model to characterize the early phase (e.g., 3–5 disease generation intervals) of an infectious disease outbreak that follows near-exponential growth dynamics. Specifically, we assess the impact on: 1) goodness of fit, 2) bias on the growth parameter, and 3) the impact on short-term epidemic forecasts.
2. Methods
2.1. A phenomenological model to characterize early epidemic growth patterns
In order to characterize sub-exponential growth dynamics during the early growth phase of an infectious disease outbreak, a phenomenological approach based on the generalized-growth model was recently introduced in ref. (Viboud et al., 2016). In particular, the generalized-growth model allows relaxing the assumption of exponential growth via a “deceleration of growth” parameter, p. The model is given by the following differential equation:
(1) |
where C′(t) describes the incidence growth phase over time t, the solution C(t) describes the cumulative number of cases at time t, r is a positive parameter denoting the growth rate, and p, the “deceleration of growth” parameter varied between 0 and 1.
If p=0, this equation describes constant incidence over time and the cumulative number of cases grows linearly while p=1 models exponential growth dynamics. Intermediate values of p between 0 and 1 describe sub-exponential (e.g. polynomial) growth patterns. In semi-logarithmic scale, exponential growth patterns are visually evident when a straight line fits well several consecutive disease generations of epidemic growth, whereas a downward curvature in semi-logarithmic scale indicates early sub-exponential growth dynamics. For sub-exponential growth (i.e.,0<p<1), the generalized-growth model has a closed-form solution which is given by (Tolle, 2003):
As explained earlier (Viboud et al., 2016), simulations derived from the generalized-growth model display different epidemic growth profiles, as the “deceleration” parameter (p) is varied between zero and one. These profiles include linear incidence (i.e., p=0.5), concave-up incidence (p>0.5), and concave-down incidence (p<0.5) patterns.
The generalized-growth model is a simple tool that can be used to characterize the early epidemic growth profile from case incidence data as well as from synthetic data derived from transmission models via stochastic simulation (Viboud et al., 2016). Here we employ this phenomenological model and stochastic simulations to demonstrate the impact of assuming exponential growth dynamics for processes that are characterized by sub-exponential epidemic growth dynamics just slightly below the exponential growth regime at p=1.
2.2. Simulation study to assess the impact of exponential growth assumptions
We carried out stochastic simulations of early growth outbreak dynamics that follows near-exponential growth dynamics during 3–5 generations of disease transmission (each generation is fixed at 5 days) using the generalized-growth model. For this purpose, we fixed the growth rate parameter “r” at 0.4 per day, the initial number of cases C(0) = 1, and varied the ‘deceleration of growth’ parameter p between 0.9 and 0.96, a range that falls slightly below the exponential growth regime at p=1.
Stochastic simulations of the generalized-growth model were generated through a Poisson simulation approach (Gustafsson & Sternad, 2007). To assess the impact of assuming exponential growth in the context of near-exponential growth dynamics, we estimated the growth rate “r” for the exponential growth model for each of the 500 stochastic realizations to generate a distribution of the estimates of the growth rate “r”. For comparison, we also jointly estimated both the growth rate “r” and the deceleration of growth parameter “p” from each of the 500 stochastic realizations using the generalized-growth model (the “true model”). We estimated parameters through nonlinear least-square curve fitting to the case incidence curve modeled by equation C′(t) as described in ref. (Viboud et al., 2016). The initial number of cases C(0) was fixed according to the simulated data (i.e., C(0) = 1).
2.3. Short-term forecasting approach
Using the ensemble of stochastic simulations of early growth dynamics, we generated short-term forecasts (e.g., forecasting horizon comprising 3 generations) of epidemic growth following a model calibration period, which consisted of the early growth phase comprising only 3 generations of disease transmission. The forecasts generated by the generalized-growth model were compared with the “true” ensemble of stochastic simulations of early epidemic growth during 6 generations of disease transmission. We also compared the goodness of fit that the models yielded during the calibration period comprising 3 generations of disease transmission.
3. Results
The stochastic simulations of early case incidence growth that follow near-exponential growth according to the generalized-growth model with the ‘deceleration of growth’ parameter p between 0.9 and 0.96, and the growth rate parameter “r” is fixed at 0.4 per day that were employed in our analyses are displayed in Fig. 1.
We illustrate the impact of inferring the growth rate parameter “r” from early case incidence growth data that follows a near-exponential growth process assuming a simple exponential growth model rather than the generalized-growth model (the “true model”). The distributions of the growth rate parameter estimates derived from fitting the exponential growth model to each of 500 stochastic simulations derived from the generalized-growth model using 3, 4, and 5 generations of disease transmission are shown in Fig. 2. Overall, estimates of “r” derived using the wrong “exponential growth” model suffer from increasing bias as the length of the early growth phase increases from 3 to 5 disease generations while inference using the “true model” confirms ascertainment using the true growth rate “r” with decreasing variance in the estimate as the length of the early growth phase increases. We also evaluated the impact of the deceleration of growth parameter “p” on estimates of the growth rate “r” derived using the simple exponential growth model. Moreover, the bias in estimates of the growth rate parameter rapidly increases as the deceleration of growth parameter decreases, moving away from the exponential growth regime (Fig. 3).
We also assessed the goodness-of-fit provided by the exponential growth model when it is calibrated using synthetic epidemic growth data that follows near-exponential growth dynamics using the generalized-growth model (Fig. 4). Overall, the goodness-of-fit provided by the exponential growth model rapidly deteriorates as the growth phase increases or the deceleration of growth parameter is decreased below the exponential growth regime (Fig. 5).
The impact of assuming exponential growth for generating short-term epidemic forecasts when the underlying growth process is just below the exponential growth regime is illustrated in Fig. 6. That is, we first calibrated the exponential growth model to the first 3 generations of disease transmission for each of 500 stochastic simulations of early growth disease transmission derived using the generalized-growth model where each disease generation is fixed at 5 days, the ‘deceleration of growth’ parameter p is set at 0.96, the true growth rate parameter is set at 0.4 per day. Using the calibrated models (the exponential growth model and the “true model” based on the generalized-growth model), we forecasted epidemic growth for the following 3 disease generations for each epidemic realization. In particular, these results illustrate how rapidly the epidemic forecasts diverge within just a few generations of disease transmission even when the “true” growth dynamics fall slightly below the exponential growth regime.
4. Discussion
It is often taken for granted that the early growth phase of different growth processes follow exponential growth dynamics. In the context of infectious disease spread, this assumption is often convenient to describe a transmission process with mass action kinetics using differential equations (Bailey, 1975, Ross, 1911) and generate analytic expressions and estimates of the reproduction number (Anderson and May, 1991, Diekmann et al., 1990). Our results demonstrate that designing transmission models and statistical approaches that more flexibly capture the profile of epidemic growth, beyond the exponential growth assumption, and its associated uncertainty could lead to enhanced model fit, improved estimates of key transmission parameters, and more realistic epidemic forecasts.
Extrapolations of epidemic impact from the early growth trend in case incidence of an epidemic are subject to both model and data uncertainty. Indeed, in the context of limited epidemic data on the early epidemic growth phase, it is possible that 1) the data do not convey sufficient information to reliably ascertain the profile of epidemic growth and determine whether an epidemic is expected to follow exponential or polynomial-like growth dynamics, resulting in a forecast associated with wide uncertainty and/or 2) the trend in the early growth pattern is not faithfully captured by the growth assumptions specified in the model (e.g., model assumes a fixed type of epidemic growth). Consequently, transmission models calibrated using a few data points of the early phase of an infectious disease outbreak assuming exponential growth epidemic dynamics, such as the widely used SIR-type compartmental models, are unable to predict anything other than an exponentially growing epidemic in the absence of interventions or behavior changes until susceptible depletion sets in. Hence, our results highlight the need to consider less rigid transmission modeling frameworks that capture a wider range of epidemic growth profiles including the possibility of sub-exponential (e.g., polynomial) epidemic growth. More flexible models should be better equipped with different mechanisms to fit the early growth trend of an epidemic process and provide more realistic uncertainty bounds for short- and long-term epidemic forecasts. Such mechanisms include changes in population behavior that mitigate transmission and spatial heterogeneity in underlying network of contact over which the disease spreads.
Analyses of the initial epidemic growth phase comprising approximately 3–5 generations of disease transmission across infectious disease outbreaks for a range of diseases including influenza, Ebola, foot-and-mouth disease, HIV/AIDS, plague, measles and smallpox has revealed a diversity of epidemic growth profiles that were accompanied with substantial uncertainty (Viboud et al., 2016). For instance, high values for p above 0.85 were estimated for a major plague epidemic in Bombay in 1905, the 1918 influenza pandemic in San Francisco, and a smallpox outbreak in Khulna, Bangladesh in 1972. Slower growth profiles were quantified for the foot-and-mouth disease outbreak in Uruguay at the farm level with a low mean estimate of p at 0.4 and the HIV/AIDS epidemic in Japan (1985–2012) at ∼0.5 which is consistent with an approximately linear growth pattern (Viboud et al., 2016). Similarly, a recent analysis of the early growth dynamics of a Zika epidemic in Colombia indicated polynomial growth with p ∼0.4–0.7 (Chowell et al., 2016a). For the district-level Ebola epidemic outbreaks in West Africa, we found that p varied substantially with an overall mean at ∼0.6 (Viboud et al., 2016). The districts of Margibi in Liberia, and Bombali and Bo in Sierra Leone, displayed near exponential growth (p close to 1), while the districts of Bomi in Liberia and Kenema in Sierra Leone displayed particularly slow growth (p near 0.1). An intermediate pattern of growth has been observed for the district of Western Area Urban in Sierra Leone.
The diversity of epidemic growth profiles observed in real epidemic outbreaks underscores the importance of understanding the mechanisms that generate them. Our results emphasize the need to consider sub-exponential growth patterns in the design and parameterization of mathematical transmission models (Chowell, Sattenspiel, Bansal, & Viboud, 2016b). Indeed, past modeling efforts have incorporated mechanisms to account for slower than exponential growth patterns including models that gradually mitigate the transmission rate over time (Chowell, Hengartner, Castillo-Chavez, Fenimore, & Hyman, 2004) or incorporate phenomenological parameters to capture non-homogeneous population mixing (Finkenstädt and Grenfell, 2000, Grenfell et al., 2001) and models with particular spatial structuring (Chowell et al., 2006, Eubank et al., 2004, Ferguson et al., 2001, Halloran et al., 2002, Keeling et al., 2003, Kiskowski, 2014, Kiskowski and Chowell, 2015, Merler et al., 2015, Riley and Ferguson, 2006). A recent review article (Chowell et al., 2016b) surveys mathematical modeling approaches that are useful for capturing a diversity of early epidemic growth profiles, ranging from sub-exponential to exponential growth. These include models incorporating spatial details or realistic population mixing structures, including meta-population models, individual-based network models, and simple SIR-type models that incorporate the effects of reactive behavior changes or inhomogeneous mixing (Chowell et al., 2016b).
An enhanced understanding of observed epidemic growth patterns and the mechanisms that generate them for different pathogens and across temporal and social contexts should prove useful to improve our ability to design disease transmission models that are able to capture specific early epidemic growth profiles. Sub-exponential growth patterns have important implications on the characterization and interpretation of the basic reproduction number R0, a key quantity for disease control (Anderson and May, 1991, Diekmann and Heesterbeek, 2000, van den Driessche and Watmough, 2002). R0 is often estimated by fitting epidemic models that rely on exponential growth assumptions to time-series of case reports during the early epidemic ascending phase (Diekmann and Heesterbeek, 2000, Roberts and Heesterbeek, 2007, Wallinga and Lipsitch, 2007). However, approaches for estimating R0 in the context of epidemics that follow early sub-exponential growth dynamics have been lacking. One such approach based on the generalized-growth model was recently introduced in ref. (Chowell, Viboud, Simonsen, & Moghadas, 2016c). This study demonstrated that in contrast to the invariant reproduction numbers for epidemics that are well-characterized by exponential growth dynamics during the initial epidemic phase, epidemics that follow an early sub-exponential growth phase display an effective reproduction number that asymptotically declines towards unity.
Our findings have implications in a number of biological processes beyond the spread of infectious disease. Our results highlight significant impact on goodness of fit, parameter estimation, and short-term epidemic forecasting when incorrectly assuming an early exponential-growth process that truly follows near-exponential growth dynamics. Transmission models and statistical approaches that more flexibly capture the profile of epidemic growth are already starting to show promising results in improving model fit, estimates of growth parameters, and epidemic forecasts.
Funding
GC and CV acknowledge financial support from the Division of International Epidemiology and Population Studies, The Fogarty International Center, US National Institutes of Health. GC was also supported from NSF grant 1414374 as part of the joint NSF-NIH-USDA Ecology and Evolution of Infectious Diseases program; UK Biotechnology and Biological Sciences Research Council grant BB/M008894/1.
Footnotes
Peer review under responsibility of KeAi Communications Co., Ltd.
References
- Anderson R.M., May R.M. Oxford University Press; Oxford: 1991. Infectious diseases of humans. [Google Scholar]
- Bailey N.T.J. Hafner; New York: 1975. The mathematical theory of infectious disease and its applications. [Google Scholar]
- Chowell G., Hengartner N.W., Castillo-Chavez C., Fenimore P.W., Hyman J.M. The basic reproductive number of ebola and the effects of public health measures: The cases of Congo and Uganda. Journal of Theoretical Biology. 2004;229:119–126. doi: 10.1016/j.jtbi.2004.03.006. [DOI] [PubMed] [Google Scholar]
- Chowell G., Hincapia-Palacio D., Ospina J., Pell B., Tariq A. Using phenomenological models to characterize transmissibility and forecast patterns and final Burden of Zika epidemics. PLOS Currents Outbreaks. 2016 doi: 10.1371/currents.outbreaks.f14b2217c902f453d9320a43a35b9583. pii:ecurrents.outbreaks.f14b2217c902f453d9320a43a35b9583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chowell G., Rivas A.L., Hengartner N.W., Hyman J.M., Castillo-Chavez C. The role of spatial mixing in the spread of foot-and-mouth disease. Preventive Veterinary Medicine. 2006;73:297–314. doi: 10.1016/j.prevetmed.2005.10.002. [DOI] [PubMed] [Google Scholar]
- Chowell G., Sattenspiel L., Bansal S., Viboud C. Mathematical models to characterize early epidemic growth: a review. Physics of Life Reviews. 2016 doi: 10.1016/j.plrev.2016.07.005. pii: S1571-0645(16)30064-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chowell G., Viboud C., Simonsen L., Moghadas S.M. 2016. Characterizing the reproduction number of epidemics with early sub-exponential growth dynamics Journal of the Royal Society Interface.http://arxivorg/abs/160301216 (in press) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Diekmann O., Heesterbeek J. Wiley; 2000. Mathematical epidemiology of infectious diseases: Model building, analysis and interpretation. [Google Scholar]
- Diekmann O., Heesterbeek J.A., Metz J.A. On the definition and the computation of the basic reproduction ratio R0 in models for infectious diseases in heterogeneous populations. The Journal of Mathematical Biology. 1990;28:365–382. doi: 10.1007/BF00178324. [DOI] [PubMed] [Google Scholar]
- van den Driessche P., Watmough J. Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Mathematical Biosciences. 2002;180:29–48. doi: 10.1016/s0025-5564(02)00108-6. [DOI] [PubMed] [Google Scholar]
- Eubank S., Guclu H., Kumar V.S., Marathe M.V., Srinivasan A. Modelling disease outbreaks in realistic urban social networks. Nature. 2004;429:180–184. doi: 10.1038/nature02541. [DOI] [PubMed] [Google Scholar]
- Ferguson N.M., Donnelly C.A., Anderson R.M. The foot-and-mouth epidemic in Great Britain: Pattern of spread and impact of interventions. Science. 2001;292:1155–1160. doi: 10.1126/science.1061020. [DOI] [PubMed] [Google Scholar]
- Finkenstädt B.F., Grenfell B.T. Time series modelling of childhood diseases: A dynamical systems approach. Journal of Applied Statistics. 2000;49:187–205. [Google Scholar]
- Fisman D.N., Hauck T.S., Tuite A.R., Greer A.L. An IDEA for short term outbreak projection: Nearcasting using the basic reproduction number. PLoS One. 2013;8:e83622. doi: 10.1371/journal.pone.0083622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grenfell B.T., Bjornstad O.N., Kappey J. Travelling waves and spatial hierarchies in measles epidemics. Nature. 2001;414:716–723. doi: 10.1038/414716a. [DOI] [PubMed] [Google Scholar]
- Gustafsson L., Sternad M. Bringing consistency to simulation of population models–Poisson simulation as a bridge between micro and macro simulation. Mathematical Biosciences. 2007;209:361–385. doi: 10.1016/j.mbs.2007.02.004. [DOI] [PubMed] [Google Scholar]
- Halloran M.E., Longini I.M., Jr., Nizam A., Yang Y. Containing bioterrorist smallpox. Science. 2002;298:1428–1432. doi: 10.1126/science.1074674. [DOI] [PubMed] [Google Scholar]
- Hsieh Y.H. Pandemic influenza A (H1N1) during winter influenza season in the southern hemisphere. Influenza and Other Respiratory Viruses. 2010;4:187–197. doi: 10.1111/j.1750-2659.2010.00147.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keeling M.J., Woolhouse M.E., May R.M., Davies G., Grenfell B.T. Modelling vaccination strategies against foot-and-mouth disease. Nature. 2003;421:136–142. doi: 10.1038/nature01343. [DOI] [PubMed] [Google Scholar]
- Kermack W.O., McKendrick A.G. Contributions to the mathematical theory of epidemics: IV. Analysis of experimental epidemics of the virus disease mouse ectromelia. The Journal of Hygiene (London) 1937;37:172–187. doi: 10.1017/s0022172400034902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kiskowski M. Three-scale network model for the early growth dynamics of 2014 West Africa ebola epidemic. PLOS Currents Outbreaks. 2014 doi: 10.1371/currents.outbreaks.c6efe8274dc55274f05cbcb62bbe6070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kiskowski M., Chowell G. Modeling household and community transmission of ebola virus disease: Epidemic growth, spatial dynamics and insights for epidemic control. Virulence. 2015;7:63–73. doi: 10.1080/21505594.2015.1076613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Merler S., Ajelli M., Fumanelli L., Gomes M.F., Piontti A.P. Spatiotemporal spread of the 2014 outbreak of ebola virus disease in Liberia and the effectiveness of non-pharmaceutical interventions: A computational modelling analysis. The Lancet Infectious Diseases. 2015;15:204–211. doi: 10.1016/S1473-3099(14)71074-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riley S., Ferguson N.M. Smallpox transmission and control: Spatial dynamics in Great Britain. Proceedings of the National Academy of Sciences USA. 2006;103:12637–12642. doi: 10.1073/pnas.0510873103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roberts M.G., Heesterbeek J.A. Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection. The Journal of Mathematical Biology. 2007;55:803–816. doi: 10.1007/s00285-007-0112-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ross R. John Murray; London: 1911. The prevention of Malaria. [Google Scholar]
- Tolle J. Can growth Be faster than exponential, and just how slow is the logarithm? The Mathematical Gazette. 2003;87:522–525. [Google Scholar]
- Viboud C., Simonsen L., Chowell G. A generalized-growth model to characterize the early ascending phase of infectious disease outbreaks. Epidemics. 2016;15:27–37. doi: 10.1016/j.epidem.2016.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wallinga J., Lipsitch M. How generation intervals shape the relationship between growth rates and reproductive numbers. Proceedings of the Royal Society B: Biological Sciences. 2007;274:599–604. doi: 10.1098/rspb.2006.3754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang X.S., Wu J., Yang Y. Richards model revisited: Validation by and application to infection dynamics. The Journal of Theoretical Biology. 2012;313:12–19. doi: 10.1016/j.jtbi.2012.07.024. [DOI] [PubMed] [Google Scholar]