Abstract
For decades, mathematical models of disease transmission have provided researchers and public health officials with critical insights into the progression, control, and prevention of disease spread. Of these models, one of the most fundamental is the SIR differential equation model. However, this ubiquitous model has one significant and rarely acknowledged shortcoming: it is unable to account for a disease's true infectious period distribution. As the misspecification of such a biological characteristic is known to significantly affect model behavior, there is a need to develop new modeling approaches that capture such information. Therefore, we illustrate an innovative take on compartmental models, derived from their general formulation as systems of nonlinear Volterra integral equations, to capture a broader range of infectious period distributions, yet maintain the desirable formulation as systems of differential equations. Our work illustrates a compartmental model that captures any Erlang distributed duration of infection with only 3 differential equations, instead of the typical inflated model sizes required by traditional differential equation compartmental models, and a compartmental model that captures any mean, standard deviation, skewness, and kurtosis of an infectious period distribution with 4 differential equations. The significance of our work is that it opens up a new class of easy-to-use compartmental models to predict disease outbreaks that do not require a complete overhaul of existing theory, and thus provides a starting point for multiple research avenues of investigation under the contexts of mathematics, public health, and evolutionary biology.
Keywords: Infectious disease models, Volterra integral equations, Survival analysis, Infectious period, Mean residual waiting-time
1. Introduction
The compartmental model of Kermack and McKendrick (Kermack and McKendrick, 1932, 1933, 1991) is arguably one of the greatest development in disease modeling. The formulation of this model, in its original form as a system of nonlinear Volterra integral equations (Diekmann et al., 1995), provides a general characterization of the transmission cycle between susceptible individuals and a disease that propagates throughout an environment (Brauer, 2010). Despite this generality, the vast majority of disease modelers prefer differential equation compartmental models (Blackwood & Childs, 2018). While this particular formulation of compartmental models has distinct advantages, such as the non-requirement of specialist knowledge for analysis and well-developed numerical methods for their computation (Roberts et al., 2015), they are a special case of the aforementioned system of nonlinear Volterra integral equations (Hethcote & Tudor, 1980). Specifically, one obtains traditional differential equation compartmental models (TDECM) from the nonlinear Volterra integral equations by imposing only two assumptions: 1) the “total infectivity at time ” (Brauer, 2008, 2010) corresponds to disease incidence, and 2) the duration of infection follows either an exponential or an Erlang distribution. Unfortunately, the vast majority of diseases do not have a duration of infection that follows these distributions (Bailey, 1956; Bailey & Others, 1975; Feng et al., 2007; Gough, 1977; Kelly-Hope & McKenzie, 2009; Lloyd, 2001), and force-fitting such a distributional structure is known to have a massive effect on the behavior and quality of model predictions (Earn, 2000). Regardless of these issues, the compartmental models obtained by the two traditional assumptions have undergone many extensions. A few noteworthy examples include modifying the force of infection to account for the saturation of infection in a population (Hethcote & van den Driessche, 1991; Ponciano & Capistrán, 2011), behavioral characteristics (Alexander & Moghadas, 2005; Liu et al., 1986, 1987; Ruan & Wang, 2003), modification of the recovery rate to capture disease burnout (Greenhalgh et al., 2015a), and the inclusion of additional disease stages (Hethcote & Tudor, 1980). Furthermore, the applications of these models have also grown considerably from just predicting a disease's trajectory. Today, these traditional compartmental models are used to evaluate the health benefits and cost-effectiveness of public health policies and disease interventions (Greenhalgh et al., 2015b, 2018), gauge the potential for disease virulence evolution (Day & Proulx, 2004), predict dominant influenza strains (Casagrandi et al., 2006), investigate the complexities of disease co-infection (Tchuenche et al., 2009), among many others. However, despite this growth in theory and application, the generalization of the very foundational assumptions that simplify systems of nonlinear Volterra integral equations to differential equation compartmental models remains largely undeveloped.
In what follows, we propose new assumptions to reduce nonlinear Volterra integral equations to differential equation compartmental models. The biological motivation for these new assumptions stems from the idea that the quantity of disease in a population may be described by person-days of infection, rather than disease incidence alone. Consequently, we extend current models from solely tracking disease incidence to tracking the number of person-days of infected individuals. To account for tracking person-days of infected individuals, we make the assumptions in our analysis that 1) the total infectivity at time corresponds to disease incidence and 2) the duration of infection is distributed according to a non-homogeneous analog of the exponential distribution. Under these assumptions, we derive a generalized differential equation compartmental models (GDECM) to track the flow of person-days susceptible to disease, person-days infected to disease, and person-days recovered from disease, in addition to providing model equilibria, and disease reproductive numbers (Heffernan et al., 2005).
Essential in the development of GDECM is the use of survival analysis (supplemental materials). Specifically, the development of GDECM requires the hazard function and the mean residual waiting-time of a distribution (Finkelstein, 2002a, 2002b; Gupta & Bradley, 2003) to describe a time-varying average duration of infection. With this in mind, we demonstrate the consistency of our GDECM to traditional compartmental models when the duration of infection follows an exponential distribution. We also demonstrate GDECM with an infectious period that follows an Erlang distribution and illustrate how this choice has a representation as either a system of 3 ODEs, regardless of the Erlang distribution parameters, or as a larger system of ODEs, which feature a chain of infected individual equations (Hurtado & Kirosingh, 2019; Krylova & Earn, 2013; Lloyd, 2001). Finally, we consider an infectious period that is Pearson distributed. In choosing the Pearson distribution, we develop a model that is capable of accounting for any possible mean, standard deviation, skewness, and kurtosis of the infectious period. Thereby, we provide a simple approach for measuring how altering the infectivity profile of a disease, as described by the first four statistical moments, influences the trajectory of the disease.
2. Methods
In what follows, we develop a GDECM to describe the progression of a disease throughout a population. To obtain such models, we impose new assumptions on the notion of the total infectivity at time (Brauer, 2008, 2010) used in the integral equations of Kermack and McKendrick. Before deriving GDECM, we briefly overview the quantity of person-days of infection, review the formulation of TDECM from the integral equations of Kermack and McKendrick, and then present the GDECM. In addition, we illustrate the behavior of GDCEM in the absence of transmission, provide equilibria, and their reproductive numbers.
2.1. The infectious period and duration of infection
To derive the duration of infection distribution from the infectious period distribution we consider data on newly infectious individuals, , and characterize the infectious period by the survival function . Thus, we obtain the number of infected individuals remaining infectious at time who where infected at time as
and the total number of infected individuals at time by
On the basis that is a normalizing weight, we have that the conditional survival function of the duration of infection distribution for all currently infectious individuals with infection time prior to is
(1) |
where , and .
Given the formulation of , we assume that the associated hazard rate (Clark et al., 2003a, 2003b; Finkelstein, 2002a, 2002b) does not depend on the time of infection , namely
Note, this is a slight generalization of the classical assumption for an exponentially distributed infectious period, as when it follows that , and . Thus, the hazard rate for an exponentially distributed infectious period is . The generalization we consider is a non-homogeneous analog of the exponential distribution, namely which implies . Under such an assumed form of , we have that , and the hazard rate naturally follows.
2.2. Person-days of infection
To provide insight on the quantity person-days of infection we consider a hypothetical epidemic. Solely for illustration purposes, we consider new incidence to be reported weekly (Fig. 1) and the infectious period to be at most 14 days (Fig. 2). Furthermore, akin to the assumption used in the work of Kermack and McKendrick (Brauer, 2008, 2010), we assume that new infections occur instantaneously at the end of each week. Given this setup, infectious individuals in a given week consist of newly infected individuals, and individuals that remain infectious from the prior week (Figs. 1 and 2).
Fig. 1.
New and total infectious individuals. New infectious individuals (solid line) and all infectious individuals (new and prior infectious individuals that are infectious) (dotted line).
Fig. 2.
Infectious periods. Infectious period distribution (top) and the conditional infectious period distribution for individuals whose infection lasts more than 7 days (bottom).
Under this setup, we estimate the duration of infection distribution (Fig. 3) from a discretized version of (1),
where , and .
Fig. 3.
Weekly duration of infection distribution. The duration of infection distribution and the average duration of infection (red bar) for the weeks 1–15. Each duration of infection distribution is a weighted sum of the infectious period distribution and conditional infectious period distribution, where the weights are based on the number of people currently and priorly infected with disease.
Given the duration of infection distributions for each week the person-days of infection are calculated as the area enclosed by the survival of the infected individuals from time (Fig. 4),
where is the mean residual waiting-time of infection (Gupta & Bradley, 2003) in the week.
Fig. 4.
Person-days of infection. Grey regions correspond to the total days infected people spread disease over the course of an epidemic. Dark grey regions correspond to infectious individuals that remain infectious after one week. Mean residual waiting-time for the time spent infectious for each week (red line), average duration of infection (blue line).
One alternative to representing person-days of infection as the area enclosed by the survival of the infected individuals at time is by the product of the number of infected individuals and their mean residual waiting-time of infection at time (Fig. 5). The product serves as the basis for constructing GDECM.
Fig. 5.
Person-days of infection. Rectangular red regions correspond to the same number of total days infected people potentially spread disease over the course of an epidemic. Blue rectangular regions illustrate over estimation of person-days of infection using average druation of infection. Mean residual waiting-time for the time spent infectious for each week (red line), average duration of infection (blue line).
2.3. Traditional compartmental models
Consider the general compartmental model of Kermack and McKendrick, as formulated by Brauer et al. (Brauer, 2010; Kermack and McKendrick, 1932, 1933, 1991). For this general compartmental model, the number of susceptible individuals are denoted as , and the total infectivity at time is denoted as (Brauer, 2008, 2010). We also define as the total infectivity at time of the individuals initially infected with the disease at the beginning of the epidemic. Furthermore, the mean infectivity of an infected individual units of time after the start of the infection is , where , and the mean infectivity of individuals in the population at time who where initially infected at time as
For simplicity, we assume a constant mean infectivity, . Given that transmission is a function of both shedding rate and behavior, it would be difficult to quantify changes in mean infectivity throughout an infection. Therefore, the progression of an epidemic throughout a population is described with the integral equations,
(2) |
Here is the force of infection, which we assume to be
where is the average number of contacts individuals in a population make per unit of time and is the initial number of susceptible individuals at the start of the epidemic.
Traditionally, to reduce (2) to a system of differential equations requires that the duration of infection is exponentially distributed, which yields that the conditional survival function is where is the recovery rate, or equivalently is the average duration of infection, and 2) that the total infectivity at time corresponds to the number of infected individuals, . Combining these assumptions, along with an additional compartment to track recovered individuals, transforms system (2) into
(3) |
Here and represent the total number of infected individuals and removed individuals at the start of the epidemic. An important feature of system (3) is that it conserves the total population:
Differentiating system (3) with respect to , and substituting the integral equation for for the remaining integrals yields the classic SIR system (Fig. 6):
(4) |
Fig. 6.
Compartmental diagrams. a) Flow of suscpetible person-days to infected person-days and recovered person days, and b) Flow of suscpetible persons to infected persons and recovered persons.
Alternatively, (2) reduces to a system of differential equations when the duration of infection follows an Erlang distribution, with the conditional survival function
where is a shape parameter that determines the total number of infection stages, is the average duration spent in each stage, and 2) the total infectivity of the disease corresponds to identical stages (in terms of the average duration spent in each stage) of infected individuals,
Combining these assumptions, along with an additional compartment to track recovered individuals, transforms system (1) into
(5) |
Equivalently, if the linear chain trick is applied (see supplemental materials for details) (Lloyd, 2001), we have that
where
An important feature of system (5) is that it conserves the total population:
Differentiating system (5) with respect to , and substituting as needed yields the classic SIkR system:
(6) |
where
2.4. Generalized differential equation compartmental models
Before presenting the GDECM, we briefly contextualize its differences in comparison to traditional compartmental models. The traditional compartmental models of (4), (6) track the changes in the number of individuals susceptible, infected, and recovered from disease, for a disease circulating in a population. For traditional compartmental models, the individuals infected with disease recover from infection at a constant rate, , which is analgous to a constant average duration of infection of (Fig. 1). The premise of GDECM is to consider tracking the change in the number of person-days of infection (Fig. 2). Specifically, we characterize person-days of infection as the individuals infected with a disease multiplied by their average duration of time spent in the infected state at time (Fig. 1).
To obtain GDECM, we generalize the assumptions used to formulate system (5). We first assume the total infectivity at time is the number of infected individuals at time
In addition, we assume that the duration of infection distribution corresponds to a non-homogeneous analog of the exponential distribution, as given by the conditional survival function
(7) |
where .
The rate of removal from the infectious state, , is directly linked to the mean residual waiting-time . Specifically, one can determine from through
and the reciprocal relationship by
(8) |
provided that satisfies (Gupta & Bradley, 2003)
The functions and possess the standard properties of hazard rates and mean residual waiting-time functions, including and (Finkelstein, 2002a, 2002b).
Under this setup, we consider person-days susceptible to disease, person-days infectious with disease, and person-days recovered from disease to be governed by,
(9) |
where by definition (Finkelstein, 2002a, 2002b), is the force of infection, and is given by (5).
Adding the equations of system (9) together, it follows that
Thus, given and for all , we have that the total population is conserved:
Imposing (7) on (9), taking the time derivative of system (9), assuming and applying Leibniz rule for the derivatives of integrals as needed, we obtain
(10) |
where is given by (7).
Substituting (9) into (10) to eliminate the integrals, and isolating for each of , , and , we obtain the compartmental model (Fig. 6):
(11) |
By construction, system (11) is similar to its TDECM counterpart. However, there are some subtle differences. The terms involving on the left-hand side do not represent inflows that increase the number of susceptible, infected, or recovered individuals. These terms, which are non-positive due to (Finkelstein, 2002a, 2002b; Gupta & Bradley, 2003), account for the change in person-days caused by a decrease in the average duration of infection. In a similar manor, the term on the right-hand side does not represent an inflow of susceptible individuals, as it accounts for the change in scale of total person-days over time.
2.5. Behavior in the absence of transmission
In the absence of transmission (), the infectious equation in (11) reduces to
Solving for , it follows that
Note, the extra factor originates from , and thus reflects the time variability in the mean residual waiting-time of the initially infected individuals.
2.6. Equilibria, the basic reproductive number , and the effective reproductive number
The compartmental model (11) potentially have several equilibria. As is standard, (11) possess the standard disease-free equilibrium:
(12) |
In addition, an equilibrium where the infection was exhausted, leaving susceptible individuals that escaped infection, and recovered individuals:
(13) |
Turning our attention to the reproductive numbers of the disease, we consider the basic reproductive number using the next-generation method (van den Driessche & Watmough, 2002; van den Driessche, 2017), where and It follows that
(14) |
Here is the spectral radius, is the next-generation matrix evaluated at the disease-free equilibrium, and is the constant value of at the disease-free equilibrium.
Following a similar setup, we consider the effective reproductive number, in the face of time variability of to be
(15) |
To summarize, in this section we illustrated that GDECM possesses similar biological properties to TDCEM. Specifically, GDECM's behavior in the absence of transmission, its ability to conserve the total population, equilibria, and reproductive numbers all closely resemble similar features in TDCEM. Therefore, to further compare GDECM with TDECM, in the following section we consider the GDECM based on the duration of infection distributions of TDECM, namely the exponential or Erlang distributions, before demonstrating that GDECM exists for infectious periods that are Pearson distributed.
3. Special cases of GDECM
We now consider the duration of infection distributions used in traditional differential equation compartmental models to obtain systems of differential equations, namely the exponential distribution, and the Erlang distribution. In addition, we illustrate a GDECM that accounts for any mean, standard deviation, skewness, and excess kurtosis, by assuming that the duration of infection is Pearson distributed.
3.1. The exponential distribution
Assuming the infectious period is exponentially distributed, then the duration of infection is also exponentially distributed, and the conditional survival function for the epidemic is . Under this assumption, we have that (Gupta & Bradley, 2003),
(16) |
and
(17) |
Solving (8) under the assumption of (17) yields,
(18) |
Substituting (18) into (11), and multiplying through by , we arrive at the traditional compartmental models:
(19) |
Finally, because the basic reproductive number obtained by next-generation method (van den Driessche, 2017) is
(20) |
3.2. The Erlang distribution
If the infectious period is Erlang distributed with survival function , then necessarily . It follows that the associated conditional survival function of the duration of infection distribution is
(21) |
Note, this formulation of differs from that used in the derivation of TDECM, which bases on the convolution of exponential random variables.
(22) |
Using for an Erlang distribution (Gupta & Bradley, 2003), it follows from (8) that the mean residual waiting-time:
For the ease of presentation, we rewrite and its derivative with respect to as
and
(23) |
Using (23) in system (11), where the left-hand sides are left as and for the ease of presentation, we obtain a compartmental model comprised of three equations, regardless of the value of from the Erlang distribution:
(24) |
Alternatively, by applying the ‘linear chain trick’ (Hurtado & Kirosingh, 2019; Krylova & Earn, 2013; Lloyd, 2001) (supplemental materials) to the system of three equations, we equivalently characterize as:
(25) |
and
(26) |
Thus, through (21), (22), and (24), we either reduce system (11) to a system of 3 differential equations based on an infectious period that is Erlang distributed, or through the ‘linear chain trick’ to a system of differential equations (i.e. (25)-(26), and .
Note, an important distinction between (25)–(26) and the TDCEM analog is that new infections enter into any given infectious state because of the terms , whereas the TDECM (6) requires all new infections to enter the first stage of infection.
In regards to the basic reproductive numbers, we have that
Evaluating and at the disease-free equilibrium, and when , it follows that
and as , it follows that
Thus, depending on how the value of is defined, we have the basic reproductive number
(29) |
3.3. The Pearson distribution
If the infectious period is Pearson distributed, then the associated conditional survival function of the duration of infection distribution does not possess a general closed-form. Thus, we define the Pearson distribution by
(30) |
where
(31) |
with and .
Note, and are the mean, standard deviation, skewness, and excess kurtosis of the infectious period distribution (Gupta & Bradley, 2003; Pearson, 1893, 1895).
To obtain the mean residual waiting-time in terms of the hazard function, we have that (Gupta & Bradley, 2003; Papathanasiou, 1995):
(32) |
It follows from (18) that the mean residual waiting-time is:
(33) |
where .
To ensure the mean residual waiting-time is finite we apply l’Hpital's rule, using the facts that as , to obtain
(34) |
Thus, we require that to force the terms involving to drop out. Under this assumption, we have that
(35) |
Given (35) we evaluate and at the disease-free equilibrium, as , yielding
(36) |
Similarly, evaluating and at the disease-free equilibrium when , it follows that
(37) |
It follows that the basic reproductive number is
(38) |
depending on the value of .
Note, the lower and upper bounds of (38) are consistent with the next-generation method estimate for the basic reproductive numbers (20) and (29), as the Erlang distribution has moments and , which implies .
To summarize, in this section, we illustrated GDECM is equivalent to TDECM when the duration of infection is exponentially distributed. In addition, GDECM is similar to TDECM when the duration of infection is Erlang distributed. Although, the GDECM advantageously possesses formulations as either a system of differential equations, as would be required from TDECM, or as a system of differential equations. We concluded by demonstrating GDECM exist for non-Erlang distributions, namely the family of Pearson distribution, and now provide an application of such a GDECM to describe measles transmission in Iceland in the next section.
4. Application of methodology
To illustrate the utility of our approach, we model measles outbreaks based on time series data of measles incidence in Iceland from 1924 to 1928 (Cliff et al., 1981), taking into account classical data on the infectious period of measles (Simpson, 1952).
We use sample moments of the mean, standard deviation, skewness, and kurtosis to estimate the infectious period distribution of measles, under the assumption that it is Pearson distributed, and compare the GDECM to a traditional compartmental model that features an infectious period that is Erlang distributed (Fig. 7).
Fig. 7.
Measles infectious period. Infectious period distribution based on observed incidence data (red dash-dot line), fit Pearson distribution (black solid line) based on sample moments , and fit Erlang distribution (blue dotted line) based a shape parameter of and rate parameter of .
For the GDECM we determine the duration of infection distribution (1). Assuming that is Pearson distributed with (Fig. 8), it follows that the mean, standard deviation, skewness, and kurtosis are 5.98, 4.15, 1.62, and 3.93 days, respectively. Given these moments, the hazard rate and mean residual waiting-time naturally follow (Fig. 9). Finally, fitting system (11) to data using a least-squares procedure yields (Fig. 10).
Fig. 8.
Survival function for individuals initially infected at a given day throughout the epidemic.
Fig. 9.
Hazard rate and mean residual waiting-time .
Fig. 10.
Predicted measles incidence in Reykjavik Iceland during 1924. Trajectories of incidence are based on interpolation of data (dash-dotted red line) (Cliff et al., 1981), and simulated trajectory with the GDECM (black solid line), and simulated trajectory with SIkR model (blue dotted line).
When is Erlang distributed, we have that the mean and standard deviation are 5.98, and 2.44 days, respectively. Given these values, it follows that the classical SIR model, with a duration of infection that is Erlang distributed, is given by
(39) |
where . From a least-squares fit (Fig. 10) we have that , where the total population of Reykjavik (in 1924) is estimated to be (Greenhalgh & Day, 2017).
Given the model parameters, the possible range of basic reproductive number (36) of measles for the GDECM (11) given by the next-generation method is
and for the traditional compartmental model (39), we have that
5. Discussion
In this work, we presented a class of GDECM by modifying the classical assumptions that simplify nonlinear Volterra integral equations into systems of differential equations. To do this, we generalize the notion of compartmental models to describe person-days of infection, rather than infected persons. We illustrate the consistency of our class of models with traditional compartmental models for exponential and Erlang distributed infectious periods, present a new class of differential equation compartmental models based on a Pearson distributed infectious period, and provide equilibria and basic reproductive numbers for our approach.
The requirement that the duration of infection follows an exponential distribution is often a source of weakness with regards to the biological validity of differential equation compartmental models. While the extension of such models through the linear chain trick (Hurtado & Kirosingh, 2019; Krylova & Earn, 2013; Lloyd, 2001) to a duration of infection that follows the Erlang distribution alleviates this weakness to some degree, it does so at the cost of inflating the size of the compartmental model and thereby increasing the computational complexity of the system. Our new class of models avoids this inflation while retaining the benefits of having an infectious period that follows an Erlang distribution. Thereby our new class of models offers an approach to reduce model complexity in an era when the complexity of compartmental models is ever increasing. Furthermore, if one generalizes the concept of a function to include the survival function of the Gamma distribution, our new class of models advantageously accounts for any Gamma distribution parameter values, including non-integer cases.
A substantial advantage of the new class of models is that they are ODE based. Therefore, like traditional compartmental models, they do not require specialist knowledge to use and possess well-developed numerical methods for their simulation. Furthermore, the theoretical extensions of GDECM to include alternative formulations of the force of infection, state-dependent recovery rates, in addition to the inclusion of additional disease compartments are easy to implement. In addition, many of the applications of the traditional models, such as the study of multi-strain dynamics, health benefit analysis, and cost-effectiveness analysis, should naturally carry over without the need to reinvent the procedures of each analysis.
A potentially fruitful avenue for future applications of our new class of models is in the study of virulence and disease evolution. To elaborate, because GDECM includes both the quantity of infected individuals and their duration of infection, it may serve as a better paradigm for investigating selective pressures that pathogens face, at least relative to traditional compartmental models. Similarly, our idea to track both the quantity of infected individuals along with their duration of infection could be adapted to study species competition, as a similar modification should provide stronger intuition on species fitness.
A surprising outcome of our work is the discovery that the lower bound provided by the next-generation method estimate of the basic reproductive number for the Pearson distributed example depends on standard deviation and skewness, instead of the mean. As standard deviation and skewness indicate the spread and lean of a distribution, it seems reasonable that their combination makes for a decent proxy for the location of the middle of a distribution. While this could be a consequence of assuming that the duration of infection follows the Pearson distribution, it also highlights a potentially new approach to bound the basic reproductive number for a disease directly from data.
The use of an infectious period that is Pearson distributed may also open up an interesting avenue into bifurcation analysis. In particular, with modifications to incorporate demographic turnover or loss of immunity and the use of the Pearson distribution, one could investigate if a relationship exists between the occurrence of periodic cycles and the first four moments of the infectious period distribution. Thereby, one may be able to gain intuition as to whether bifurcations are likely to occur simply by examining statistical moments and the disease transmission rate. In addition, through such modified models, it may be possible to determine the existence (or non-existence) of Hopf bifurcations by examining whether is periodic, as this seems like it would be a requirement for such behavior in reality. Additional avenues for investigation include the direct extension to periodic compartmental models (Wang & Zhao, 2008) through the use of some form of double truncated mean residual waiting-time (Khorashadizadeh et al., 2012), and a thorough comparison to renewal equations (Champredon et al., 2018), as such work would likely prove valuable for the study infectious disease dynamics.
As GDECMs are based on the integral equation versions of the Kermack and McKendrick model, they share this model's limitations. Namely, the assumptions of a sufficiently large and well-mixed population, the compartmentalization of diseases into distinct stages, and the transmission assumption of the law of mass-action. While these limitations may seem numerous, they do not impede research on traditional models, and thereby should not inhibit the theoretical extension and application of the broader class of models proposed here.
The traditional assumptions that reduce the integral equation version of the Kermack and McKendrick model to a system of differential equations provide disease modelers with a rich source for mathematical and scientific discovery. Here, we proposed a generalization of these traditional assumptions to a biologically more accurate description of the total quantity of a disease in a population. By imposing these new assumptions, we provide a more descriptive picture of how disease propagates throughout a population while retaining the convenience and simplicity of differential equation compartmental models.
Declaration of competing interest
The authors declare no conflict of interest.
Acknowledgments
SG was partially supported by the National Science Foundation Grant DMS-2052592. The authors thank Drs. Emelie Kenney and Troy Day for helpful comments that improved the clarity of the work.
Handling editor: Dr Lou Yijun
Footnotes
Peer review under responsibility of KeAi Communications Co., Ltd.
Supplementary data to this article can be found online at https://doi.org/10.1016/j.idm.2021.08.007.
Contributor Information
Scott Greenhalgh, Email: sgreenhalgh@siena.edu.
Carly Rozins, Email: crozins@yorku.ca.
Author's contributions
Scott Greenhalgh: Conceptualization, Formal analysis, Writing – Original draft preparation, Reviewing and Editing. Carly Rozins: Formal analysis, Writing – Original draft preparation, Reviewing and Editing
Appendix A. Supplementary data
The following is the Supplementary data to this article:
References
- Alexander M.E., Moghadas S.M. Bifurcation analysis of an SIRS epidemic model with generalized incidence. SIAM Journal on Applied Mathematics. 2005;65:1794–1816. doi: 10.1137/040604947. [DOI] [Google Scholar]
- Bailey N.T.J. On estimating the latent and infectious periods of measles: I. Families with two susceptibles only. Biometrika. 1956;43:15. doi: 10.2307/2333574. [DOI] [Google Scholar]
- Bailey N.T.J., Others . Charles Griffin & Company Ltd, 5a Crendon Street, High Wycombe, Bucks HP13 6LE.; 1975. The mathematical theory of infectious diseases and its applications. [Google Scholar]
- Blackwood J.C., Childs L.M. An introduction to compartmental modeling for the budding infectious disease modeler. Lett Biomath. 2018;5:195–221. doi: 10.1080/23737867.2018.1509026. [DOI] [Google Scholar]
- Brauer F. Age-of-infection and the final size relation. Mathematical Biosciences and Engineering. 2008;5:681–690. doi: 10.3934/mbe.2008.5.681. [DOI] [PubMed] [Google Scholar]
- Brauer F. General compartmental epidemic models. Chinese Ann Math Ser B. 2010;31:289–304. doi: 10.1007/s11401-009-0454-1. [DOI] [Google Scholar]
- Casagrandi R., Bolzoni L., Levin S.A., Andreasen V. The SIRC model and influenza A. Mathematical Biosciences. 2006;200:152–169. doi: 10.1016/j.mbs.2005.12.029. [DOI] [PubMed] [Google Scholar]
- Champredon D., Dushoff J., Earn D.J.D. Equivalence of the erlang-distributed SEIR epidemic model and the renewal equation. SIAM Journal on Applied Mathematics. 2018;78:3258–3278. doi: 10.1137/18M1186411. [DOI] [Google Scholar]
- Clark T.G., Bradburn M.J., Love S.B., Altman D.G. Survival Analysis Part I: Basic concepts and first analyses. British Journal of Cancer. 2003;89:232–238. doi: 10.1038/sj.bjc.6601118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clark T.G., Bradburn M.J., Love S.B., Altman D.G. Survival Analysis Part IV: Further concepts and methods in survival analysis. British Journal of Cancer. 2003;89:781–786. doi: 10.1038/sj.bjc.6601117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cliff A.D., Haggett P., Ord J.K., Versey G.R. Cambridge University Press; 1981. Spatial diffusion: An historical geography of epidemics in an island community. [Google Scholar]
- Day T., Proulx S.R. A general theory for the evolutionary dynamics of virulence. The American Naturalist. 2004;163:E40–E63. doi: 10.1086/382548. [DOI] [PubMed] [Google Scholar]
- Diekmann O., Metz H., Heesterbeek H. In: Epidemic models: Their structure and relation to data. Mollison D., editor. Cambridge University Press; Cambridge, UK: 1995. The legacy of Kermack and McKendrick; pp. 95–115. [Google Scholar]
- van den Driessche P. Reproduction numbers of infectious disease models. Infect Dis Model. 2017;2:288–303. doi: 10.1016/j.idm.2017.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van den Driessche P., Watmough J. Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Mathematical Biosciences. 2002:29–48. doi: 10.1016/S0025-5564(02)00108-6. [DOI] [PubMed] [Google Scholar]
- Earn D.J. A simple model for complex dynamical transitions in epidemics. Science. 2000;287:667–670. doi: 10.1126/science.287.5453.667. 80- [DOI] [PubMed] [Google Scholar]
- Feng Z., Xu D., Zhao H. Epidemiological models with non-exponentially distributed disease stages and applications to disease control. Bulletin of Mathematical Biology. 2007;69:1511–1536. doi: 10.1007/s11538-006-9174-9. [DOI] [PubMed] [Google Scholar]
- Finkelstein M.S. On the reversed hazard rate. Reliability Engineering & System Safety. 2002;78:71–75. doi: 10.1016/S0951-8320(02)00113-8. [DOI] [Google Scholar]
- Finkelstein M.S. On the shape of the mean residual lifetime function. Applied Stochastic Models in Business and Industry. 2002;18:135–146. doi: 10.1002/asmb.461. [DOI] [Google Scholar]
- Gough K.J. The estimation of latent and infectious periods. Biometrika. Biometrika Trust. 1977;64:559–565. [Google Scholar]
- Greenhalgh S., Day T. Time-varying and state-dependent recovery rates in epidemiological models. Infect Dis Model. 2017 doi: 10.1016/j.idm.2017.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenhalgh S., Galvani A.P., Medlock J. Disease elimination and re-emergence in differential-equation models. Journal of Theoretical Biology. 2015;387:174–180. doi: 10.1016/j.jtbi.2015.09.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenhalgh S., Hobbs C.V., Parikh S. Brief report: Antimalarial benefit of HIV antiretroviral therapy in areas of low to moderate malaria transmission intensity. Journal of Acquired Immune Deficiency Syndromes. 2018;79:249–254. doi: 10.1097/QAI.0000000000001783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenhalgh S., Ndeffo M., Galvani A.P., Parikh S. The epidemiological impact of HIV antiretroviral therapy on malaria in children. AIDS. 2015;29:473–482. doi: 10.1097/QAD.0000000000000550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gupta R.C., Bradley D.M. Representing the mean residual life in terms of the failure rate. Mathematical and Computer Modelling. 2003;37:1271–1280. doi: 10.1016/S0895-7177(03)90038-0. [DOI] [Google Scholar]
- Heffernan J.M., Smith R.J., Wahl L.M. Perspectives on the basic reproductive ratio. Journal of The Royal Society Interface. 2005;2:281–293. doi: 10.1098/rsif.2005.0042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hethcote H.W., Tudor D.W. Integral equation models for endemic infectious diseases. Journal of Mathematical Biology. 1980;9:37–47. doi: 10.1007/BF00276034. [DOI] [PubMed] [Google Scholar]
- Hethcote H.W., van den Driessche P. Some epidemiological models with nonlinear incidence. Journal of Mathematical Biology. 1991;29:271–287. doi: 10.1007/BF00160539. http://www.ncbi.nlm.nih.gov/pubmed/2061695 Available: [DOI] [PubMed] [Google Scholar]
- Hurtado P.J., Kirosingh A.S. Generalizations of the ‘linear chain trick’’: Incorporating more flexible dwell time distributions into mean field ODE models. Journal of Mathematical Biology. 2019 doi: 10.1007/s00285-019-01412-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kelly-Hope L.A., McKenzie F.E. The multiplicity of malaria transmission: A review of entomological inoculation rate measurements and methods across sub-Saharan Africa. Malaria Journal. 2009;8:19. doi: 10.1186/1475-2875-8-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kermack W.O., McKendrick A.G. Contributions to the mathematical theory of epidemics--II. The problem of endemicity. Bulletin of Mathematical Biology. 1932;53:57–87. doi: 10.1007/BF02464424. http://www.ncbi.nlm.nih.gov/pubmed/2059742 1991. Available: [DOI] [PubMed] [Google Scholar]
- Kermack W.O., McKendrick A.G. Contributions to the mathematical theory of epidemics--III. Further studies of the problem of endemicity. Bulletin of Mathematical Biology. 1933;53:89–118. doi: 10.1007/BF02464425. http://www.ncbi.nlm.nih.gov/pubmed/2059743 1991. Available: [DOI] [PubMed] [Google Scholar]
- Kermack W.O., McKendrick A.G. Contributions to the mathematical theory of epidemics--I. 1927. Bulletin of Mathematical Biology. 1991;53:33–55. doi: 10.1007/BF02464423. http://www.ncbi.nlm.nih.gov/pubmed/2059741 Available: [DOI] [PubMed] [Google Scholar]
- Khorashadizadeh M., Rezaei Roknabadi A.H., Mohtashami Borzadaran G.R. Characterizations of lifetime distributions based on doubly truncated mean residual life and mean past to failure. Communications in Statistics - Theory and Methods. 2012;41:1105–1115. doi: 10.1080/03610926.2010.535626. [DOI] [Google Scholar]
- Krylova O., Earn D.J.D. Effects of the infectious period distribution on predicted transitions in childhood disease dynamics. Journal of The Royal Society Interface. 2013;10:20130098. doi: 10.1098/rsif.2013.0098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu W.M., Hethcote H.W., Levin S.A. Dynamical behavior of epidemiological models with nonlinear incidence rates. Journal of Mathematical Biology. 1987;25:359–380. doi: 10.1007/BF00277162. http://www.ncbi.nlm.nih.gov/pubmed/3668394 Available: [DOI] [PubMed] [Google Scholar]
- Liu W.M., Levin S.A., Iwasa Y. Influence of nonlinear incidence rates upon the behavior of SIRS epidemiological models. Journal of Mathematical Biology. 1986;23:187–204. doi: 10.1007/BF00276956. http://www.ncbi.nlm.nih.gov/pubmed/3958634 Available: [DOI] [PubMed] [Google Scholar]
- Lloyd A.L. Realistic distributions of infectious periods in epidemic models: Changing patterns of persistence and dynamics. Theoretical Population Biology. 2001;60:59–71. doi: 10.1006/tpbi.2001.1525. [DOI] [PubMed] [Google Scholar]
- Papathanasiou V. A characterization of the Pearson system of distributions and the associated orthogonal polynomials. Annals of the Institute of Statistical Mathematics. 1995;47:171–176. doi: 10.1007/BF00773421. [DOI] [Google Scholar]
- Pearson K. Contributions to the mathematical theory of evolution. Proceedings of the Royal Society of London. 1893;54:329–333. doi: 10.1098/rspl.1893.0079. [DOI] [Google Scholar]
- Pearson K. Contributions to the mathematical theory of evolution. II. Skew variation in homogeneous material. Philos Trans R Soc A Math Phys Eng Sci. 1895;186:343–414. doi: 10.1098/rsta.1895.0010. [DOI] [Google Scholar]
- Ponciano J.M., Capistrán M.A. First principles modeling of nonlinear incidence rates in Seasonal epidemics. Pascual M, editor. PLoS Computational Biology. 2011;7 doi: 10.1371/journal.pcbi.1001079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roberts M., Andreasen V., Lloyd A., Pellis L. Nine challenges for deterministic epidemic models. Epidemics. 2015;10:49–53. doi: 10.1016/j.epidem.2014.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruan S., Wang W. Dynamical behavior of an epidemic model with a nonlinear incidence rate. Journal of Differential Equations. 2003;188:135–163. doi: 10.1016/S0022-0396(02)00089-X. [DOI] [Google Scholar]
- Simpson R.E.H. Infectiousness of communicable diseases in the household (measles, chickenpox, and mumps) Lancet. 1952;2:549–554. doi: 10.1016/s0140-6736(52)91357-3. http://www.ncbi.nlm.nih.gov/pubmed/12981903 Available: [DOI] [PubMed] [Google Scholar]
- Tchuenche J., Garira W., Gumel A., Mukandavire Z. Mathematical analysis of a model for HIV-malaria co-infection. Mathematical Biosciences and Engineering. 2009;6:333–362. doi: 10.3934/mbe.2009.6.333. [DOI] [PubMed] [Google Scholar]
- Wang W., Zhao X.-Q. Threshold dynamics for compartmental epidemic models in periodic environments. Journal of Dyn Differ Equations. 2008;20:699–717. doi: 10.1007/s10884-008-9111-8. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.