Abstract
The duration of the infectious period is a crucial determinant of the ability of an infectious disease to spread. We consider an epidemic model that is network based and non-Markovian, containing classic Kermack-McKendrick, pairwise, message passing, and spatial models as special cases. For this model, we prove a monotonic relationship between the variability of the infectious period (with fixed mean) and the probability that the infection will reach any given subset of the population by any given time. For certain families of distributions, this result implies that epidemic severity is decreasing with respect to the variance of the infectious period. The striking importance of this relationship is demonstrated numerically. We then prove, with a fixed basic reproductive ratio (), a monotonic relationship between the variability of the posterior transmission probability (which is a function of the infectious period) and the probability that the infection will reach any given subset of the population by any given time. Thus again, even when is fixed, variability of the infectious period tends to dampen the epidemic. Numerical results illustrate this but indicate the relationship is weaker. We then show how our results apply to message passing, pairwise, and Kermack-McKendrick epidemic models, even when they are not exactly consistent with the stochastic dynamics. For Poissonian contact processes, and arbitrarily distributed infectious periods, we demonstrate how systems of delay differential equations and ordinary differential equations can provide upper and lower bounds, respectively, for the probability that any given individual has been infected by any given time.
I. INTRODUCTION
In a homogeneously mixing large population, under certain common assumptions, the epidemiological quantity (this being the expected number of secondary cases per typical primary case near the start of an epidemic) depends on the infectious period only through its mean [1]. However, under the same assumptions, other important quantifiers such as the probability of a major outbreak, the final size, and the initial growth rate can depend on the variability of the infectious period, higher variability tending to decrease these quantities [1,2]. When accounting for the more realistic scenario where individuals can only make direct contacts to their neighbor in a contact network [3], typically depends on the variability of the infectious period and, even when is held fixed, the probability that any given individual will eventually get infected is still dependent on the variability of the infectious period [4]. Here we extend these results to a much more general epidemic model and consider the effect of the infectious period distribution on the probability that the disease will spread to an arbitrary subset of the population by an arbitrary time . This probability underpins the likelihood of an epidemic, and the speed and extent of its propagation.
It is commonplace to assume that the infectious period is exponentially distributed because this leads to greater mathematical tractability. In choosing the parameter for this distribution, the modeler may try to replicate the estimated average infectious period or the estimated value for . In any case, the exponential distribution is typically not very realistic for this variable. For example, it has been suggested that gamma, Weibull, and degenerate (nonrandom) distributions may be more realistic for diseases such as smallpox, ebola, and measles [5–8]. Thus, investigating the effect of the infectious period distribution is important for obtaining a qualitative understanding of the ability of different diseases to propagate, and of the effects of intervention strategies which may modify this distribution. It is also important for informing parameter choices in epidemic models.
The susceptible-exposed-infectious-recovered (SEIR) compartmental model for the spread of infectious diseases may be considered in a general stochastic and network-based form (see, for example, [9,10]). Here we consider a similar stochastic epidemic model which we construct as a non-Markovian stochastic process taking place on an arbitrary static contact network (or graph). We allow arbitrarily distributed exposed and infectious periods, heterogeneous contact processes between individuals, and heterogeneity in susceptibility and infectiousness. Many previously studied models such as Kermack-McKendrick [11], pairwise [12,13], message passing [10], and spatial models [4,14] are identical to, consistent with, or approximations of special cases of the stochastic model which we examine here [15]. We show how our conclusions apply to these well-known models.
Let and be two real-valued random variables. If for all convex functions then we say that is greater than in convex order [16] and write . The convex order, which provides a type of variability ordering for random variables with the same mean, is central to the paper that we present here. Our main result shows that, under mild assumptions, by changing the infectious period distributions such that they decrease in convex order, which necessarily decreases their variance, we can only increase . We discuss some important corollaries of this and then present examples and a numerical illustration (Fig. 1).
The strength of the relationship between epidemic severity and the variability of the infectious period may depend on many factors, such as the topology of the contact network and the processes by which individuals interact. However, this is not of primary concern here since we note that epidemic severity may be made arbitrarily small by increasing the variance of the infectious period, regardless of these other factors. This is the case since we may define the infectious period, with specified mean, to be able to take only the value zero or some arbitrarily large number. Thus, the probability that the infectious period is zero may be set arbitrarily close to 1.
The most relevant previous work [4] compares two susceptible-infected-recovered (SIR) network-based epidemic models, where the infectious period is random in one and nonrandom in the other, and where the “transmission probability” that an individual, given that it gets infected, will contact a given neighbor before recovering is the same in both models. It was shown that, under stronger assumptions than here, the long-term probabilities are not less in the model with the nonrandom infectious period. To relate more directly to this result, we define (following [17] and [18]) the “transmissibility” to be the posterior probability that an infected individual, with a given infectious period, will make a contact to a given neighbor before recovering. Thus, the transmissibility is a random variable since it is a function of the infectious period, and its expected value is the transmission probability. We show that by changing the infectious period distribution such that the transmissibility is decreased in convex order, which we shall argue keeps constant, we can only increase . We discuss some important corollaries of this and then present an example and a numerical illustration (Fig. 2).
Finally, we show how our results carry over to well-known message passing, pairwise, and Kermack-McKendrick models.
II. THE STOCHASTIC MODEL
The SEIR epidemic model under consideration is defined as follows. Let be an arbitrary simple undirected graph, where is a finite or countably infinite set of vertices (individuals) and is a set of undirected edges between the vertices. For , let be the set of neighbor of and let for all (the graph is thus described as “locally finite”). We assume that two individuals are neighbor if and only if at least one can make direct contacts to the other. Let denote the time period that spends in the exposed state; is 's infectious period, i.e., the time period that spends in the infectious state; is the time elapsing between first entering the infectious state and it making a sufficient (for transmission) contact to (note that the sufficient contact is not infectious, i.e., cannot cause infection, if it occurs after 's infectious period has terminated); is some variable on which all of the sufficient contact times may depend, e.g., a quantifier of 's infectiousness arising from sources other than the length of its infectious period (similar to in [17]); is some variable on which all of the sufficient contact times may depend, e.g., a quantifier of 's susceptibility (similar to in [17]). For , makes an infectious contact to at time if and only if (i) enters the infectious state at some time , (ii) , and (iii) . Susceptible individuals enter the exposed state as soon as they receive an infectious contact, exposed individuals immediately enter the infectious state when their exposed period terminates, and infectious individuals immediately enter the recovered state when their infectious period terminates. Individuals may be in any state at except the exposed state, and we may interpret being in the recovered state at as being vaccinated.
Letting , the situation which we wish to consider is where and the initial conditions are random. We will assume that, excluding the variables, is mutually independent; for all and is conditionally independent from given and ; and the initial state of the population is independent from .
In line with the discussion in Sec. I, we define to be the probability that at least one member of is initially infectious, or is initially susceptible and receives an infectious contact before or at time . Thus, we say that is the probability that the disease spreads to by time .
For ease of reference, the definitions of all of the above variables, and other important definitions, are collected and presented as a list at the start of the Appendix.
III. THE IMPACT OF THE INFECTIOUS PERIOD DISTRIBUTION
To understand the impact of the infectious period on the likelihood, speed and extent of epidemic spread, we will first focus on a single individual and label a subset of its neighbor using a bijection to . Assume that gets infected and consider its behavior after it leaves the exposed state and immediately enters the infectious state, and also assume that all of the variables except 's infectious period and the sufficient contact times have already been drawn from their joint distribution. Let denote the event that does not make an infectious contact to neighbor 1 within time period (since entering the infectious state), neighbor 2 within time period , , and neighbor within time period , where the are arbitrary non-negative numbers. We may now write
(1) |
where
and
(2) |
We use to indicate that we are conditioning on the values already drawn for the infectiousness and susceptibility variables and , since the sufficient contact times may depend on these. The form of (2) may be understood by observing that if the infectious period takes the value and then any sufficient contact made from to within time period is an infectious contact. Thus, for no infectious contact to within time period we need to be greater than . On the other hand, if then the only sufficient contacts made within time period which are infectious are those made in the smaller time period . Here then, for no infectious contact to within time period we only need to be greater than .
Let us now consider the conditions under which is convex since this will be necessary for a precise statement of our results. It is convex if is convex for all , since the are non-negative and nonincreasing. Further, is convex if the survival function for , after conditioning on any possible values for and , is convex; note that a nonincreasing probability density function (PDF) is sufficient for a convex survival function. If contact processes are independent Poissonian processes, which is a common assumption, then the are exponential and thus have convex survival functions. If the are independent and gamma distributed with shape parameters less than or equal to 1 then their survival functions will be convex. We also note that the survival function for the heavy-tailed Lomax distribution , where , is convex on . Moreover, since , where , is convex on then sufficient contact times which have other heavy-tailed distributions may have convex survival functions. This is of relevance since it has been shown how processes which depend on human decision making may develop interevent times which have heavy-tailed distributions, and data for some such processes do indeed indicate heavy tails [19]. Alternatively, if is the residual waiting time of a renewal process which governs the times at which makes sufficient contacts to then it follows that the PDF for is nonincreasing. See Sec. 2.2 in Ref. [20].
Two important examples where is certainly convex are as follows. In both cases, the infectiousness and susceptibility variables take values in (0,1] and, for all , we have individual , while infectious, making contacts to according to an independent Poissonian process of rate (a time-inhomogeneous Poissonian process could be used instead but the rate would need to be nonincreasing). In the first case any given contact from to is sufficient with probability while in the second case only the first contact may be sufficient and it is so with probability . Such scenarios have previously been considered and proposed for modeling the spread of HIV [21–23].
Having discussed that the convexity of is realistic, and follows from many common assumptions, we will assume this in what follows and use it to prove results concerning the effect of the infectious period distribution on the ability of the disease to spread.
Recall that for two real-valued random variables, and , if for all convex functions then we say that is greater than in convex order and write . An important result for the convex order is that
(3) |
Another useful result is that if , and and cross exactly once [where these are the cumulative distribution functions (CDFs) for and , and the sign sequence of is , then this implies that [16]. We will refer to this as the graphical sufficient condition for the order.
Thus, since is convex and nonincreasing, then decreasing 's infectious period in convex order, or increasing in the usual stochastic order, can only decrease because the expectation in (1) can only decrease. (If and are two real-valued random variables then is less than in the usual stochastic order, and we write , if and only if for all nonincreasing functions .) This means that, since the are arbitrary non-negative numbers and is an arbitrary subset of 's neighbor, the transmission probability that will make an infectious contact to , given that gets infected, can only increase. By assuming that is nondecreasing with respect to these transmission probabilities, it follows that can only increase.
More importantly, for all subsets of individuals and all , the probability , that the disease will spread to by time , can only increase. To understand this, note that if we have already drawn all of the variables except 's infectious period and the sufficient contact times , then either it is already known whether or not the disease reaches subset by time , or there exists some choice of and the such that this occurs if and only if does not occur; and, as we have shown, the probability of can only decrease. As a simple example of this, consider the case where the population consists of and two other individuals, and , connected in a line, i.e., is a neighbor of both and , but and are not neighbor. Let consist of the single individual and let be the only initially infected individual with the others being initially susceptible. Here, if then it is already known that the disease does not reach by time no matter what values are drawn for and . However, if then the disease does not reach by time if and only if occurs where and , i.e., if and only if does not make any infectious contacts to within time period .
Since is an arbitrary member of and all of the infectious period distributions are arbitrary, we can repeatedly apply this argument to conclude that can only increase if any subset of the infectious periods are decreased in convex order or increased in the usual stochastic order (see Theorem 1 in the Appendix).
Let us now consider what this suggests more generally about the importance of the shape of the infectious period distributions. First, for given means, the infectious period distributions which maximize are degenerate, i.e., the infectious periods are nonrandom. This follows from the graphical sufficient condition for the convex order which shows that any other infectious periods with the same means are necessarily greater in convex order. Secondly, for given means and given maximum values, i.e., bounded infectious periods, the infectious periods which minimize are such that they are equal either to zero or to their maximum values (their variance is maximal). Again, this follows similarly from the graphical sufficient condition for the convex order. Thus, the tendency of decreasing the variances of the infectious periods to increase the probability that the disease will spread to a given part of the network by a given time is made clear. This tendency is also highlighted by (3).
Gamma and Weibull distributions are potentially realistic for the infectious periods; they allow concentration about their mean values unlike the exponential distribution. For two gamma distributions with the same mean, we can use the graphical sufficient condition to conclude that the one with greater variance is necessarily greater in convex order; the same applies for two Weibull distributions with the same mean. So if we restrict our distributions to one of these two families, and keep the means fixed, then decreasing the variances of the infectious periods can only increase . An illustration of the extent of this increase, for the case of the gamma distribution, is shown in Fig. 1 where we have computed the expected number susceptible at time as . The effect is remarkable when one considers that the mean is fixed and we have just interpolated between the exponential distribution and the degenerate distribution, both of which are commonly assumed for the infectious period. It also reveals the large amount of error that could be introduced, at all points in time, when approximating the epidemic as a Markov process and using the reciprocal of the estimated average infectious period as the recovery rate in the model.
IV. THE IMPACT OF THE INFECTIOUS PERIOD DISTRIBUTION WHEN TRANSMISSION PROBABILITY IS FIXED
We have shown how the transmission probabilities are decreasing with respect to the variability (in the sense of the convex order) of the infectious period, and we assume that is a function of these transmission probabilities. Since it may be sensible to choose an infectious period distribution for our model such that the estimated value of for the disease is replicated, as opposed to the estimated mean of the infectious period, then it is pertinent to consider the sensitivity of to the infectious period distribution when the transmission probabilities and are fixed [recall that is the probability that the disease will spread to by time .
Let us now assume that the sufficient contact times are mutually independent, so we discard the infectiousness and susceptibility variables , and assume that, for each , the sufficient contact times are independent and identically distributed (i.i.d.) (let denote the random variable with this distribution). However, some gains here are that we do not make any other assumptions about the distributions of the variables and we allow infectious periods to be infinite with positive probability since we do not specify a finite mean. For , let denote and let denote the random “transmissibility” variable (recall that is 's infectious period). It is the transmission probability that we will keep constant.
Again, we will first focus on a single individual and label a subset of its neighbor using a bijection to . Assume that gets infected and consider its behavior after it leaves the exposed state and immediately enters the infectious state, and also assume that all of the variables except 's infectious period and the sufficient contact times have already been drawn from their joint distribution. As previously, we let denote the event that does not make an infectious contact to neighbor 1 within time period (since entering the infectious state), neighbor 2 within time period , , and neighbor within time period , where the are arbitrary non-negative numbers. We may now write
(4) |
where
and
(5) |
The form of (5) may be understood by observing that is less than or equal to if the infectious period takes the value and . In this case, any sufficient contact made from to within time period is an infectious contact. Thus, for no infectious contact to within time period we need to be greater than . On the other hand, if then the only sufficient contacts made within time period which are infectious are those made in the smaller time period . Here then, for no infectious contact to within time period we only need to be greater than and this occurs with probability .
Note that since is convex, non-negative, and nonincreasing for all , then is convex and nonincreasing on [0,1]. Therefore, decreasing in convex order, or increasing in the usual stochastic order, can only cause the expectation in (4) to decrease.
Thus, altering any subset of the infectious periods such that the corresponding transmissibility variables are decreased in convex order, or increased in the usual stochastic order, can only cause to decrease and to increase by the same arguments as in Sec. III (see Theorem 1 in the Appendix). Using the graphical sufficient condition for the convex order, and keeping constant by keeping the expected values of the constant, we have that is maximized when the are nonrandom. This is the case when the infectious periods are nonrandom. So, whether the infectious periods are altered such that the means are held constant, or such that is held constant (with the slightly different sets of assumptions), is maximized when the infectious periods are nonrandom. On the other hand, is minimized when the can only be equal to either zero or one. This is the case when the infectious periods can only be zero or infinite. Thus, as with the infectious periods themselves, there is a clear tendency for decreasing the variances of the transmissibility variables to increase .
If the sufficient contact times have cumulative distribution functions which are strictly increasing on then the CDF for is given by for all . In this case, if 's infectious period is altered such that its new CDF crosses its original CDF exactly once and from below, then the new CDF for crosses the original CDF for exactly once and from below. We may interpret this alteration as a reduction in the variability of 's infectious period since the CDF becomes less “spread out.” Thus, assuming the transmission probability is held constant, then decreases in convex order (by the graphical sufficient condition) and increases. Therefore, when transmission probabilities and are held constant, as opposed to the means of the infectious periods, we see that lesser variability in the infectious period can still lead to greater epidemic severity.
In Fig. 2, we demonstrate the extent to which the infectious period distribution can affect , when is held constant, by computing the expected number susceptible at time as . The infectious period distribution is here clearly less important than when the means of the infectious periods are held fixed.
V. THE IMPACT OF THE INFECTIOUS PERIOD IN MESSAGE PASSING AND PAIRWISE MODELS
There exist message passing and pairwise systems of equations which, in some cases, may be solved in order to exactly capture the probability distribution for the state of any given individual at any given time in the stochastic model [10,13,15]. If this is the case then the effect of the infectious period distribution on , for all , is also exactly captured.
More generally, epidemic models such as those formed from message passing equations, or moment closure methods, approximate the probability distribution for the state of any given individual at any given time. Here we show that the same conclusions about the impact of the infectious period also apply to these approximate models. To be able to relate to previous work we discard the exposed periods and the susceptibility variables . The message passing system for our stochastic model is defined, for and :
(6) |
(7) |
(8) |
where, for ,
(9) |
Here, , and , approximate the probability that at time individual is susceptible, infectious, and recovered/vaccinated, respectively; approximates the probability that at time individual (in the cavity state [10]) has not received an infectious contact from individual ; and are the probabilities that individual is initially recovered/vaccinated and initially susceptible, respectively; and are the PDFs for and respectively; is the survival function for . This system has a unique feasible solution if , by Theorem 1 in [15], and gives exactly the same output as a pairwise model which has well-known special cases, by [13] and Theorem 5 in [15]. Thus our conclusions about the effect of the infectious period in the above message passing system (6)–(9) also apply to pairwise models.
It can be shown that (see Theorem 2 in the Appendix), similarly to the stochastic model, if any subset of the infectious periods are increased in the usual stochastic order then can only decrease for all and all ; if the infectious periods of any subset are decreased in convex order, and is nonincreasing for all , then can only decrease for all and all . Now assume that for each the are i.i.d. (let denote the random variable with this distribution) and define the transmissibility variable . In this case, if the distributions for any subset of the infectious periods are altered such that the corresponding are increased in the usual stochastic order, or decreased in convex order, then can only decrease for all and all (see Theorem 2). Using the graphical sufficient condition for the convex order, this means that when the transmission probabilities ( for all ) and are fixed, is minimized when infectious periods are nonrandom and maximized when infectious periods may be only zero or infinite. Note that in the former case the CDFs for the infectious periods are Heaviside step functions while in the latter case they are constant on .
We can now build on these results in order to write down systems of equations which are simpler to solve and which provide rigorous lower and upper bounds, and approximations, for and for all and all . Importantly, if the variables are mutually independent or positively correlated and the states of individuals at are mutually independent, then a lower bound on is also a lower bound on the probability that is susceptible at time , and an upper bound on is also an upper bound on the probability that is recovered/vaccinated at time . This follows since is a lower bound on the former probability while is an upper bound on the latter [10,13,15]. Such bounds provide a “worst case scenario” [10] and an upper bound on the expected final size of the epidemic.
To obtain a lower bound and approximation for we may replace in (9) by where is the Heaviside step function and is defined to satisfy
(10) |
Similarly, to obtain an upper bound and approximation for we may replace in (9) by a constant which is defined to satisfy
(11) |
These results are presented as part (d) of Theorem 2 in the Appendix. Note that, in both cases, making these changes to (9) does not alter the probability, as it is represented in the message passing system, that a given infected individual will make an infectious contact to a given neighbor before recovering [this is the quantity in (10) and (11)]. It is then straightforward, following Sec. IV of [10] and the proof of Theorem 4 in [15], that is also unaltered for all . On these grounds, we expect the bounds and approximations to be good. Additionally, replacing in (8) by its lower and upper bound (for all ) produces an upper and lower bound, and approximations, respectively, for . This follows because the integrand in (8) is decreasing with respect to .
As an example, if contact processes are Poissonian such that the are exponentially distributed with parameters , we can then conveniently obtain the lower bounds via delay differential equations (DDEs)
and the upper bounds via ordinary differential equations (ODEs)
where we use “dot” notation to indicate derivatives with respect to time. The lower and upper bounds, and approximations, for are then given by and , respectively.
VI. THE IMPACT OF THE INFECTIOUS PERIOD IN THE KERMACK-MCKENDRICK MODEL
Similar results also apply to the classic SIR model proposed by Kermack and McKendrick [11]. The model is defined as follows:
(12) |
(13) |
(14) |
where the variables on the left-hand-side represent the fraction susceptible, infected, and recovered, respectively, at time ; is the rate at which an individual, that has been infected for time period , makes sufficient contacts to others; and is the random infectious period with density function and survival function . Let be the accumulated infectivity [1], such that is the expected number of infectious contacts that an infected individual will make before recovering. Thus plays a similar role to the transmissibility random variable. Equation (12) can be derived from Eq. (13) in [11] by dividing the latter through by the total population size, and after appropriately renaming the variables and functions.
Let be given by (12) but with the infectious period replaced by . Let be given by (12) but with replaced by . Let be continuously differentiable and assume at least one of the following conditions.
(i) and the infectious period cannot be infinite.
(ii) and is nonincreasing on and the infectious period cannot be infinite.
(iii) (defined using and , respectively).
(iv) .
Then for all , we have . See Theorem 3 in the Appendix. Note that if individuals are assumed to make contacts according to a homogeneous Poissonian process then is constant and therefore nonincreasing and continuously differentiable. It is also worth noting that by replacing the infectious period in the Kermack-McKendrick model by one which is nonrandom, keeping or the expected infectious period constant, a lower bound on is achieved for all (using the graphical sufficient condition for the convex order); replacing in (14) by produces an upper bound on [since the integrand in (14) is decreasing with respect to . For example, if then we may obtain by solving
with for all and . In this case and so the expected infectious period and are simultaneously kept constant. It is then straightforward that since this quantity is determined by and the initial conditions [11]. On these grounds we expect the bound to be good.
VII. CONCLUSION
For an extremely general epidemic model, we have proved a monotonic relationship between the variability of the infec- tious period and the severity of an epidemic. Specifically, the probability that an arbitrary individual will get infected by time is decreasing with respect to the variability of the infectious period with fixed mean (using the convex order as a variability order). Similarly, and more intuitively, is increasing with respect to the magnitude of the infectious period (using the usual stochastic order as a magnitude order). Since the expected number to get infected by time is obtained by summing over all individuals, this quantity is also decreasing with respect to the variability of the infectious period and increasing with respect to the magnitude of the infectious period.
Using a graphical sufficient condition for the convex order, we have shown that for an infectious period with fixed mean, is maximized if the infectious period distribution is degenerate (nonrandom). For an infectious period with fixed mean and fixed maximum value, is minimized when the infectious period can only take its maximum value or zero. These results also apply to the expected number to get infected by time .
We have also shown that when (the basic reproductive ratio) is fixed, is decreasing with respect to the variability of the posterior transmission probability, which is a function of the infectious period. It follows that when is fixed, is maximized if the infectious period is nonrandom and minimized if it can only be either infinite or zero. These results also apply to the expected number to get infected by time .
Our main results were found to carry over, in an obvious sense, to message passing and pairwise models. For the message passing model, we also showed that by changing the cumulative distribution functions of the infectious periods to more tractable Heaviside step functions or constants, while keeping fixed, lower and upper bounds, respectively, may be obtained for the expected number susceptible at time in the message passing model. We showed that, if contact processes are Poissonian, the lower and upper bounds may be obtained via DDEs and ODEs respectively.
For the classic SIR model of Kermack and McKendrick [11] we were able to show that the fraction susceptible at time is increasing with respect to the variability of the infectious period with fixed mean, assuming that the rate at which an infected individual makes contacts to others is nonincreasing with time. Additionally, by making the infectious period nonrandom (which changes its CDF to a Heaviside step function), keeping its mean or constant, a lower bound on the fraction susceptible is obtained for all time points (an upper bound on the model's epidemic final size is thus also obtained). We showed that, if contact processes are Poissonian, the lower bound may be obtained via a system of one ODE and one DDE.
Our numerical results illustrate that, even under common parametrizations, the severity of the stochastic epidemic is highly sensitive to the infectious period distribution when its mean is fixed, but less so when is fixed. This suggests that we should base our choice for the infectious period distribution more on the estimated value of than on the estimated average infectious period—at least when computing the timecourse of the expected number susceptible (equivalently, the timecourse of the expected total number of cases). For a given epidemic model, this also suggests the strategy of computing the transmission probability, or , first and then using this to inform a new choice for the infectious period distribution which will ease numerical solution or mathematical analysis. However, is much more difficult to measure empirically than the average infectious period.
This paper adds to recent research which has sought to articulate the impact of non-Markovian dynamics in epidemic models [1,24–27]. Notably, our results do not depend on the assumption of exponential contact times, the validity of which has recently been questioned since heavy-tailed distributions have been inferred from observation [19,24,28].
It is unclear whether similar results can be found in compartmental structures, such as susceptible-infected-susceptible (SIS) dynamics, where individuals may be infected multiple times. Indeed, it has recently been shown [29] that for a particular stochastic SIS model, in which contact processes are Poissonian, the expected total time that the system spends in any given state only depends on the infectious period distribution through its mean.
ACKNOWLEDGMENTS
This research was funded by the Leverhulme Trust, Grant No. RPG-2014-341. We thank the reviewers for their constructive comments which have helped us to improve the presentation of our results.
APPENDIX
Note the following definitions.
. A simple undirected graph where is a countable set of vertices and is a set of undirected edges between the vertices. This graph is to be interpreted as the contact network on which the disease spreads with the vertices representing individuals and edges representing possible transmission routes.
. The set of neighbor of . Specifically, .
. The random infectious period of individual .
. The random time between entering the infectious state and its subsequent sufficient contact to .
. The transmissibility random variable for . It is only defined for the case where the are independent and identically distributed. In this case we have , where for all .
. The random exposed period of individual .
. A random variable which is a measure of the susceptibility of , in the sense that the may depend on it.
. A random variable which is a measure of the infec- tiousness of , in the sense that the may depend on it.
. Let and be two real-valued random variables. is greater than in convex order, and we write or (where these are the cumulative distribution functions), if and only if for all convex functions .
. Let and be two real-valued random variables. is greater than in decreasing convex order, and we write or (where these are the cumulative distribution functions), if and only if for all nonincreasing convex functions (see Sec. 4.A.1 in [16]). (Note that implies .)
. If and are two real-valued random variables then is less than in the usual stochastic order, and we write or (where these are the cumulative distribution functions), if and only if for all nonincreasing functions .
. If and are two real-valued random variables then , and we say that and are equal in distribution, if and only if their cumulative distribution functions are equal over the whole domain of real numbers.
. The probability that the infection spreads to by time . Specifically, the probably that at least one member of is initially infectious, or is initially susceptible and receives an infectious contact before or at time .
Note that every edge of may be specified as an unordered pair of vertices. Let us now replace each edge by two oppositely directed arcs and note that every arc may be specified as an ordered pair of vertices. Let a finite sequence of arcs be a finite simple path between vertices and if and only if is an arc between and for all where for all , and for all where and . All the paths with which we are concerned are finite simple paths and we refer to them as paths for brevity.
Now, following Donnelly [9], let us define the variables
and associate arc with for all . We define the total weighting of path to be the sum of the (infectious contact times) associated with its arcs plus the sum of the (exposed periods), excepting the first and last, associated with its individuals. We can write
Letting be the set of paths which have an initially infectious individual at the start and some member of at the end, and where every individual except the first is initially susceptible, it then follows from the above definitions that is the time at which the first infectious contact to an initially susceptible member of occurs. We assume when is empty. We can now write
(A1) |
where the measure is the distribution of the initial state of the population and (, susceptible; , infected; , recovered/vaccinated); is the subset of which has no members of (initially) infected. Note that the probability in the integrand does not need to be conditioned on the initial state of the population because we are assuming that the weightings of paths are independent from the initial state.
Theorem 1. Let epidemic 1 and epidemic 2 be two parametrizations of the stochastic model. Assume that they are the same except for the distributions of the infectious periods of individuals in . Assume that for all at least one of the following conditions holds, where is the cumulative distribution function of random variable in epidemic .
(a) , and cannot be infinite.
(b) and is convex in for all possible and all , and cannot be infinite.
(c) and the are i.i.d.
(d) and the are i.i.d.
Then the probability that the infection spreads to by time is greater, or the same, in epidemic 2 than in epidemic 1 for all and all .
Proof 1. We prove the theorem by showing that the probability in the integrand of (A1) is greater, or the same, in epidemic 1 than in epidemic 2 for all initial states (the distribution of the initial state is the same for epidemics 1 and 2). Let be an arbitrary set of paths and assume, for now, that is finite and that where , and let us label all of 's neighbor via an arbitrary bijection to where . For any arc , where , let where the infimum is over all paths which contain . If there are no paths in which contain then we let . Let where the infimum is over all paths which do not contain an arc where . If all paths in contain an arc , where , then we let . Thus, we may write
(A2) |
By assumption, the two sets of random variables and are conditionally independent of each other given , where . We can now express (A2) as
(A3) |
where and is the range of . The measure is the joint distribution of , where correspond to respectively, corresponds to , and corresponds to . We can now reexpress (A3) as
(A4) |
where, for ,
is convex if (b) holds and is nonincreasing in any case. Thus if condition (a) or (b) holds then (A4) [equal to (A2)] is greater in epidemic 1, or the same, than in epidemic 2 by the definitions of the decreasing convex and usual stochastic orders.
If condition (c) or (d) holds, we can reexpress (A3) as
(A5) |
where, for ,
is convex and nonincreasing. Thus if condition (c) or (d) holds then (A5) [equal to (A2)] is greater in epidemic 1, or the same, than in epidemic 2 by the definitions of the decreasing convex and usual stochastic orders. Thus, the theorem is true for this special case.
In the case where is infinite we may use the continuity of probability measures to write
where is the finite set consisting of the first paths in ( is countable since is countable by assumption and the set of finite subsets of a countable set is countable). Thus when is infinite the theorem still holds for this special case.
Since is finite or countably infinite and the infectious period distributions are arbitrary we may repeatedly apply the theorem in the special case where , which we have already proved, to prove the theorem in general.
Theorem 2. Consider the message passing epidemic model (6)–(9) and assume that .
(a) If the infectious periods of are increased in the usual stochastic order then is decreased or remains the same, for all and all .
(b) If the infectious periods of are decreased in convex order, and is nonincreasing for all , then is decreased or remains the same, for all and all .
(c) Assume that for each the are i.i.d. (let denote the random sufficient contact time with this distribution) and define the transmissibility variable . If the infectious periods of are altered such that the are increased in the usual stochastic order, or decreased in convex order, then is decreased or remains the same, for all and all .
(d) For all , let be given by the equation for [Eq. (9)] when it is modified such that is replaced by , where is the Heaviside step function and satisfies (10). Then is a lower bound on for all . Additionally, for all , let be given by the equation for [Eq. (9)] when it is modified such that is replaced by a constant which satisfies (11). Then is an upper bound on for all .
Proof 2. The message passing model (6)–(9) has a unique feasible solution if , by Theorem 1 in [15], and we assume this to be the case. Let . With reference to (9), for , , , let
(A6) |
and let for all . Here, and are independent non-negative random variables, for all , and are defined such that the PDF for satisfies
and the CDF for satisfies
The second equality in (A6) then follows from how the CDF for the sum of two non-negative independent random variables is formed from their two respective distributions. This is discussed further below [see (A7)].
It is the case that for all , and so , and recall that . Thus, using (A6), we may prove parts (a), (b), and (c) of the theorem by showing that their conditions lead to and increasing for all , where these are the CDFs for the and the . This is because the CDF for the sum of two independent non-negative random variables, and , is given by
(A7) |
where . Thus is increased if is increased on and, by induction, if is simultaneously increased on .
If the CDFs for the are increased on , and if the CDFs for the are increased on , then is decreased for all , whence the CDFs for the are increased on . Now note that if the CDFs for the are increased on then is decreased for all , whence the CDFs for the are increased on .
We may now prove parts (a), (b), and (c) of the theorem by showing that the conditions in each part lead to the CDFs for the increasing on , since then, by induction, the CDFs for the are also increased on . We do this by showing that the survival functions for the are decreased on for all (if then is unaltered).
First, for , we may write
(A8) |
where, for ,
is convex if the conditions in (b) are met and is nonincreasing in any case. Thus, the conditions in (a) and (b) lead to the expectation in (A8) decreasing and hence to the survival functions for the decreasing on .
Similarly, if the conditions for (c) are met then for , we may write
(A9) |
where, for ,
is convex and nonincreasing. Thus, the conditions in (c) lead to the expectation in (A9) decreasing and hence to the survival functions for the decreasing on .
To prove part (d) of the theorem, let us replace in (A6) and in (9) by , and redefine the PDF for to be equal to on . Let us define such that (A9) still holds if is replaced by . Initially, assume that the distribution of is the same as for so that the model gives exactly the same output. Now, altering the distributions of the such that the are decreased in convex order then is decreased or remains the same for all and all by the same argument as for part (c) of the theorem. Thus, using the graphical sufficient condition for the convex order, we may achieve a lower bound on for all , by setting to be nonrandom while holding constant for all , and an upper bound by setting such that it may be only zero or infinite while holding constant for all . In the former case, in (9) becomes where is the Heaviside step function and satisfies (10), while in the latter case it becomes a constant which is defined to satisfy (11).
Theorem 3. Let be given by the Kermack-McKendrick [11] model (12)–(14) but with (the infectious period) replaced by . Similarly, let be given by (12)–(14) but with replaced by . Define to be and to be , where is given in (12)–(14). Let
and assume at least one of the following conditions.
(i) and the infectious period cannot be infinite.
(ii) and is nonincreasing on and the infectious period cannot be infinite.
(iii) .
(iv) .
Then for all , we have .
Proof 3. Following Sec. 3 of [15], let us consider the special case of the stochastic model where the contact network/graph is an infinite -regular tree (also known as a Bethe lattice); for all (we remove the exposed state); for all (let denote the random infectious period with this distribution); the variables are independent from the variables and so may be discarded; for all (let denote the random sufficient contact time with this distribution); the states of individuals at are i.i.d. random variables.
Now consider a sequence of such stochastic models indexed by (where is also the regular degree of the contact network). We let the density function , for the sufficient contact time , depend on as follows:
where is taken from the Kermack-McKendrick model (12)–(14). Note that if is nonincreasing then the density function is nonincreasing and the survival function is convex, for all . Thus, if condition (i) or (ii) holds, then condition (a) or (b) holds for Theorem 1 and we have that is greater, or the same, if than if , for all .
Note that the transmissibility random variable must now depend on and, letting , we have (to check this, express both sides as a function of and note that both sides are equal when and both sides have the same derivative with respect to ). Therefore if or then or , where and . This follows since any decreasing function of can be expressed as a decreasing function of [since is an increasing function of , and any decreasing convex function of can be expressed as a convex function of [since is a concave function of . Thus, if condition (iii) or (iv) holds, then condition (c) or (d) holds for Theorem 1 and we have that is greater, or the same, if than if , for all .
The probability that individual is susceptible at time , which by symmetry is the same for all , is equal to minus the probability that the individual is initially recovered/vaccinated. Thus, if at least one of the conditions holds then, by Theorem 1, the probability that an arbitrary individual is susceptible at time is greater (or the same) if than if , for every model in the sequence, i.e., for all .
Now, using Theorem 6 in [15], which tells us that as the probability that an arbitrary individual is susceptible at time converges to given by the Kermack-McKendrick model (12)–(14) [since here the message passing system is exact [15] and , satisfying condition (i) of that theorem], we have for all .
REFERENCES
- [1].T. Britton and D. Lindenstrand, Epidemic modeling: Aspects where stochasticity matters, Math. Biosci. 222, 109 (2009). 10.1016/j.mbs.2009.10.001 [DOI] [PubMed] [Google Scholar]
- [2].C. Lefevre and P. Picard, An unusual stochastic order relation with some applications in sampling and epidemic theory, Adv. Appl. Prob. 25, 63 (1993). 10.2307/1427496 [DOI] [Google Scholar]
- [3].M. E. J. Newman, Networks: An Introduction (Oxford University, New York, 2010). [Google Scholar]
- [4].K. Kuulasmaa, The spatial general epidemic and locally dependent random graphs, J. Appl. Probab. 19, 745 (1982). 10.2307/3213827 [DOI] [Google Scholar]
- [5].A. L. Lloyd, Realistic distributions of infectious periods in epidemic models: Changing patterns of persistence and dynamics, Theor. Popul. Biol. 60, 59 (2001). 10.1006/tpbi.2001.1525 [DOI] [PubMed] [Google Scholar]
- [6].M. Eichner and K. Dietz, Transmission potential of smallpox: Estimates based on detailed data from an outbreak, Am. J. Epidemiol. 158, 110 (2003). 10.1093/aje/kwg103 [DOI] [PubMed] [Google Scholar]
- [7].N. Bailey, On estimating the latent and infectious periods of measles: I. Families with two susceptible only, Biometrika 43, 15 (1956). 10.2307/2333574 [DOI] [Google Scholar]
- [8].G. Chowell and H. Nishiura, Transmission dynamics and control of ebola virus disease (EVD): A review, BMC Medicine 12, 196 (2014). 10.1186/s12916-014-0196-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].P. Donnelly, The correlation structure of epidemic models, Math. Biosci. 117, 49 (1993). 10.1016/0025-5564(93)90017-5 [DOI] [PubMed] [Google Scholar]
- [10].B. Karrer and M. E. J. Newman, A message passing approach for general epidemic models, Phys. Rev. E 82, 016101 (2010). 10.1103/PhysRevE.82.016101 [DOI] [PubMed] [Google Scholar]
- [11].W. O. Kermack and A. G. McKendrick, A contribution to the mathematical theory of epidemics, Proc. R. Soc. A 115, 700 (1927). 10.1098/rspa.1927.0118 [DOI] [PubMed] [Google Scholar]
- [12].M. J. Keeling, The effects of local spatial structure on epidemiological invasions, Proc. R. Soc. London B 266, 859 (1999). 10.1098/rspb.1999.0716 [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].R. R. Wilkinson and K. J. Sharkey, Message passing and moment closure for susceptible-infected-recovered epidemics on finite networks, Phys. Rev. E 89, 022808 (2014). 10.1103/PhysRevE.89.022808 [DOI] [PubMed] [Google Scholar]
- [14].D. Mollison, Spatial contact models for ecological and epidemic spread, J. Royal Stat. Soc. B 39, 283 (1977). [Google Scholar]
- [15].R. R. Wilkinson, F. G. Ball, and K. J. Sharkey, The relationships between message passing, pairwise, Kermack-McKendrick and stochastic SIR epidemic models, J. Math. Biol. 75, 1563 (2017). 10.1007/s00285-017-1123-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].M. Shaked and J. G. Shanthikumar, Stochastic Orders (Springer, New York, 2007). [Google Scholar]
- [17].J. C. Miller, Epidemic size and probability in populations with heterogeneous infectivity and susceptibility, Phys. Rev. E 76, 010101(R) (2007). 10.1103/PhysRevE.76.010101 [DOI] [PubMed] [Google Scholar]
- [18].F. M. Neri, F. J. Pérez-Reche, S. N. Taraskin, and C. A. Gilligan, Heterogeneity in susceptible-infected-removed (SIR) epidemics on lattices, J. R. Soc. Interface 8, 201 (2011). 10.1098/rsif.2010.0325 [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].A. L. Barabási, The origin of bursts and heavy tails in human dynamics, Nature (London) 435, 207 (2005). 10.1038/nature03459 [DOI] [PubMed] [Google Scholar]
- [20].M. Karsai, H.-H. Jo, and K. Kaski, Bursty Human Dynamics (Springer International Publishing, Berlin, 2018). [Google Scholar]
- [21].R. Meester and P. Trapman, Bounding basic characteristics of spatial epidemics with a new percolation model, Adv. Appl. Probab. 43, 335 (2011). 10.1239/aap/1308662482 [DOI] [Google Scholar]
- [22].H. Knolle, A discrete branching process model for the spread of HIV via steady sexual partnerships, J. Math. Biol. 48, 423 (2004). 10.1007/s00285-003-0241-7 [DOI] [PubMed] [Google Scholar]
- [23].C. H. Watts and R. M. May, The influence of concurrent partnerships on the dynamics of HIV/AIDS, Math. Biosci. 108, 89 (1992). 10.1016/0025-5564(92)90006-I [DOI] [PubMed] [Google Scholar]
- [24].P. Van Mieghem and R. van de Bovenkamp, Non-Markovian Infection Spread Dramatically Alters the Susceptible-Infected-Susceptible Epidemic Threshold in Networks, Phys. Rev. Lett. 110, 108701 (2013). 10.1103/PhysRevLett.110.108701 [DOI] [PubMed] [Google Scholar]
- [25].M. Starnini, J. P. Gleeson, and M. Boguñá, Equivalence Between Non-Markovian and Markovian Dynamics in Epidemic Spreading Processes, Phys. Rev. Lett. 118, 128301 (2017). 10.1103/PhysRevLett.118.128301 [DOI] [PubMed] [Google Scholar]
- [26].E. Cator, R. van de Bovenkamp, and P. Van Mieghem, Susceptible-infected-susceptible epidemics on networks with general infection and cure times, Phys. Rev. E 87, 062816 (2013). 10.1103/PhysRevE.87.062816 [DOI] [PubMed] [Google Scholar]
- [27].N. Sherborne, K. B. Blyuss, and I. Z. Kiss, Dynamics of multi-stage infections on networks, Bull. Math. Biol. 77, 1909 (2015). 10.1007/s11538-015-0109-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- [28].M. Karsai, K. Kaski, A. L. Barabási, and J. Ketész, Universal features of correlated bursty behavior, Scientific Reports 2, 397 (2012). 10.1038/srep00397 [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].F. Ball, T. Britton, and P. Neal, On expected durations of birth-death processes, with applications to branching processes and SIS epidemics, J. Appl. Probab. 53, 203 (2016). 10.1017/jpr.2015.19 [DOI] [Google Scholar]