SUMMARY
This paper considers novel Bayesian non-parametric methods for stochastic epidemic models. Many standard modeling and data analysis methods use underlying assumptions (e.g. concerning the rate at which new cases of disease will occur) which are rarely challenged or tested in practice. To relax these assumptions, we develop a Bayesian non-parametric approach using Gaussian Processes, specifically to estimate the infection process. The methods are illustrated with both simulated and real data sets, the former illustrating that the methods can recover the true infection process quite well in practice, and the latter illustrating that the methods can be successfully applied in different settings.
Keywords: Bayesian non-parametrics, Epidemic model, Gaussian process, SIR model
1. Introduction
This paper is concerned with developing methods of Bayesian non-parametric inference for stochastic epidemic models which are partially observed through time. Our specific focus is on models of the susceptible-infective-removed (SIR) type in which only removals are observed. The following paragraphs contain the background and motivation for this work.
Methods for fitting stochastic models of disease transmission to outbreak data have been the subject of considerable research activity during the past fifteen years or so. Popular approaches include Markov chain Monte Carlo (MCMC) methods (Gibson and Renshaw, 1998; O'Neill and Roberts, 1999; O'Neill and others, 2000), Approximate Bayesian Computation methods (McKinley and others, 2009) and Sequential Monte Carlo methods (Ionides and others, 2006), all of which assume an underlying parametric model whose parameters are to be inferred from the data to hand. A natural alternative is to consider non-parametric approaches, which do not assume a specific parametric model and therefore offer greater potential flexibility. Another motivating factor in the context of epidemic models is that it is often unclear how best to assess the goodness of fit of parametric epidemic models, and hence hard to quantify the extent to which the underlying model assumptions are in line with observed data.
Non-parametric methods have to date received relatively little attention in the epidemic modeling literature, and to our knowledge there have been no previous Bayesian approaches. Both Becker and Yip (1989) and Becker (1989) consider non-parametric estimation of the infection rate in the so-called general epidemic model (i.e. the SIR model with infectious periods distributed according to an exponential distribution) by allowing the infection rate to depend on time. Their approach uses estimating equations, derived from suitable martingales, in combination with a suitable smoothing kernel. The approach also requires infection times to be known, which is rarely the case in reality, although by assuming fixed-length infectious periods the infection times are immediately specified by the observed removal times. In contrast, Lau and Yip (2008) assume only removals are observed, and use a kernel estimator to estimate the unobserved process of infectives, assuming that the parameter of the exponential infectious period distribution is known. Finally, Chen and others (2008) consider a related problem in which kernel estimation is used to estimate the infection rate in a large-scale epidemic model in which the depletion of susceptibles is ignored.
In this paper, we consider an SIR model in a closed population, and for simplicity and comparison with other methods, we assume exponentially distributed infectious periods. However, our methods can also be applied to general infectious period distributions. Throughout, we assume that the available data consist only of removal times, so that the infection process itself is unobserved. This assumption can usually be interpreted to mean that we observe case detection times, and that, at these times, the individuals in question are no longer able to infect others, perhaps due to isolation or other interventions. We assume that the infection mechanism is time-dependent, specifically making one of two assumptions.
First, we suppose that the usual overall incidence rate of new infections, , is replaced by
, where
and
) are the numbers of susceptibles and infectives at time
, respectively. This approach is a natural starting point for non-parametric inference since it retains the usual natural mass-action assumption for new infections. In the second approach, we replace
by
, where
is the indicator function of the event
. A key motivation here is to relax the usual mass-action assumption, which at least informally enables us to investigate the validity of assuming a particular parametric form for the incidence rate.
Although the assumption that is constant over time is a common one in epidemic modeling, it is certainly not always realistic (Fang and others, 2004). Specifically,
could vary over time as a result of factors such as behavior change in response to the epidemic, the introduction of control or mitigation measures, greater public awareness of the epidemic, and so on. There have been numerous approaches to estimating time-dependent infection rates for epidemic models, including the assumption of a particular parametric form with few parameters (Becker, 1989; Lekone and Finkenstadt, 2006) or many parameters (Ionides and others, 2006; Cauchemez and Ferguson, 2008), inference in deterministic models assuming that the number of infectives is known (Pollicott and others, 2012) and for modified deterministic models by assuming that the infection rate is a function of a diffusion process (Dureau and others, 2013). All of these approaches are distinct from that which we consider.
We adopt a Bayesian framework, and base our non-parametric modeling on Gaussian Processes (GPs). Specifically, we place GP prior distributions on the incidence rate or infection rate as appropriate, and adapt a method for estimating the intensity function of an inhomogeneous Poisson process (Adams and others, 2009) to our setting. The methods involve MCMC algorithms.
The paper is organized as follows. In Section 2, we define the epidemic models of interest, and briefly recall some key facts about GPs. In Sections 3 and 4, we present estimation methods for the infection rate and overall incidence rate, respectively. Section 5 contains some numerical illustrations of the methods, and we conclude in Section 6 with some discussion.
2. Background
2.1. Multitype SIR epidemic model
We first recall a multitype stochastic epidemic model (see, e.g. Becker and Hopper, 1983). In this model, individuals are grouped according to their susceptibility to disease, but all individuals are equally infectious if they become infected.
Consider a population consisting of groups, labeled
, such that the
th group initially contains
susceptible individuals. At any time, each individual in the population can be in one of three states, namely susceptible, infective, or removed. An epidemic is initiated in this population at time
by one of the susceptibles becoming infective. For
and
denote by
and
, respectively, the numbers of susceptibles and infectives in the
th group at time
. Let
denote the total number of infectives in the population at time
. The epidemic can be defined as a bivariate Markov chain with the following transition probabilities, which correspond, respectively, to an infection in group
and a removal:
![]() |
with all other transitions having probability . It follows that (i) the overall rate of new infections, which we refer to as the incidence rate, is given by
, and (ii) individuals have infectious periods that are independently distributed according to an exponential distribution with mean
. The
parameters are called infection rates. If
there is only one group, the model reduces to the general stochastic epidemic model (see, e.g. Bailey, 1975), and in this case we write
for
.
2.2. GPs and inference for Poisson processes
We now briefly review relevant facts about GPs (see, e.g. Rasmussen and Williams, 2006), and a method of Bayesian non-parametric inference for time-inhomogeneous Poisson processes that we shall adapt to our setting. Recall that a GP is a stochastic process whose realizations consist of Gaussian random variables indexed by some set (in our case, the set of times in some interval). A GP is completely specified by its mean and covariance function. Throughout this paper, we will only use zero-mean GPs, so that we need only specify a covariance function. GPs are commonly used in Bayesian inference to act as prior distributions over spaces of functions which are themselves the object of inference. We will use GPs as prior distributions for infection rate and incidence rate functions.
A closely related problem to that we will consider is that of Bayesian non-parametric inference for a time-inhomogeneous Poisson process. Specifically, suppose we observe a set of points from a Poisson process with time-dependent intensity
during
. The likelihood is specified by
![]() |
where . If
is the object of inference, then in the non-parametric setting,
is an infinite-dimensional object (specifically, it is defined by the set of values
), which in turn makes the integral in the likelihood intractable in practice.
A method for overcoming this difficulty is described in Adams and others (2009). The key idea is that the original process can be viewed as a thinned homogeneous Poisson process of rate , where
for all
, in which each point is retained with probability
(see Lewis and Shedler, 1979). Inference then proceeds by augmenting the observed data with the unobserved thinned points,
, say, yielding an augmented likelihood
![]() |
Finally, since it cannot be assigned a GP prior distribution directly, but instead we use the transformation
, where
, and assign a GP prior distribution to
. This prior distribution is specified by assuming a particular form of covariance function with parameter vector
, and placing a prior distribution on
.
Let . Then from Bayes' Theorem the posterior density of interest is
![]() |
where is the density of a multivariate Gaussian random variable, and
and
, respectively, are the prior density functions of
and
. In practice these prior distributions are often fairly uninformative; note also that a Gamma prior distribution for
is (conditionally on the data and other parameters) conjugate. The posterior density itself can be explored using MCMC methods as described in Adams and others (2009), and since
is specified by
and
, we can hence obtain posterior samples for
.
3. Non-parametric inference for the infection rate
In this section, we modify the multitype SIR model by replacing the group- infection rate
with
, and then describe how to estimate the latter in a Bayesian non-parametric framework. Thus the transition probabilities governing the model become
![]() |
We assume that the data consist of the unique group membership of each individual in the population, the removal times in each group, and the initial numbers of susceptibles . For convenience, we assume time zero is the time of the first observed removal time and that all removals are observed during
. Our objective is to infer both the infection rate functions
and the removal rate
. We start with some notation.
For , let
and
denote, respectively, the numbers of observed removals and unobserved infections in group
. Let
denote the set of ordered removal times in group
, so that
, and let
denote the vector of ordered infection times in group
. Set
as the time of the initial infection and
as the group of the initial infective, so that if
, then
. Let
denote all infection times other than the initial infection time. Finally, set
,
,
and let
denote the initial number of susceptibles in the whole population.
The likelihood of the observed data given the model parameters is analytically and computationally intractable in all but the simplest cases, essentially because its calculation requires integrating over all possible unobserved infection events. We therefore adopt the data augmentation approach in Hayakawa and others (2003) in which the unobserved infection times become model parameters. This leads to the augmented likelihood
![]() |
We now place a prior distribution on by setting
, where
is an upper bound on
,
and
has a GP distribution with parameter vector
. As for the Poisson process example described above, this makes the augmented likelihood intractable, so we proceed by regarding the infection process as a thinned homogeneous Poisson process, as follows.
For group for which
, we introduce the additional variables (i) the number of thinned events,
; (ii) the locations of the thinned events,
; (iii) the
function values at the infection times,
and (iv) the
function values at the locations of thinned events,
. For groups
with
, we introduce the corresponding variables but also need to consider the first infection time,
, in group
. Specifically,
is added to
. The augmented likelihood thus becomes
![]() |
where is the concatenation of
and
and
. Note that the values of
and
do not change at the times of the thinned events. The posterior density of interest is then given by
![]() |
where the final four terms denote prior densities. Note that we assume independent prior distributions; this can of course be relaxed, but independence seems the most natural assumption. Samples from the posterior density can be obtained using MCMC methods, as described in the online supplementary material (available at Biostatistics online).
4. Non-parametric inference for the overall incidence rate
We now adapt the methods described above to the situation in which the mass-action assumption is relaxed. Specifically, we are interested in estimating the overall incidence rate in the absence of any particular parametric form for the infection process model. We restrict attention to the single-type epidemic, in keeping with the motivation to make as few assumptions as possible about the infection mechanism. However, the methods could be extended to multitype models.
The model of interest is a single-type SIR model in which the overall incidence rate is replaced by
, where
denotes the indicator function of the event
. Thus infections can only occur if there is at least one infective and one susceptible. The epidemic can be described according to the transition probabilities
![]() |
As before, we assume that we observe only the removal times and the initial number of susceptibles. Our objective is to infer both and
.
Denote the observed ordered removal times in as
, so
. Let
be the unobserved time of the first infection and let
denote the remaining unobserved ordered infection times during
, so
. We assume that the epidemic is known to have ceased by time
, i.e. the number of infection times and removal times are equal. In order that the epidemic does not die out before all the observed removals have occurred, we require that
for
. The augmented likelihood of the removal times and unobserved infection times is
![]() |
see, for example, O'Neill and Roberts (1999), where denotes the indicator function of the event that
for
, i.e.
if and only if there is at least one infective and one susceptible present when each infection event occurs.
As before, we place a GP prior on by setting
, where
is an upper bound on
,
, and
is a random function that has a GP distribution with parameter vector
. Suppose that the (unobserved) set of thinned infection events is
and define
as the vector of values of
at these events. Similarly define
as the vector of values of
at the infection times, and denote by
the concatenation of
and
. We then obtain the augmented likelihood
![]() |
and the posterior density of interest is
![]() |
where ,
,
and
denote prior densities. The posterior density can be explored using MCMC methods as described in the online supplementary material (available at Biostatistics online).
5. Examples
We now illustrate our methods via some examples. Simulated data are used to show that the methods work in practice. We then consider two classical data sets in small populations, and finish with a more elaborate example to show that the methods can also be extended to larger population settings.
Unless stated otherwise, we use the squared exponential covariance function for the GP prior (see, e.g. Rasmussen and Williams, 2006), i.e.
![]() |
with . We comment on the use of different priors later. The prior distribution for the hyperparameter
was set to be
, where
denotes an exponential random variable with mean
. The parameters
,
, and
were assigned independent
prior distributions, which, for the examples below corresponds to fairly uninformative prior beliefs. In general, the choice of prior distribution will be problem-specific, depending on the time scale in question. Finally, we defined a time axis by setting the first removal time equal to time zero and then set
a priori.
5.1. Simulated data
Full details of a simulation study can be found in the online supplementary material (available at Biostatistics online). We considered three scenarios, namely a single-type model with constant infection rate, a single-type model with infection rate exponentially decreasing through time, and a multitype model with three types of individual. In each case we simulated 50 data sets, and applied our methods to each resulting data set. In each scenario, we found that the true infection rate was well estimated on average across the simulations, while estimates based on single data sets were also reasonable, even for relatively small epidemic outbreaks.
5.2. Abakaliki smallpox data
We now consider a classic smallpox data set taken from Bailey (1975, p. 125). The data were originally reported in a World Health Organization report (Thompson and Foege, 1968) and the time series of 30 case detection times, assuming a homogeneously mixing population of 120 individuals, have since been analyzed by numerous authors (e.g. Bailey and Thomas, 1971; Becker, 1989; Rida, 1991; O'Neill and Roberts, 1999; Boys and Giles, 2007 and references therein), while Eichner and Dietz (2003) provide a far more comprehensive analysis that takes into account the mixing structure of the population and other important factors. In terms of non-parametric analyses, Becker and Yip (1989) assume known infection times and latent periods, and use a kernel smoothing method to estimate the infection rate as a function of time, concluding that the infection rate displays some oscillation over time but with a gradual downward trend during the outbreak. Finally, Lau and Yip (2008) use kernel smoothing to obtain a non-parametric estimate of the trajectory of infectives, but assume the infection rate itself is fixed.
In our analysis, we assume that only case detection times are available, corresponding to removal times. The infection times for each individual and the removal rate are all inferred via the MCMC algorithm. Figure 1 shows posterior summaries for the infection rate and incidence rate. The infection rate shows a slight initial increase followed by a slight gradual decline. The fact that we do not see the oscillations in the Becker–Yip analysis is most likely due to the fact that our estimation of the infection times provides a degree of smoothing. The incidence rate curve peaks at around , which is around the time that control measures were estimated to have been introduced during the outbreak (Eichner and Dietz, 2003). For comparison, if the standard SIR model is fitted in a Bayesian framework, the posterior mean (standard deviation) of the infection rate
is
(
) (Fearnhead and Meligkotsidou, 2004), which is evidently very similar to the values taken by
in our analysis.
Fig. 1.
Estimation of the infection rate (a) and incidence rate (b) for the Abakaliki smallpox data. Both plots show the posterior mean (thick line), 95% posterior credible intervals (thin line), and the days on which cases were detected in the data set (vertical dashes at top of plot). All curves are plotted over the mean posterior time during which infectives were present in the population.
5.3. Tristan da Cunha respiratory disease data
We also apply our methods to a data set analyzed in Becker and Hopper (1983) and Hayakawa and others (2003). The data set corresponds to removal times of individuals with a respiratory disease which occurred between October and November of 1967 on the island of Tristan da Cunha in the South Atlantic. The total population of the island of 255 was partitioned into three groups by age: infants, children, and adults. As there was one unidentified case, we suppose that . The initial number of susceptibles are
,
and
, and we assume that the initial infective is an adult since the first case was an adult and occurred 9 days before the first non-adult case. The number of cases in each group was 9 (infants), 6 (children), and 25 (adults).
Figure 2 shows the estimation results for the infection rate of each group. The infection rate among children falls slightly during the outbreak whilst that for adults gradually increases. These findings are not unreasonable given the pattern of removals in these groups; for instance, most of the adult cases occur in the second half of the outbreak. Nevertheless, it is also clear that assuming constant infection rates would not be unreasonable for these data.
Fig. 2.
Estimation of the infection rates for infants, children, and adults for the Tristan da Cunha respiratory disease data. All plots show the posterior mean (thick line), 95% posterior credible intervals (thin line), and the days on which cases were detected in the data set (vertical dashes at top of plot). All curves are plotted over the mean posterior time during which infectives were present in the population.
The analysis in Hayakawa and others (2003) assumes the same model as ours, but with infection rates constant through time. Posterior mean (standard deviation) infection rate estimates of 0.0045 (0.0018), 0.0018 (0.00082), and 0.0013 (0.00038) were obtained for infants, children, and adults, respectively, using MCMC methods. These are clearly comparable with our results.
5.4. London measles data
Our final example shows that the idea of using GPs as a prior distribution for an infection rate function can also be applied to other settings. We specifically consider an application to an inference method for large populations. The full details are rather extensive and can be found in Xu (2015); here we just report the key aspects.
Consider data aggregated into time intervals (such as weeks or fortnights) consisting of numbers of new infections occurring during each time interval. For an SIR model, inference for the infection rate over any such time interval could in theory be achieved by imputing the numbers and times of infection and removal events, but in practice this approach breaks down in large populations unless the time interval is small, because the parameter space becomes too large to explore efficiently. Cauchemez and Ferguson (2008) tackle this problem by (i) assuming that the number of susceptibles in a time interval is constant, which implies that the infectives process is now a linear birth–death process; (ii) using an analytically tractable diffusion process to approximate the birth–death process, yielding a tractable approximation to the true likelihood. The methods also allow incorporation of data on new susceptibles, such as the number of births in each time interval in the context of childhood diseases, and estimation of the case reporting rate.
Cauchemez and Ferguson (2008) applied this method to fortnightly pre-vaccination-era data on measles in London, in which the data consist of numbers of new cases and births in each bi-weekly time interval. The authors estimated the infection rate for each time interval during a year, , say, under the assumption that the infection rates are viewed as independent parameters with no structural dependence. In reality we might expect to see some relationship between infection rates, e.g.
might be close to
and
. Cauchemez and Ferguson (2008) also modeled year-by-year trends by using the same infection rate parameters every year, but assuming that the incidence rate in each bi-weekly period was inversely proportional to the size of the number of children under the age of 4, itself known from the available data. In our framework, an alternative way to allow annual variation is to include a separate infection rate parameter for each 2-week time period in the entire data set, but then impose a GP prior on these parameters with a suitable periodic covariance structure.
We introduced these aspects by implementing the Cauchemez–Ferguson (CF) method with a non-parametrically modeled infection rate with a GP prior distribution. Specifically, we assume that the infection rate is constant in each fortnightly time interval, and then regard the mid-point of each interval as an input for the GP. Two scenarios are adopted: first, that the infection rate function was the same year-by-year, and second that it was not. We also implemented the CF method, without the trend modeling, for comparison. Cauchemez and Ferguson (2008) also considered incidence rates of the form , which we do not consider here since our focus is toward assessing and illustrating our non-parametric approach, and extra model complexity reduces the transparency of these activities. We used 10 years of measles data, 1948–1957. Both our method and the CF method involve imputation of variables necessary to compute a likelihood, namely the number of unobserved cases in each time interval, the number of infectives at the start of each time interval, and the number of susceptibles at the start of the first time interval. Both methods also assume that the mean infectious period is 14 days. For our second non-parametric method, we used the periodic covariance function
![]() |
in which represents the length of one period, which we set equal to 364 days (i.e. 26 bi-weeks, each 14 days). Rather than fixing
we included it as a model parameter, and assigned
and
independent
prior distributions.
Figure 3 shows a comparison of the posterior mean of the infection rates from the CF method and the Bayesian non-parametric method, the latter using a squared exponential covariance function for the GP prior. Both methods produce relatively similar estimates. Figure 4 compares the CF method with the periodic covariance function, again showing posterior mean values. Here, the CF method is as before, so each year has identical estimates, as shown in Figure 3. It appears visually that there is relatively little variation year-by-year in the non-parametric estimate; in fact, there are differences, but they are very small, typically of the order
. We also attempted to use the CF method by individually estimating an infection rate for each of the 260 bi-week intervals, but the mixing of the MCMC algorithm was prohibitively poor. Our periodic covariance function method therefore appears to offer a viable alternative in this setting.
Fig. 3.
Posterior mean of the infection rates over 1 year for the CF method (dashed line) and NP method (solid line) using the squared exponential covariance function for the GP for the London measles data.
Fig. 4.
Posterior mean of the infection rates over 10 years for the CF method (dashed line) and NP method (solid line) using the periodic covariance function for the GP for the London measles data.
Table 1 shows some posterior parameter estimates for the three methods. Broadly speaking these are fairly similar, although there are some differences in the estimate of the initial number of susceptibles.
Table 1.
Posterior mean, equal-tailed credible intervals, and posterior standard deviation for the reporting rate,
the initial number of susceptibles
and infectives
and two illustrative infection rates for the London measles data. CF, NP, and NP periodic represent, respectively, the CF method, the non-parametric method using the squared exponential covariance function for the GP prior with seasonal infection rates, and the non-parametric method using the periodic covariance function for the GP prior without assuming the infection rates are seasonal
CF | NP | NP periodic | |
---|---|---|---|
![]() ![]() |
50.87 [50.69, 51.05] (0.11) | 50.71 [50.56, 50.87] (0.10) | 50.26 [50.09, 50.44] (0.11) |
![]() |
164 [160, 168] (2.6) | 161 [158, 164] (2.1) | 185 [180, 190] (3.0) |
![]() |
603 [551, 659] (33.3) | 605 [550, 661] (33.7) | 602 [547, 658] (33.9) |
![]() |
5.99 [5.81, 6.17] (0.11) | 6.08 [5.92, 6.24] (0.10) | 5.31 [5.15, 5.47] (0.10) |
![]() |
4.91 [4.74, 5.06] (0.10) | 5.02 [4.88, 5.17] (0.09) | 4.18 [4.05, 4.32] (0.08) |
6. Discussion
We have demonstrated that Bayesian non-parametric inference for epidemic models can be achieved using GP methods. Although our work is preliminary, it appears worthy of further exploration. The methods that we have developed can be extended to other settings, such as epidemic models with non-exponential infection periods and with latent periods, and those with more complex population mixing structures.
We have largely employed a squared exponential covariance function to define a prior for the GP of interest in our examples. However, we found that using other covariance functions such as exponential or Matern class functions did not have a material impact on the results, with the main difference in results being the smoothness of the posterior estimates for the infection rate functions. More details can be found in Xu (2015). A more general question is how best to choose a covariance function; this is partly an issue of model assessment, i.e. selecting a covariance function to give the best fit to data, and an interesting avenue for future research. In some settings certain covariance functions might arise naturally, as illustrated by our use of a periodic covariance function in the measles data example.
One practical challenge with our methods is that the computational complexity increases with the size of the problem. Specifically, calculating the multivariate Gaussian density functions involves inversion of the GP covariance matrix, an operation of typical complexity in the range to
for an
matrix, the size of which increases with the number of infections and thinned infection events. We found that the number of thinned events was often between two and three times larger than the number of infections. In practice we found that a data set with around 100 infections took around 1–2 h to run, increasing to 3–5 h for 200 infections. It is possible that recent developments in matrix inversion (e.g. Chen and Yi, 2016) could be fruitfully applied to reduce computation times. Nevertheless, as demonstrated in the measles example, our methods can be adapted via suitable approximating models.
Funding
This work supported by the University of Nottingham (X.X.), the European Union's Seventh Framework Programme FP7-Health-2012-INNOVATION [grant agreement number 305280 (MIMOmics)] (X.X.), and the UK Engineering and Physical Sciences Research Council [grant number EP/J013528/1] (T.K.). Funding to pay the Open Access publication charges for this article was provided by Research Councils UK.
Supplementary Material
Acknowledgements
We thank the reviewers and Associate Editor for suggestions which have greatly enhanced the paper. Conflict of Interest: None declared.
References
- Adams R. P., Murray I., MacKay D. J. C. (2009) Tractable Nonparametric Bayesian Inference in Poisson Processes with Gaussian Process Intensities. New York: ACM Press. [Google Scholar]
- Bailey N. T. J. (1975) The Mathematical Theory of Infectious Diseases and its Applications, 2nd edition London: Griffin. [Google Scholar]
- Bailey N. T. J., Thomas A. S. (1971). The estimation of parameters from population data on the general stochastic epidemic. Theoretical Population Biology 2, 253–70. [DOI] [PubMed] [Google Scholar]
- Becker N. G. (1989) Analysis of Infectious Disease Data. London: Chapman and Hall. [Google Scholar]
- Becker N. G., Hopper J. L. (1983). Assessing the heterogeneity of disease spread through a community. American Journal of Epidemiology 117, 362–374. [DOI] [PubMed] [Google Scholar]
- Becker N. G., Yip P. S. F. (1989). Analysis of variation in an infection rate. Australian Journal of Statistics 31, 42–52. [Google Scholar]
- Boys R. J., Giles P. R. (2007). Bayesian inference for stochastic epidemic models with time-inhomogeneous removal rates. Journal of Mathematical Biology 55, 223–247. [DOI] [PubMed] [Google Scholar]
- Cauchemez S., Ferguson N. M. (2008). Likelihood-based estimation of continuous-time epidemic models from time-series data: application to measles transmission in London. Journal of the Royal Society Interface 525, 885–897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen F., Huggins R. M., Yip P. S., Lam K. F. (2008). Nonparametric estimation of multiplicative counting process intensity functions with an application to the Beijing SARS epidemic. Communications in Statistics: Theory and Methods 37, 294–306. [Google Scholar]
- Chen K., Yi C. (2016). Robustness analysis of a hybrid of recursive neural dynamics for online matrix inversion. Applied Mathematics and Computation 273, 969–975. [Google Scholar]
- Dureau J., Kalogeropoulos K., Baguelin M. (2013). Capturing the time-varying drivers of an epidemic using stochastic dynamical systems. Biostatistics 14, 541–555. [DOI] [PubMed] [Google Scholar]
- Eichner M., Dietz K. (2003). Transmission potential of smallpox: estimates based on detailed data from an outbreak. American Journal of Epidemiology 158, 110–117. [DOI] [PubMed] [Google Scholar]
- Fang C. T., Hsu H. M., Twu S. J., Chen M. Y., Hwang J. S., Wang J. D., Chuang C. Y.. theDivision of AIDS and STD, Center forDiseaseControl, Department ofHealth, ExecutiveYuan (2004). Decreased HIV transmission after a policy of providing free access to highly active antiretroviral therapy in Taiwan. Journal of Infectious Diseases 1905, 879–885. [DOI] [PubMed] [Google Scholar]
- Fearnhead P., Meligkotsidou L. (2004). Exact filtering for partially observed continuous time models. Journal of the Royal Statistical Society Series B 66, 771–789. [Google Scholar]
- Gibson G., Renshaw E. (1998). Estimating parameters in stochastic compartmental models using Markov chain methods. IMA Journal of Mathematics Applied in Medicine and Biology 15, 19–40. [PubMed] [Google Scholar]
- Hayakawa Y., O'Neill P. D., Upton D., Yip P. S. F. (2003). Bayesian inference for a stochastic epidemic model with uncertain numbers of susceptibles of several types. Australian and New Zealand Journal of Statistics 45, 491–502. [Google Scholar]
- Ionides E. L., Breto C., Yip A. A. (2006). Inference for non-linear dynamical systems. Proceedings of the National Academy of Sciences 10349, 18438–18443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lau E. H. Y., Yip P. S. F. (2008). Estimating the basic reproductive number in the general epidemic model with an unknown initial number of susceptible individuals. Scandinavian Journal of Statistics 35, 650–663. [Google Scholar]
- Lekone P. E., Finkennstädt B. F. (2006). Statistical inference in a stochastic SEIR model with control intervention: Ebola as a case study. Biometrics 62, 1170–1177. [DOI] [PubMed] [Google Scholar]
- Lewis P. A. W., Shedler G. S. (1979). Simulation of a nonhomogeneous Poisson process by thinning. Naval Research Logistics Quarterly 26, 403–413. [Google Scholar]
- McKinley T., Cook A. R., Deardon R. (2009). Inference in epidemic models without likelihoods. The International Journal of Biostatistics 51, Article 24. [Google Scholar]
- O'Neill P. D., Balding D. J., Becker N. G., Eerola M., Mollison D. (2000). Analyses of infectious disease data from household outbreaks by Markov chain Monte Carlo methods. Journal of the Royal Statistical Society Series C 494, 517–542. [Google Scholar]
- O'Neill P. D., Roberts G. O. (1999). Bayesian inference for partially observed stochastic epidemics. Journal of the Royal Statistical Society Series A 162, 121–129. [Google Scholar]
- Pollicott M., Wang H., Weiss H. (2012). Extracting the time-dependent transmission rate from infection data via solution of an inverse ODE problem. Journal of Biological Dynamics 6, 509–523. [DOI] [PubMed] [Google Scholar]
- Rasmussen C. E., Williams C. K. I. (2006) Gaussian Processes for Machine Learning. Cambridge, Massachusetts: MIT Press. [Google Scholar]
- Rida W. N. (1991). Asymptotic properties of some estimators for the infection rate in the general stochastic epidemic model. Journal of the Royal Statistical Society Series B 53, 269–283. [Google Scholar]
- Thompson D., Foege W. (1968) Faith Tabernacle Smallpox Epidemic. Abakaliki, Nigeria: World Health Organization. [Google Scholar]
- Xu X. (2015). Bayesian nonparametric inference for stochastic epidemic models, [Ph.D. thesis]. University of Nottingham. http://eprints.nottingham.ac.uk/29170/.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.