Skip to main content
International Journal of Environmental Research and Public Health logoLink to International Journal of Environmental Research and Public Health
. 2019 Apr 23;16(8):1442. doi: 10.3390/ijerph16081442

Addendum: Wang et al. A Bayesian Approach to Real-Time Monitoring and Forecasting of Chinese Foodborne Diseases. Int. J. Environ. Res. Public Health, 2018, 15(8):1740; doi:10.3390/ijerph15081740

Xueli Wang 1, Moqin Zhou 1, Jinzhu Jia 2, Zhi Geng 3, Gexin Xiao 4,*
PMCID: PMC6518211

The authors wish to update the Abstract and Section 3 in their paper published in the International Journal of Environmental Research and Public Health (IJERPH) [1].

They would like to rewrite the abstract as follows:

Abstract: Foodborne diseases have a big impact on public health and are often underreported. This is because a lot of patients delay treatment when they suffer from foodborne diseases. In Hunan Province (China), a total of 21,226 confirmed foodborne disease cases were reported from 1 March 2015 to 28 February 2016 by the Foodborne Surveillance Database (FSD) of the China National Centre for Food Safety Risk Assessment (CFSA). The purpose of this study was to make use of the daily number of visiting patients to forecast the daily true number of patients. Our main contribution is that we take the reporting delays into consideration and apply a Bayesian hierarchical model for this forecast problem. The data shows that there were 21,226 confirmed cases reported among 21,866 visiting patients, a proportion as high as 97%. Given this observation, the Bayesian hierarchical model was established to predict the daily true number of patients using the number of visiting patients. We use several scoring rules to assess the performance of different nowcasting procedures. We conclude that Bayesian nowcasting with consideration of right truncation of the reporting delays has a good performance for short-term forecasting and could effectively predict the epidemic trends of foodborne diseases. Meanwhile, this approach could provide a methodological basis for future foodborne disease monitoring and control strategies, which are crucial for public health.

In the end of the first paragraph in Section 3, the authors would like to update the last two sentences of the paragraph and add some citations. The revised sentences are as follows:

In this paper, we apply a Bayesian nowcasting model proposed by Höhle and an der Heiden [2] to forecast the daily total number of cases. Thanks to Salmon et al. [3], who provided a convenient R package “surveillance”, the inference for the model could be easily performed. The R package surveillance also contains a few other nowcasting methods that we also tried and did comparisons with using the scoring rules implemented in the package. The results are shown in Section 4. Below we review the model.

The authors revised Section 3.2 to describe the inference approach proposed by Höhle and an der Heiden (in Section 3.2 of [2]) in some greater detail. This part is now as follows:

For the convenience of the reader, we describe the inference approach proposed by Höhle and an der Heiden (in Section 3.2 of [2]) in some greater detail.

Define pd as the (time-homogeneous) probability that a case will have a reporting delay of d days. The pd’s satisfy the following equation: d=0Dpd=1. Following Kalbfleisch and Lawless’s previous work [4] and Zeger et al.’s previous work [5], we assume that the occurrence time of cases follows an underlying inhomogeneous Poisson process. A reasonable data generating process for the daily number of cases is thus as follows:

Nt|λtPo(λt)(nt,D,nt,D1,,nt,0)|Nt,pMN(Nt,(pD,pD1,,p0))

where Po(λ) denotes the Poisson distribution with expectation λ>0 and MN(N,p) denotes the multinomial distribution with size parameter N and probability vector p. Nowcasting for a given time T can thus be divided into steps of determining the λt’s, estimating the unknown delay distribution (i.e., the pd’s), and finally predicting the unobserved nt,d’s in order to compute the total Nt. As T increases, and if the assumption about a time-homogeneous delay distribution is acceptable, the available data make it possible to estimate the delay distribution better and better, and hence the quality of the predictions near T improves with time.

Consider a fixed time T and define pT=(pT,D,pT,D1,,pT,1) as the probability vector denoting that a case is reported with a delay of d days given the observed incomplete information at time T, i.e., the set of nt,d, where t+dT. We choose as prior distribution the generalised Dirichlet distribution GD(α,β) with fixed constants α=(α0,α1,,αD) and β=(β0,β1,,βD). Now we use Property 3 in the Web appendix of [2] that shows that the posterior of p under right-truncated multinomial sampling is again a GD distribution with parameters αT*,βT* given by

αT,i*=αi+τ=0TD+inτ,DiβT,i*=βi+τ=0TD+i(Nτ,τ+Dinτ,Di),  i{0,,D1}.

hence, for a given T we can assume the following model hierarchy for the time points t{TD,,T}:

pTGD(αT*,βT*)λtGa(aλ,bλ)Nt|λtPo(λt)Nt,T|Nt,pTBin(Nt,qT,Tt),

where qT,d=δ=0dpT,δ is the proportion reported within a delay of d days. We denote by Ga(aλ,bλ) the gamma distribution with parameters aλ>0,bλ>0. For this hierarchical model, the marginal distribution of Nt is a negative binomial distribution with the following mean and variance:

E(Nt)=μλ=aλbλVar(Nt)=μλ+μλ2aλ.

To estimate Nt given the observed counts nt,d at time T, we have to perform two steps: (1) update the delay distribution qT and (2) update the prediction for Nt:

  • (1)
    For the given T we compute αT*,βT* as stated above. We then draw for k=1,,K random vectors pT(k)GD(α*T,βT*) by the algorithm of Wong [6] and calculate
    qT,d(k)=δ=0dpT,δ(k).
  • (2)

    Given the updated delay distribution qT(k) and the observed counts nt,d, we can now update the prediction of Nt,t=TD,,T.

    For n{0,1,2,} we approximate by Monte Carlo sampling
    f(Nt=n|Nt,T)1Kk=1Kfn,t(k),  t{TD,,T}
    An application of Bayes theorem provides fn,t(k)=f~n,t(k)/ct(k), where ct(k)=n=0f~n,t(k) is the normalization constant and
    f~n,t(k)=f(Nt,T|Nt=n,qT,Tt(k))f(Nt=n|λt)f(λt)
    for all t{TD,,T}. The factors of the last equation can be evaluated using the distributional assumptions of the model hierarchy. For numerical convenience we do not sum over the entire support {0,1,2,} to get the normalization, but instead approximate
    ct(k)n=0Nmaxf~n,t(k),
    where Nmax is chosen sufficiently large.

Finally, the authors would like to update “proposed in this paper” to “described in this paper”.

The changes do not affect the results. The manuscript will be updated and the original will remain online on the article webpage, with a reference to this addendum.

References

  • 1.Wang X., Zhou M., Jia J., Geng Z., Xiao G. A Bayesian Approach to Real-Time Monitoring and Forecasting of Chinese Foodborne Diseases. Int. J. Environ. Res. Public Health. 2018;15:1740. doi: 10.3390/ijerph15081740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Höhle M., An der Heiden M. Bayesian nowcasting during the STEC O104:H4 outbreak in Germany, 2011. Biometrics. 2014;70:993–1002. doi: 10.1111/biom.12194. [DOI] [PubMed] [Google Scholar]
  • 3.Salmon M., Schumacher D., Höhle M. Monitoring Count Time Series in R: Aberration Detection in Public Health Surveillance. J. Stat. Softw. 2016;70:1–35. doi: 10.18637/jss.v070.i10. [DOI] [Google Scholar]
  • 4.Kalbfleisch J.D., Lawless J.F. Inference Based on Retrospective Ascertainment: An Analysis of the Data on Transfusion-Related AIDS. J. Am. Stat. Assoc. 1989;84:360–372. doi: 10.1080/01621459.1989.10478780. [DOI] [Google Scholar]
  • 5.Zeger S.L., See L.C., Diggle P.J. Statistical methods for monitoring the AIDS epidemic. Stat. Med. 1989;8:3–21. doi: 10.1002/sim.4780080104. [DOI] [PubMed] [Google Scholar]
  • 6.Wong T.T. Generalized Dirichlet distribution in Bayesian analysis. Appl. Math. Comput. 1998;97:165–181. doi: 10.1016/S0096-3003(97)10140-0. [DOI] [Google Scholar]

Articles from International Journal of Environmental Research and Public Health are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES