Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2019 May 29;9:8046. doi: 10.1038/s41598-019-44469-9

Development and evaluation of a deep learning approach for modeling seasonality and trends in hand-foot-mouth disease incidence in mainland China

Yongbin Wang 1,#, Chunjie Xu 2,#, Shengkui Zhang 1, Li Yang 1, Zhende Wang 1, Ying Zhu 1, Juxiang Yuan 1,
PMCID: PMC6541597  PMID: 31142826

Abstract

The high incidence, seasonal pattern and frequent outbreaks of hand, foot, and mouth disease (HFMD) represent a threat for millions of children in mainland China. And advanced response is being used to address this. Here, we aimed to model time series with a long short-term memory (LSTM) based on the HFMD notified data from June 2008 to June 2018 and the ultimate performance was compared with the autoregressive integrated moving average (ARIMA) and nonlinear auto-regressive neural network (NAR). The results indicated that the identified best-fitting LSTM with the better superiority, be it in modeling dataset or two robustness tests dataset, than the best-conducting NAR and seasonal ARIMA (SARIMA) methods in forecasting performances, including the minimum indices of root mean square error, mean absolute error and mean absolute percentage error. The epidemic trends of HFMD remained stable during the study period, but the reported cases were even at significantly high levels with a notable high-risk seasonality in summer, and the incident cases projected by the LSTM would still be fairly high with a slightly upward trend in the future. In this regard, the LSTM approach should be highlighted in forecasting the epidemics of HFMD, and therefore assisting decision makers in making efficient decisions derived from the early detection of the disease incidents.

Subject terms: Viral infection, Applied mathematics, Statistics

Introduction

Hand, foot and mouth disease (HFDM) is a common acute infectious disease in children, the majority (91%) of whom are under 5 years1. Most show mild symptoms mainly characterized by fever and rash on the hands, feet and mouth. A small minority have more severe complications, such as myocarditis, pulmonary edema and aseptic meningoencephalitis, some of which are fatal2. The infections are predominantly caused by coxsackievirus A16 (CVA16) and human enterovirus 71 (EV71), although other viruses can be involved3. The viruses are thought to be predominantly transmitted from child to child by direct and indirect contact, including droplets, droplet nuclei, dust, water and food1,4. Furthermore, approximately half of the individuals may be coinfected with more than one pathogenic agent5. HFMD was first reported in New Zealand in 1957, and since then, millions of cases and many outbreaks have been reported worldwide6. However, the worldwide epidemiology of HFMD has dramatically changed during the past decade, especially in East and Southeast Asian countries such as China, Brunei, Malaysia, Mongolia, Singapore and Vietnam7, where epidemics and numerous large-scale outbreaks of HFMD have occurred, resulting in enormous burdens of disease and global public health concerns3,8,9.

In mainland China, after the first reported case of HFMD in Shanghai in 198110, several outbreaks have been reported and have caused the deaths of numerous children3. Moreover, HFMD affects more than two million children annually in mainland China11, and the number of cases and deaths caused by HFMD sporadics, epidemics and outbreaks invariably tops the list of monitored class C diseases every year since HFMD was designated as a notifiable disease in 20083,12. Since 2016, although an available vaccine that only plays a role in the infection caused by the EV71 virus has been introduced to prevent HFMD13, the potentially worsening trend in the reported cases of HFMD has not been reversed. Importantly, it is estimated that there is an increasing risk of ongoing HFMD recurrence in China in recent years14 and HFMD still continues to exert a significant influence on the general susceptible population. Early detection and advanced warning for the timing, extent and duration of HFMD epidemics will be particularly valuable in formulating effective prevention and intervention strategies to minimize the damage caused by the infection15. Therefore, a reliable forecasting technique to track the temporal patterns of HFMD is needed.

The existing early warning models for forecasting the morbidity and mortality of infectious diseases mainly consist of linear and nonlinear models, along with their hybrids16. Additionally, the autoregressive integrated moving average (ARIMA) model is one of the best linear models in terms of performance for a specified time series17; the nonlinear autoregressive neural network (NAR) approach is among the nonlinear models with arbitrarily expected accuracy that can effectively extract the meaningful dynamic information of a data sequence16. Both the ARIMA and NAR models are well suited to study the future trends of the morbidity or mortality time series of diseases with stationary short-term dependencies based on the aggregated long trajectories16. However, long-term trajectory modeling, which is most often encountered in epidemiological prediction, is frequently characterized by non-stationary long-term lags over time. Additionally, when employing an NAR model to connect the preceding information located into the time-varying series to the present assignment, with the growing gap between the past inputs and estimated outputs, the NAR technique will encounter a vanishing or exploding gradient problem during training, which makes it difficult to develop the long-term dependence structure in a time series18. The long short-term memory (LSTM) architecture, a type of deep learning network that has been extensively studied and applied to quite a few frontier fields, such as voice recognition19, video classification20 and speech synthesis systems21, comprises a cluster of recurrently connected subnets that allow the LSTM method to store and access information over long periods of time, hence mitigating the vanishing or exploding gradient problem22. However, there is a current lack of research focusing specifically on the applicability of LSTM model in the forecasting of infectious diseases with time series analysis. Therefore, motivated by the merits of the LSTM model, the burden of HFMD and the persistently high incidence in mainland China, we aim to forecast the notified incident cases of HFMD with a LSTM model. Meanwhile, the simulating and predictive abilities of the LSTM model were compared with two especially useful estimation models, including the ARIMA and NAR methods, to seek the best-fitting time series modeling technique for HFMD, which will be of great help in initiating guidance planning and effective intervention measures for HFMD-prevention in mainland China.

Results

General information

The time series of notified cases of HFMD included 121 observations from June 2008 to June 2018, and a total of 19,218,824 cases were reported. The monthly average of case notifications was 158,834, which led to an annualized average morbidity rate of 133,894 cases per 100,000 population with a standard error of 10,825 cases during the whole period. Between June 2008 and June 2017, 17,072,500 cases occurred, and the number of morbidity cases increased from 102,223 to 308,789, with an overall increase of 202.074% throughout the past decade. The highest incidence peak of 528,777 cases was observed in May 2014, which was a marginal increase of 417.278% compared to June2008. When the Hodrick-Prescott (HP) decomposition approach was performed to smooth the short-term monthly effect of the observed incidence case series from June 2008 to June 2018, it was found that there were clear seasonal peaks, specifically in May and June of every year, a trough in January and February and an evident 12-month cyclical process. In addition, notwithstanding a slight decline that existed between January 2014 and June 2018, there were still a substantially large number of reported cases each year (Fig. 1 and Supplementary Fig. S1).

Figure 1.

Figure 1

Time series of monthly HFMD observed series and decomposition using Hodrick-Prescott filter in mainland China from June 2008 to June 2018.

The best-performing SARIMA model

Before modeling, the original in-sample observations were examined using the augmented Dickey-Fuller (ADF) test (ADF = −2.3318, P = 0.1642), which is indicative of an obviously non-stationary morbidity case series. Therefore, according to the results from the HP filter and ADF test, the first-order seasonal and non-seasonal differences were used to stabilize the variance and mean to suit the modeling requirement of a stationary sequence (ADF = −4.4129, P = 0.0006). Analysis of the spikes in the ACF and PACF plots at varying lags with the transformed incidence case series for HFMD resulted in the selection of several candidate models by trial and error to further detect the best-performing specification (Supplementary Figs S2 and S3). Finally, taking the error correlations between the ACF and PACF plots comprehensively and taking the AIC, AICc and SBC into consideration (Fig. 2 and Supplementary Table S1), the preferred model of SARIMA(1,1,2)(1,1,0)12 was generated with minimized AIC, AICc and SBC values of 2339.41, 2340.08 and 2352.23, respectively. When the error correlations at lags fell into the estimated 95% confidence bounds (Fig. 2b,c), the Ljung-Box Q test showed that the residuals from the SARIMA model obtained desirable white noise (Fig. 2d and Table 1), and the LM test suggested that no ARCH effect was found at various lags in the residual series (Table 2). Furthermore, the test results of the estimated parameters were also all significant (Supplementary Table S1). Nevertheless, the only complication was that the Q-Q plot of the residuals showed a clear departure from normality at the tails (Supplementary Fig. S4). The specified equation of the SARIMA model was expressed as (1-B)(1-B12)Xt = (1 + 0.524B + 0.394B2t/(1 − 0.434B)(1 + 0.6B12). In the same way, the reported case series of HFMD from June 2008 to December 2016 and December 2017 was utilized to account for the robustness of the model. The best-fitting SARIMA method constructed using the first 103 in-sample observations was identified as a SARIMA(1,0,1)(1,1,1)12 form, and Supplementary Figs S5S8 and Tables S2S5 provide the results of the diagnostic tests for this optimal SARIMA approach; the best-simulating SARIMA approach obtained with the first 115 observed points was still identified as a SARIMA(1,0,1)(1,1,1)12 form, and the identified parameters and diagnostic tests for the preferred approach are given in Supplementary Figs S9S12 and Tables S6S9. Then, these optimal methods chosen were utilized to calculate out-of-sample predictions.

Figure 2.

Figure 2

The resulting plots of fit goodness tests from SARIMA(1,1,2)(1,1,0)12 model for HFMD notified cases series. (a) Standardized residuals. (b) Autocorrelation function (ACF) graph of errors across varying lag times. None of the autocorrelation coefficients are beyond the 95% confidence intervals in this residual series. (c) Partial autocorrelation function (PACF) graph of errors. (d) Q-statistic P-values. There are large P values at the significance level of 5%. Diagnostic checking indicates the chosen SARIMA specification can provide a reasonable approximation to the HFMD notified cases series.

Table 1.

Ljung-Box Q test of the residuals for the selected three optimal models at different lags.

Lags SARIMA model NAR model LSTM model
Box-Ljung Q P Box-Ljung Q P Box-Ljung Q P
1 0.046 0.830 0.963 0.326 1.324 0.250
3 0.319 0.956 3.489 0.322 2.818 0.421
6 1.578 0.954 5.051 0.537 3.143 0.791
9 3.295 0.952 6.941 0.643 3.441 0.944
12 8.170 0.772 7.442 0.827 7.141 0.848
15 11.330 0.729 11.443 0.721 12.800 0.618
18 13.122 0.784 11.626 0.866 15.252 0.645
21 14.728 0.836 14.941 0.826 15.355 0.805
24 18.265 0.790 16.648 0.863 16.889 0.853
27 26.991 0.464 17.448 0.920 19.358 0.857
30 29.634 0.485 19.393 0.931 21.247 0.880
33 31.664 0.534 21.815 0.932 23.716 0.883
36 50.369 0.056 24.575 0.925 27.236 0.853

Table 2.

ARCH effect of the observations and residuals of the selected three models with LM test at various lags.

Lags Observed values SARIMA model NAR model LSTM model
LM-test P LM-test P LM-test P LM-test P
1 54.599 <0.001 1.869 0.172 0.262 0.609 0.002 0.961
3 73.920 <0.001 2.669 0.446 2.823 0.420 0.440 0.932
6 72.753 <0.001 3.575 0.734 2.972 0.812 3.338 0.765
9 71.185 <0.001 4.703 0.859 14.741 0.098 5.216 0.815
12 73.136 <0.001 14.080 0.296 11.422 0.493 7.294 0.838
15 71.275 <0.001 14.566 0.483 30.219 0.011 10.241 0.804
18 69.282 <0.001 15.566 0.623 32.252 0.020 13.735 0.746
21 67.505 <0.001 20.645 0.481 34.845 0.029 0.7462 0.881
24 73.785 <0.001 21.094 0.633 34.774 0.072 14.815 0.926
27 71.487 <0.001 25.263 0.560 36.360 0.108 18.172 0.898
30 68.830 <0.001 27.984 0.571 38.706 0.133 19.899 0.919
33 66.153 <0.001 32.378 0.498 41.990 0.136 22.538 0.915
36 64.210 <0.001 33.766 0.575 46.280 0.117 23.631 0.944

The best-performing NAR model

To obtain an optimum NAR model, the hidden units and feedback delays, ranging from 10 to 35 and from 2 to 8, respectively, were iteratively examined within in-sample data points. Ultimately, we identified the best-fitting model with 18 hidden neurons and 5 feedback delays dependent on the comprehensive optimum performance indices aside from the fact that a fat-tailed distribution, compared to the normal distribution, should be utilized (Supplementary Fig. S13). As presented in Supplementary Fig. S14, in the preferred model, the minimum MSE values for the training, validation and testing datasets and for the entire dataset were 0.0011, 0.0074, 0.0144 and 0.0038, respectively; the maximum R values of the training, validation, testing subsets and entire dataset were 0.987, 0.931, 0.920 and 0.963, respectively. Moreover, the input-to-error correlations and autocorrelations of the produced residuals were never beyond the estimated 95% uncertainty limits around zero across varying lag times, apart from the one in the ACF plot at lag zero that should occur (Fig. 3 and Table 1). The LM test suggested that the ARCH effect that existed in the original data largely minimized the errors of the NAR model (Table 2). In addition, the response plot of the estimated values from the randomly selected training, validation and testing datasets against their corresponding original observations at different time points showed that the optimal approach could simulate the data points included in the three grouped subsets well because of the small residuals that were mostly located between −0.2 and 0.2 (Fig. 4). Similarly, according to the modeling steps described above, in these two robustness-test datasets, the best-presenting technique fit to the dataset between June 2008 and December 2016 was such an NAR model with 17 hidden neurons and 5 feedback delays. The identified layer architecture and statistical measures for this preferred network are displayed in Supplementary Figs S15S19 and Tables S4, S5 and S10. The best-fitting model developed utilizing the data from June 2008 to December 2017 was an NAR model with 19 hidden neurons and 6 feedback delays, and with regard to the optimal network, all further diagnostic results can be seen in Supplementary Figs S20S24 and Tables S8S10. Afterwards, these best-performing networks were employed to conduct out-of-sample forecasting, and the simulated and forecasted values obtained were back-transformed to the original scale because they were computed on the transformed scale.

Figure 3.

Figure 3

The resulting plots of fit goodness tests from the best-fitting NAR model for HFMD notified cases series. (a) Standardized residuals. (b) Autocorrelation function (ACF) plot of errors across varying lag times. All of the autocorrelations fail to be beyond the estimated 95% uncertainty bounds around zero across varying lag times apart from the one from ACF plot at zero lag that should occur. This manifests that the network appears to have captured the dependence hidden behind the HFMD notified cases series. (c) Input-to-error correlation plot for varying lags. The input-error cross-correlation function illustrates how the residuals are interrelated with the series of x(t). All of the correlations fall within the confidence bounds around zero, which hints the developed model is a perfect specification. (d) Q-statistic P-values. Analyses form the plots demonstrate the constructed model is adequate in excavating the information of this time series.

Figure 4.

Figure 4

The response of output and target for HFMD time series at various time points. This plot exhibits which time points are elected as the training, validation and testing subsets, along with their corresponding errors between inputs and targets. In view of the small errors, a further suggestion that the fitting is fairly accurate.

The best-performing LSTM model

Generally, the LSTM network with 1 hidden layer surrounded by 1 to 7 hidden units can satisfy the need for time series modeling. Consequently, to attain the optimal modeling parameters for the HFMD series, the range of time steps was set to 1 to 20, and the LSTM model was conducted repeatedly utilizing the activation sigmoid function with time steps ranging from 1 to 20 together with a batch size of 1 using the Adam optimizer technique to stabilize the argument updates to help minimize the loss function of the RMSE. All candidate models were iterated through 300 epochs. Ultimately, we identified that the best-simulating model with 1 hidden layer containing 5 hidden neurons and 11 time steps relied on a minimum training score of RMSE = 0.0031 and a testing score of RMSE = 0.0038 and with maximum R values of the training and validation subsets and of the overall data (0.972, 0.982 and 0.974, respectively) (Supplementary Fig. S25). In addition, as presented in Fig. 5, no overfitting occurred during the training process because of the similar downward trend between the testing and validation datasets before 300-step iterations. The ACF plot of the produced errors revealed no individually evident autocorrelation at varying lags except for the one at lag zero (Fig. 6b), as shown in the normal Q-Q plot (Supplementary Fig. S26), indicating that the forecasted residuals of the LSTM model were normally distributed to a great extent. The Ljung-Box Q test showed that errors did not depart from the assumptions of stochastic white noise (Fig. 6d and Table 1), and the LM test demonstrated that the volatility that existed in the actual observations was essentially eliminated in the residuals of the selected LSTM model as well (Table 2). Therefore, the model chosen is adequate in capturing the dynamic dependences of this time series. Likewise, the in-sample observations used to test the robustness of the model were applied to determine the optimal LSTM network as noted earlier: the network with 1 hidden layer including 6 hidden neurons and 12 time steps was constructed based on the data series from June 2008 to December 2016 was the best-performing; the results of the diagnostic tests for this network are summarized in Supplementary Figs S27S30 and Tables S4, S5 and S11. The network with 1 hidden layer containing 5 hidden neurons and 11 time steps built on the basis of the data series from June 2008 to December 2017 should be regarded as the best-performing. Supplementary Figs S31S34 and Tables S8, S9 and S11 offer systematic diagnostic tests for this preferred network. After the optimal LSTM approaches are selected, they can be employed to predict the epidemic trends of HFMD in the upcoming years.

Figure 5.

Figure 5

The training and validation performances for LSTM model at 300 epochs. This plot documents that no overfitting is observed during the training process due to the similar downward trend before 300-step iterations.

Figure 6.

Figure 6

The resulting plots of fit goodness tests from the LSTM model for HFMD notified cases series. (a) Standardized residuals. (b) Autocorrelation function (ACF) plot of errors across varying lag times. The ACF plot of forecasted errors reveals no individually evident autocorrelation at varying lags except for the two points occurring at lags 11 and 13. For these two lagged points out of the estimated 95% confidence limit, they are also reasonable as this phenomenon can easily happen by chance alone. (c) Partial autocorrelation function (PACF) plot of residuals. (d) Q-statistic P-values. As shown, All P-values are larger than 0.05. These diagnostics manifest that the network is well suited to the dataset.

Comparative analysis

Multiple statistical measures were applied to compare the in-sample simulation and out-of-sample predictive accuracies among the three methods. Compared to the SARIMA and NAR models, the minimum values of measures concerning the facets of training and testing were observed in the LSTM technique, aside from the MAE in the simulated stage of the NAR model, and were fitted to the robustness-test dataset from June 2008 to December 2017 (Table 3). For these three constructed models, overall, the curves simulated and predicted by the LSTM model were closer in proximity to the actual values as well (Fig. 7), which further implies that the epidemic trajectories of HFMD can be captured reasonably well by the LSTM technique. Hence, the LSTM technique was re-modeled to recursively achieve multistep-ahead predictions from July 2018 to December 2020 (Table 4); it appears that a slightly potential rising risk in the incident cases of HFMD will be observed in the forecasting period. In the meantime, the uncertainty bands for the resultant forecasts were estimated utilizing simulation of 100 future possible paths by performing bootstrapping with the number of samples of 100023,24 (Table 4 and Supplementary Table S12).

Table 3.

The comparison results of in-sample simulating and out-of-sample predicted performances for the three models.

Models Simulated performance Predicted performance
MAE MAPE RMSE MAE MAPE RMSE
In-sample observations from June 2008 to December 2017 6 out-of-sample predictions
SARIMA 30610.689 0.223 44383.404 40101.500 0.387 51150.446
NAR 19652.784 0.222 29740.835 21004.106 0.361 27364.566
LSTM 21548.164 0.203 27716.239 19505.800 0.221 25820.167
Reduced percentages (%)
LSTM vs. SARIMA 29.606 8.969 37.553 51.359 42.894 49.521
LSTM vs. NAR −6.192 8.520 4.562 3.736 36.176 3.019
In-sample observations from June 2008 to June 2017 12 out-of-sample predictions
SARIMA 30268.335 0.275 43330.873 41542.355 0.366 49978.944
NAR 20678.368 0.272 31998.251 61130.494 0.373 105803.807
LSTM 20584.538 0.192 26364.913 32451.557 0.262 41678.916
Reduced percentages (%)
LSTM vs. SARIMA 31.993 30.182 39.154 21.883 28.415 16.607
LSTM vs. NAR 0.310 29.091 13.001 69.035 30.328 128.304
In-sample observations from June 2008 to December 2016 18 out-of-sample predictions
SARIMA 30136.495 0.211 43456.811 43434.889 0.479 53852.232
NAR 19957.815 0.186 29345.366 73291.494 0.629 88387.215
LSTM 19931.051 0.176 25155.016 39729.413 0.351 50951.064
Reduced percentages (%)
LSTM vs. SARIMA 33.864 16.588 42.115 8.531 26.722 5.387
LSTM vs. NAR 0.089 4.739 9.643 77.270 58.038 69.516

Figure 7.

Figure 7

The comparisons of in-sample simulations and out-of-sample predictions among these three models selected. (a) Comparison between the actual observations and the results from the SARIMA model. (b) Comparison between the actual observations and the results from the NAR model. (c) Comparison between the actual observations and the results from the LSTM model. The shaded area represents the validation sets from July 2017 to June 2018, in which the comparative results between the values predicted by the three selected models and the actual suggest that the curve forecasted by LSTM model is more proximal to the actual. As presented in (c), the red dotted line stands for the trends from July 2018 to December 2020 projected by the LSTM method.

Table 4.

The predictive incident cases of HFMD using the best-presenting LSTM technique from July 2018 to December 2020.

Time Predictions 95% uncertainty bands Time Predictions 95% uncertainty bands
Jul-2018 417719 [366119, 469320] Oct-2019 179164 [120755, 237573]
Aug-2018 232278 [192745, 271810] Nov-2019 197632 [153917, 241346]
Sep-2018 120464 [73475, 167453] Dec-2019 116959 [80180, 153739]
Oct-2018 164881 [114495, 215267] Jan-2020 36199 [−650, 73047]
Nov-2018 188281 [153866, 222697] Feb-2020 2426 [−38061, 42912]
Dec-2018 122213 [92903, 151522] Mar-2020 115231 [61218, 169245]
Jan-2019 25824 [−12713, 64359] Apr-2020 210295 [144546, 276044]
Feb-2019 8856 [−29540, 47252] May-2020 240488 [159436, 321540]
Mar-2019 118917 [66934, 170900] Jun-2020 460614 [397661, 523566]
Apr-2019 176434 [114298, 238571] Jul-2020 419429 [342643, 496216]
May-2019 165857 [107939, 223776] Aug-2020 232450 [174459, 290441]
Jun-2019 339025 [299026, 379023] Sep-2020 121814 [67407, 176222]
Jul-2019 369045 [309402, 428689] Oct-2020 168242 [108510, 2279730]
Aug-2019 272607 [207759, 337456] Nov-2020 194318 [145616, 243020]
Sep-2019 140126 [81893, 198359] Dec-2020 126442 [85527, 167356]

Discussion

Since 2008, HFMD has regularly captured wide attention owing to both its high incidence and potential health hazards for millions of children, along with substantial losses to economy every year in mainland China8. It is imperative that specific control and intervention planning be introduced and set by the related public health agencies to handle such a wide-ranging issue. However, vital to any initiation planning of HFMD is an accurate prediction of its future temporal patterns. Early detection of HFMD epidemics based on models such as SARIMA and NAR has been a profitable technology for facilitating prevention strategies more effectively10,11,17. Therefore, in view of the LSTM model’s flexible capacity to learn what to store and what to abandon during information-processing25, to the best of our knowledge, this attempt is the first using an LSTM approach to model the long trajectory behaviors of HFMD incidence in mainland China. Our results imply that the LSTM method has the potential to obtain a clearer perspective of epidemic trends than the SARIMA and NAR models built on the specified predictive horizons. Notably, the LSTM method can make the outcome measurements of MAE, MAPE and RMSE markedly drop by 31.993%, 30.182% and 39.154%, respectively, in the training dataset, and can make their counterparts in the testing dataset decrease by 21.883%, 28.415% and 16.607%, respectively, compared to the SARIMA model. In contrast to the NAR model, the decreased percentages in the training subset for the three indices listed above are 0.310%, 29.091% and 13.001%, respectively, and their counterparts in the testing subset are 69.035%, 30.328% and 128.304%, respectively. Likewise, according to Fig. 7, it was found that the upward or downward trend simulated and forecasted by the LSTM model provides a more reasonable approximation to the reported points, especially for the identification of high incidence peaks, indicating that the LSTM model can adequately capture the essence of the dependence hidden behind the notified case series of HFMD. Similarly, in the two datasets used to account for the uncertainty in the model, the accuracy measurement indices in the LSTM approach also display the lowest error rates among these chosen, optimal methods, particularly in the 18-step ahead predictions, and the performances are as good as can be expected. The scale-dependent measure of MAE showed a lower value in the simulated stage of the NAR method than in the LSTM technique in the 6-step predictions; this result mainly arises from the fact that there are several simulated values that are far away from the observed values. Moreover, it was confirmed that the NAR model is effective in capturing the short-term dynamics of the data. In short, in comparison with the SARIMA and NAR models, the established LSTM approach not only can better explain the seasonal and trend characteristics of HFMD but also is robust with respect to medium-term and long-term forecasts. Apparently, this network can act as an effective tool for recognizing the temporal levels of HFMD incident cases in mainland China in upcoming years. Similar to the recent literature, which has also found that the LSTM method provides good forecasting power for air pollutant concentrations26, financial time series27 and harmful algal blooms in rivers28. From this point of view, our LSTM technique appears to be worthy of being popularized for forecasting the incidence case series of HFMD in other settings in China and even a wide range of simulation applications, such as for all types of contagious diseases or in all time series analyses; however, this conclusion requires further verification. It should, however, be noted that with the increasing development of hybrid techniques, numerous combined methods incorporating linear approaches such as the SARIMA method17, the gray GM(1,1) model29, the error-trend-seasonal model30 and the exponential smoothing model31 and nonlinear techniques such as the back propagation neural network approach32, the generalized regression neural network method33 and the radical basis function technique32 have already been adopted to serve as early warning tools for infectious diseases, and most have obtained satisfactory results. Consequently, much work will be required to explore the preferred models for detecting and analyzing HFMD morbidity cases in mainland China. In addition, in terms of the modeling measures (MAE, MAPE and RMSE), we found that the simulating and forecasting efficacies of the NAR method were slightly superior to the SARIMA method in the short-term (6-step) predictions, which is consistent with the earlier studies performed by Zhou et al.34,35, yet is incongruous with the study involving modeling the prevalence of schistosomiasis in Qianjiang16; in the medium-term (12-step) and long-term (18-step) prediction stages, the NAR method underperforms the SARIMA model in the testing dataset, but this result is not in line with previous work predicting the morbidity of hemorrhagic fever with renal syndrome36 and the daily number of new admission inpatients35. It seems possible that these contrasting results are due to the following: the various characteristics of infectious diseases from different regions and the NAR approach suffer from overfitting, which is a defect inherent in the ANN methods. However, during MATLAB training, a default technique of early stopping was adopted to improve generalization and avoid overfitting. Therefore, being further suggestive of the necessity of constructing forecasting techniques for different infectious diseases in various settings and at different time periods, it is superior to the NAR technique in short-term forecasting.

It is well established that accurate identification of high-risk seasonality plays a pivotal role in timely implementation of prevention strategies and the reasonable allocation of resources for HFMD6. In our report, HFMD could occur throughout the year, and larger epidemics could be regularly found every 2 to 3 years. Similar trajectory behaviors were also reported in the studies involved in the regions of Vietnam37, Malaysia38, Hong Kong39, Taiwan40 and Singapore41, but the underlying drivers fail to be fully elucidated. In our study, evident seasonal and cyclical components were observed with the aid of the HP method; for example, every year from April until July, there was peak activity accounting for 59.581% of all notified cases, among which May and June were of particular concern, as they accounted for 60.952% of cases occurring in high-risk seasons. However, between January and February annually, there was a dramatic decline in the reported cases. A similar seasonal distribution was also revealed in recent years in other countries, including Singapore41, Malaysia38 and most regions of China6,39,40,42, containing Hong Kong, Taiwan, Shenzhen, Ningbo, Shandong, Zunyi, Guangdong and Guangzhou6. Moreover, outbreaks commonly occurred during the 4 months as well42,43. Additionally, two peaks could be noted in our data from June 2008 to June 2018, the first and stronger peak primarily occurred during the high-risk season, and the weaker peak was chiefly observed from August to November annually; the appearance of the two peaks was also reported in another study of southern China6. This seasonal pattern is consistent with that of Hong Kong39, Taiwan40 and Vietnam37. The single peak was customarily observed in northern China44, and earlier studies on the temporal characteristics of HFMD in Japan3 and Malaysia38 matched that in northern China. This may be pertinent to the different viruses, geographical differences or changing risk (e.g., school attendance, temperature, humidity or other meteorological drivers). Furthermore, in studies of particular regions of China, the leading agents (EV71 and CVA16) are also distributed in various peaks45, where the pathogenic agent EV71 is predominant in the stronger peak months. By contrast, CVA16 is more inclined to circulate in the more vulnerable population. These two etiologic factors are notably attenuated in January and February. Regarding the seasonal variations, climatic factors are possibly responsible for such a discrepancy (e.g., the ability of the causative agents to survive outside the host, the variability in the behavior and immune level of the host by climatic factors, and the inclination of people to go outdoors in summer rather than in winter increases the chances of person-to-person contact causing the etiologic agents to more easily achieve transmission among humans by virtue of spreading-factors)6.

To understand the epidemic situation in advance of the coming years, the constructed LSTM model with the best-fitting and best-predicting performance was adopted to calculate forecasts for the next two years. The results indicated that although the estimated observations would not show a large amplitude of oscillations relative to the in-sample data obtained, HFMD morbidity cases remained high, among which the highest-risk seasonality seemed to occur in June and July. Similar to prior findings6, two apparent seasonal peaks will be observed separately in subsequent Junes and Octobers, in all probability. Thus, due attention and instant action should be paid to these months and a response should be prompted, such as health promotion education; prevention at and control of key locations, particularly in nurseries and schools; vaccination and financial support. In addition, the prevention and control strategies for the rest of the low-risk months should fail to be ignored. In summary, the expected number of cases of HFMD remain present and still comparatively large, demonstrating that China is still afflicted with a chronic threat of HFMD.

Some limitations should be acknowledged in this work. First, no theoretical guidance can be adopted to identify the optimum number of hidden units, feedback delays and other key parameters during the establishment of ANN models. In practice, they are frequently selected by trial and error, and the specific forecasting process is poorly understood. Second, estimating the 95% uncertainty bounds for the predictions remains an additional problem. Third, the aggregated HFMD incidence case data utilized were obtained from nationwide passive infectious disease surveillance. We thus fail to rule out artifactual monitoring biases (e.g., substantial underreporting, misdiagnosis and delay). Fourth, the statistical predictions do not take known drivers into account and lack any epidemiological data other than case numbers and months, owing to their unavailability. Therefore, whether further studies that take these variables into account will have the potential to boost the fit and predictive ability remains to be authenticated. Fifth, albeit the LSTM technique built can be considered to be an instrumental tool for the medium-long-term estimation of future trends in HFMD incidence case data, in applications, this network is expected to be updated in due course with the incident cases to ensure its superiority in predictive performance. Sixth, detailed data on HFMD notifications are missing (e.g., age and sex), which precludes further analysis in the present work. Lastly, the LSTM model was developed based only on nationwide monitoring data over the period from 2008 to2018. These results therefore need to be interpreted with caution, and the analytic results can represent only entire epidemics of HFMD on the Chinese mainland. Remodeling for the region-specific notified HFMD cases time series may act as guidance for the formulation of targeted public health strategies, and whether the model is appropriate to calculate predictions for other kinds of communicable diseases requires further study.

In conclusion, notwithstanding its flaws, our study does indicate that the LSTM model established can provide more accurate predictions, be it in the in-sample dataset or the out-of-sample dataset. For the HFMD notified case time series compared with the individual SARIMA and NAR models, the LSTM model may be a beneficial tool for the early detection and advanced warning of HFMD activities in mainland China and can allow the official government to allocate health resources effectively and appropriately formulate the preventive and control planning for HFMD. Additionally, the number of forecasted incident cases are still relatively large and indeed present in the imminent future, this issue warrants to be resolved urgently and strategically within the effective measures taken.

Materials and Methods

Data collection

In this study, the aggregated monthly and yearly reported cases of HFMD, available from June 2008 to June 2018, were obtained from the notifiable infectious disease monitoring system provided by the Chinese Center for Disease Control and Prevention (CDC) (http://www.nhfpc.gov.cn/jkj/s3578/new_list.shtml). A total of 121 observations covering 11 years were collated and summarized. Subsequently, the whole dataset was split into two blocks to build the models, among which the first 109 data points (from June 2008 to June 2017) were regarded as in-sample modeling horizons, while the remaining 12 data points (from July 2017 to June 2018) were considered as out-of-sample predictive horizons. Since the sample length and time periods adopted to construct the models might have an impact on the forecasting power, two additional data categories were provided to test the robustness of the models developed, among which the first 103 (from June 2008 to December 2016) and 115 data points (from June 2008 to December 2017) were considered as in-sample modeling horizons, while the other 18 (from January 2017 to June 2018) and 6 data points (from January 2018 to June 2018) were used as out-of-sample predictive horizons.

In China, HFMD is clinically diagnosed by physicians, and the laboratory confirmed the diagnosis dependent on the detection of specific nucleic acids, the isolation of enterovirus related to pathogenic factors and the detection of a fourfold change in neutralizing antibodies. In addition, verified cases must be registered within 24 hours, and duplicate cases must be deleted by professionals at the end of the same month. Ethical approval or consent is not required for our present study owing to the public availability of HFMD surveillance data in China.

Constructing the SARIMA model

The Box-Jenkins method of ARIMA(p, d, q) has been the most commonly used statistical forecasting technique for time series data that display no seasonality46. However, in applications, particularly in the morbidity time series of diseases, this time series frequently shows marked seasonal and cyclic tendencies30. Consequently, to avert losing significant series traits, a seasonal ARIMA method, specified as SARIMA(p, d, q)(P, D, Q)s, has been proposed to reveal data with those patterns47. In this model, the actual observation can be represented as a linear combination of the prior observation and the error sequence. As such, the secular change and seasonal variation of time series are captured in the SARIMA method as interpretable terms48. Although, of note, the linear SARIMA method can also model periodicity, the fitted cyclic change remains invariably symmetric. In our present study, considering the characteristics of HFMD incident case sequences containing evident cyclical and seasonality6, a typical SARIMA model will be a useful tool in predicting the future temporal trends17, among which the seasonal part of HFMD was taken for the predictors and the monthly HFMD incidence time series was used for the dependent variable. The final formula of a SARIMA model can be expressed as:

{φ(B)Φ(Bs)ΔdΔsDXt=θ(B)Θ(Bs)εtE(εt)=0,Var(εt)=σε2,E(εtεs)=0,stE(Xsεt)=0,s<t 1

where B refers to the backshift operator, ɛt denotes the errors from HFMD series, S signifies the length of seasonal cycle of HFMD notifications, d and D are the non-seasonal and seasonal differenced times, respectively. In the SARIMA model notation, p and q represent the orders of the non-seasonal autoregressive and moving average models, respectively; P and Q represent the orders of the seasonal autoregressive and moving average models, respectively. d=(1B)d, SD=(1B)SD, φ(B)=1φ1BφpBp,θ(B)=1θ1BθqBq, Φ(Bs)=1Φ1BsΦPBPs, Θ(Bs)=1Θ1BsΘQBQs.

We utilized the R statistical package (version 3.4.3, R Development Core Team, Vienna, Austria) and SPSS software (version 17.0, IBM Corp, Armonk, NY) to construct the SARIMA model. The development of the SARIMA approach is under the assumption of a stationary incidence time series10. Therefore, in this research, the ADF test was used to identify whether the actual reported cases and processed data using differencing or a transformation technique accomplished stationarity49. Afterwards, the autocorrelation function (ACF) and partial autocorrelation function (PACF) plots, the Schwarz Bayesian criterion (SBC), the Akaike information criterion (AIC) and the corrected Akaike Information criterion (AICc), along with the Lagrangian multiplier (LM) and Ljung-Box Q tests were applied to estimate and diagnose the model50. The above mentioned modeling procedures were repeatedly conducted until the best-performing model was ultimately discovered.

Establishing the NAR model

Complexities and challenges in understanding the temporal characteristics of infectious diseases that exist are the complicated nonlinear interactions among different dimensions in real-world scenarios16. Artificial neural networks (ANNs) can adequately enable arbitrarily intricate non-stationary series to attain any desired accuracy owing to their powerful flexible nonlinear mapping capacity, and they have been considered as a function approximator applied in the domains of environmental forecasting, electrical energy and medicine10,16. The NAR model is a leading shallow, dynamic recurrent neural network (RNN) that is based on the linear autoregressive model with the ability to time-varying the state of interconnected neurons51, can be adopted to explore the nonlinear relationship between the response variable and its predictors owing to its network architecture with a hidden layer accompanied by a sigmoid transfer function that allows it to have no restrictions on the parameters that comply with the requirement of stationarity5254. Furthermore, with the aid of tapped delay lines, the NAR technique also has a short-term memory function for the previous inputs and outputs, which makes its response at any given time rely not only on the present inputs but on the history of the inputs series as well55. For a time series with obvious seasonality, when providing suitable observations from the same season as inputs, this network can capture the time series components of periodicity, seasonality and secular trend adequately well with respect to the appropriate inputs that require a multitude of experiments to discover the optimum. Accordingly, this method can provide reliable forecasts for current HFMD incidence case series including linear and nonlinear information. The specified equation of the NAR method can be written as:

y(t)=f(y(t1),y(t2),,y(td)), 2

where y(t) represents the predicted points of the HFMD incidence series relied merely on the prior data of lagged period d.

In this work, the graphical user interface (GUI) in MATLAB (Version R2014a, MathWorks, Natick, MA, USA) was employed to automatically create an advanced script prior to modeling an NAR. First, the actual HFMD observations were processed between 0 and 1 using a normalized approach56 to facilitate further analysis. Second, the dividerand function was used to randomly divide the in-sample data into training, validation and testing subsets, following ratios of 70%, 15% and 15%, respectively. In the robustness-test data, the abovementioned ratio in the training, validation and testing subsets was also used in the first test dataset, while another commonly used ratio of 80%, 10% and 10% corresponding to the training, validation and testing subsets, respectively, was employed in the second test dataset. Third, the number of hidden neurons and delays d were adjusted by repeated attempts with the Levenberg-Marquardt algorithm in an open feedback loop. The response plot of outputs and targets, correlograms and input-error cross-correlation plots, coupled with the mean square error (MSE) and correlation coefficient (R), were offered to choose the best-fitting model. Finally, the training open-loop form was transformed to a closed loop to achieve a goal of multistep-ahead forecasting (Supplementary Fig. S35).

Establishing the LSTM model

As mentioned above, coincident with the increase in the time lag has been a decrease in the long-term learning ability during the training of an NAR method due to a vanishing or exploding gradient problem, which is a major flaw for NAR method forecasting26. A LSTM model, not the least prevalent and rewarding variant of the conventional RNNs, can overcome this disadvantage encountered in an NAR model as it is capable of maintaining state and identifying traits over the length of the sequences used26,27. The LSTM technique has a special layered architecture with memory blocks that contain one or more self-connected memory cells22,57, surrounded by three gating units, including the input, output and forget gates, that can continuously perform write, read and reset operations, to preserve information (Supplementary Fig. S36)57,58. Such a configuration can keep the states persisting or communicating between updates of the weights with the progress of each epoch; moreover, it can reinforce the RNN by capturing the long-term dynamics of the time series components of periodicity, seasonality and secular trend in addition to the short-term dynamics. The estimated equations of the LSTM model can be defined as:

it=σ(Wxixt+Whiht1+Wcict1+b)i 3
ft=σ(Wxfxt+Whfht1+Wcfct1+b)f 4
ct=ftct1+ittanh(Wxcxt+Whcht1+Wcict1+b)c 5
ot=σ(Wxoxt+Whoht1+Wcoct1+b)o 6
ht=ottanh(ct) 7
y(t)=Wyh(f(Whxy(t1)+Whhht1+bh)+by 8

where it refers to the input gate; ft is the forget gate; ct stands for the states of memory cell at time t; ot represents the output gate; ht is the hidden states at time t; Wxc, Wxi, Wxf and Wxo are the weight matrices connecting the input signals; xt, Whc, Whi, Whf and Who represent the weight matrices connecting the hidden layer output signals; ht, Wci, Wcf and Wco stand for the diagonal matrices connecting the neuron activation functions; bi, bc, bf, bo, bh and by refer to the bias vectors; σ is the activation function (tanh or sigmoid); y(t) is the predicted points of the HFMD incidence series; Whx, Whh, and Wyh are the input-hidden weight matrix, hidden-hidden weight matrix and hidden-output weight matrix, respectively; and yt1=(yt1,yt2,,ytd) is a vector including time steps of the series.

Our prediction project using the LSTM model for regression was conducted with Keras. First, the actual HFMD notifications were rescaled between 0 and 1 using a normalized preprocessing approach. Second, in-sample data were separated into two blocks, 75% of them would be used for the training model, whereas the remaining 25% of the data would be utilized to validate the generalization and to simulate performance. Similarly, in the robustness-test data, the same data classification described above was applied to the first test dataset, while 70% and 30% data points were taken as training and validation subsets, respectively, out of the second test dataset. Third, the back propagation through time (BPTT) algorithm59 was adopted for LSTM training with various time steps, and hidden layer neurons used to select the preferred model relied on the minimum root mean square error loss and satisfactory ACF plot22. Finally, the best-fitting architecture was chosen to generate out-of-sample predictions, and then the results should further be transformed to the simulated and forecasted values from the original observations with the inverse transform technique.

Measuring for accuracy

In order to distinguish the stimulation and forecasting accuracies from the selected various models, the root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) were ultimately adopted to measure the performance accuracy.

RMSE=1Ni=1N(XiX¯i)2 9
MAE=1Ni=1N|XiX¯i| 10
MAPE=1Ni=1N|XiX¯i|Xi 11

Where Xi stands for the actual notified notifications, X¯i represents the simulated and predictive values with the selected preferred methods, N is the number of simulations and predictions under the models used.

Supplementary information

Table S12 (34.5KB, xlsx)

Acknowledgements

We thank all units involving the collection of HFMD data and the infectious disease monitoring system, coupled with the funders for the present study. This project was supported by the Graduate Student Innovation Fund of Hebei Province (CXZZBS2017130).

Author Contributions

Y.B.W., C.J.X. and J.X.Y. conceived this work, and collected and analyzed the data. S.K.Z., Z.D.W., L.Y. and Y.Z. improved the paper. All authors agree to submit this article. The funders did not take part in the data curation, formal analysis, methodology and improvement of the present manuscript.

Data Availability

These data can be extracted as presented in the website of data collection or please contact the first author on reasonable request.

Competing Interests

The authors declare no competing interests.

Footnotes

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Yongbin Wang and Chunjie Xu contributed equally.

Supplementary information

Supplementary information accompanies this paper at 10.1038/s41598-019-44469-9.

References

  • 1.Wang JF, et al. Hand, foot and mouth disease: spatiotemporal transmission and climate. Int J Health Geogr. 2011;10:25. doi: 10.1186/1476-072X-10-25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Wang Y, et al. Hand, foot, and mouth disease in China: patterns of spread and transmissibility. Epidemiology. 2011;22:781–792. doi: 10.1097/EDE.0b013e318231d67a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Xing W, et al. Hand, foot, and mouth disease in China, 2008–2012: an epidemiological study. Lancet Infectious Diseases. 2014;14:308–318. doi: 10.1016/S1473-3099(13)70342-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wei J, et al. The effect of meteorological variables on the transmission of hand, foot and mouth disease in four major cities of shanxi province, China: a time series data analysis (2009–2013) PLoS Neglected Tropical Diseases. 2015;9:e0003572. doi: 10.1371/journal.pntd.0003572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ang LW, et al. Seroepidemiology of Coxsackievirus A6, Coxsackievirus A16, and Enterovirus 71 Infections among Children and Adolescents in Singapore, 2008–2010. PloS One. 2015;10:e0127999. doi: 10.1371/journal.pone.0127999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhuang ZC, et al. Epidemiological Research on Hand, Foot, and Mouth Disease in Mainland China. Viruses. 2015;7:6400–6411. doi: 10.3390/v7122947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wang L, et al. Epidemiologic characteristics of hand, foot, and mouth disease in China from 2006 to 2015. Journal of Infection. 2016;73:512–515. doi: 10.1016/j.jinf.2016.08.007. [DOI] [PubMed] [Google Scholar]
  • 8.Zheng YM, Yang J, Liao QH. Direct medical costs and influencing factors in severe hand, foot and mouth disease in children aged between six months and five years old. Zhonghua Yu Fang Yi Xue Za Zhi. Chinese Journal of Preventive Medicine. 2017;51:87–92. doi: 10.3760/cma.j.issn.0253-9624.2017.01.017. [DOI] [PubMed] [Google Scholar]
  • 9.Ma E, Chan KC, Cheng P, Wong C, Chuang SK. The enterovirus 71 epidemic in 2008–public health implications for Hong Kong. International Journal of Infectious Diseases. 2010;14:e775–e780. doi: 10.1016/j.ijid.2010.02.2265. [DOI] [PubMed] [Google Scholar]
  • 10.Yu L, et al. Application of a new hybrid model with seasonal auto-regressive integrated moving average (ARIMA) and nonlinear auto-regressive neural network (NARNN) in forecasting incidence cases of HFMD in Shenzhen, China. PloS One. 2014;9:e98241. doi: 10.1371/journal.pone.0098241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Du Z, et al. Predicting the hand, foot, and mouth disease incidence using search engine query data and climate variables: an ecological study in Guangdong, China. Bmj Open. 2017;7:e016263. doi: 10.1136/bmjopen-2017-016263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zhang X, et al. Temporal and long-term trend analysis of class C notifiable diseases in China from 2009 to 2014. BMJ Open. 2016;6:e011038. doi: 10.1136/bmjopen-2016-011038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Ministry of Science and Technology of the People’s Republic of China. HFMD vaccine developed by China now on sale. Available at, http://www.most.gov.cn/eng/pressroom/ 201605/t20160506_125468.htm (Accessed: August 03 2018) (2016).
  • 14.National Health and Family Planning Commission of China. Suggestions on prevention and control of HFMD in 2018. Available at, http://www.nhfpc.gov.cn/jkj/s3578/201803/45ae9b 6347dc4b6cb87b8b871c578456.shtml (Accessed: August 03 2018) (2018).
  • 15.Organization, W. H. A Guide to Clinical Management and Public Health Response for Hand, Foot and Mouth disease (HFMD). Organizational Behavior and Human Decision Processes (2011).
  • 16.Zhou L, et al. A hybrid model for predicting the prevalence of schistosomiasis in humans of Qianjiang City, China. PloS One. 2014;9:e104875. doi: 10.1371/journal.pone.0104875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Liu L, Luan RS, Yin F, Zhu XP, Lu Q. Predicting the incidence of hand, foot and mouth disease in Sichuan province, China using the ARIMA model. Epidemiology and Infection. 2016;144:144–151. doi: 10.1017/s0950268815001144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Hochreiter S, Schmidhuber J. Long short-term memory. Neural Computation. 1997;9:1735–1780. doi: 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]
  • 19.Chen K, Huo Q. Training Deep Bidirectional LSTM Acoustic Model for LVCSR by a Context-Sensitive-Chunk BPTT Approach. IEEE/ACM Transactions on Audio Speech & Language Processing. 2016;24:1185–1193. doi: 10.1109/TASLP.2016.2539499. [DOI] [Google Scholar]
  • 20.Gao L, Guo Z, Zhang H, Xu X, Shen HT. Video Captioning With Attention-Based LSTM and Semantic Consistency. IEEE Transactions on Multimedia. 2017;19:2045–2055. doi: 10.1109/TMM.2017.2729019. [DOI] [Google Scholar]
  • 21.Zen H. Acoustic Modeling in Statistical Parametric Speech Synthesis - From HMM to LSTM-RNN. Middle East Policy. 2015;15:125–132. [Google Scholar]
  • 22.Deng L, Yu D. Deep Learning: Methods and Applications. Foundations & Trends in Signal Processing. 2014;7:197–387. doi: 10.1561/2000000039. [DOI] [Google Scholar]
  • 23.Hyndman RJ, Ord K, Snyder RD, Koehler AB. Prediction intervals for exponential smoothing using two new classes of state space models. Journal of Forecasting. 2005;24:17–37. doi: 10.1002/for.938. [DOI] [Google Scholar]
  • 24.Khosravi A, Nahavandi S, Creighton D, Atiya AF. Comprehensive review of neural network-based prediction intervals and new advances. IEEE Trans Neural Netw. 2011;22:1341–1356. doi: 10.1109/tnn.2011.2162110. [DOI] [PubMed] [Google Scholar]
  • 25.D’Informatique DE, et al. Long Short-Term Memory in Recurrent Neural Networks. Epfl. 2001;9:1735–1780. [Google Scholar]
  • 26.Li X, et al. Long short-term memory neural network for air pollutant concentration predictions: Method development and evaluation. Environmental Pollution (Barking, Essex: 1987) 2017;231:997–1004. doi: 10.1016/j.envpol.2017.08.114. [DOI] [PubMed] [Google Scholar]
  • 27.Bao W, Yue J, Rao Y. A deep learning framework for financial time series using stacked autoencoders and long-short term memory. PloS One. 2017;12:e0180944. doi: 10.1371/journal.pone.0180944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lee Sangmok, Lee Donghyun. Improved Prediction of Harmful Algal Blooms in Four Major South Korea’s Rivers Using Deep Learning Models. International Journal of Environmental Research and Public Health. 2018;15(7):1322. doi: 10.3390/ijerph15071322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gan R, Chen X, Yan Y, Huang D. Application of a hybrid method combining grey model and back propagation artificial neural networks to forecast hepatitis B in china. Computational and Mathematical Methods in Medicine. 2015;2015:328273. doi: 10.1155/2015/328273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Wang Y, et al. Temporal trends analysis of human brucellosis incidence in mainland China from 2004 to 2018. Scientific Reports. 2018;8:15901. doi: 10.1038/s41598-018-33165-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Al-Sakkaf A, Jones G. Comparison of time series models for predicting campylobacteriosis risk in New Zealand. Zoonoses Public Health. 2014;61:167–174. doi: 10.1111/zph.12046. [DOI] [PubMed] [Google Scholar]
  • 32.Zhang X, et al. Comparative study of four time series methods in forecasting typhoid fever incidence in China. PloS One. 2013;8:e63116. doi: 10.1371/journal.pone.0063116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.He F, et al. Construction and evaluation of two computational models for predicting the incidence of influenza in Nagasaki Prefecture, Japan. Scientific Reports. 2017;7:7192. doi: 10.1038/s41598-017-07475-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Zhou L, et al. Using a Hybrid Model to Forecast the Prevalence of Schistosomiasis in Humans. International Journal of Environmental Research and Public Health. 2016;13:355. doi: 10.3390/ijerph13040355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhou L, Zhao P, Wu D, Cheng C, Huang H. Time series model for forecasting the number of new admission inpatients. BMC Medical Informatics and Decision Making. 2018;18:39. doi: 10.1186/s12911-018-0616-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Wu W, et al. Application of nonlinear autoregressive neural network in predicting incidence tendency of hemorrhagic fever with renal syndrome. Zhonghua Liu Xing Bing Xue Za Zhi. 2015;36:1394–1396. [PubMed] [Google Scholar]
  • 37.Van Tu P, et al. Epidemiologic and virologic investigation of hand, foot, and mouth disease, southern Vietnam, 2005. Emerging Infectious Diseases. 2007;13:1733–1741. doi: 10.3201/eid1311.070632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Podin Y, et al. Sentinel surveillance for human enterovirus 71 in Sarawak, Malaysia: lessons from the first 7 years. BMC Public Health. 2006;6:180. doi: 10.1186/1471-2458-6-180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Yang B, Lau EH, Wu P, Cowling BJ. Transmission of Hand, Foot and Mouth Disease and Its Potential Driving Factors in Hong Kong. Scientific Reports. 2016;6:27500. doi: 10.1038/srep27500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Chen KT, Chang HL, Wang ST, Cheng YT, Yang JY. Epidemiologic features of hand-foot-mouth disease and herpangina caused by enterovirus 71 in Taiwan, 1998–2005. Pediatrics. 2007;120:e244–252. doi: 10.1542/peds.2006-3331. [DOI] [PubMed] [Google Scholar]
  • 41.Ang LW, et al. Epidemiology and control of hand, foot and mouth disease in Singapore, 2001–2007. Annals of the Academy of Medicine, Singapore. 2009;38:106–112. [PubMed] [Google Scholar]
  • 42.Zhang Y, et al. An emerging recombinant human enterovirus 71 responsible for the 2008 outbreak of hand foot and mouth disease in Fuyang city of China. Virology Journal. 2010;7:94. doi: 10.1186/1743-422X-7-94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Zhang Y, et al. An outbreak of hand, foot, and mouth disease associated with subgenotype C4 of human enterovirus 71 in Shandong, China. Journal of Clinical Virology. 2009;44:262–267. doi: 10.1016/j.jcv.2009.02.002. [DOI] [PubMed] [Google Scholar]
  • 44.Wang C, et al. Spatiotemporal Cluster Patterns of Hand, Foot, and Mouth Disease at the County Level in Mainland China, 2008–2012. PloS One. 2016;11:e0147532. doi: 10.1371/journal.pone.0147532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Liu W, et al. Co-circulation and genomic recombination of coxsackievirus A16 and enterovirus 71 during a large outbreak of hand, foot, and mouth disease in Central China. PloS One. 2014;9:e96051. doi: 10.1371/journal.pone.0096051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.He Z, Tao H. Epidemiology and ARIMA model of positive-rate of influenza viruses among children in Wuhan, China: A nine-year retrospective study. International Journal of Infectious Diseases. 2018;74:61–70. doi: 10.1016/j.ijid.2018.07.003. [DOI] [PubMed] [Google Scholar]
  • 47.Cortes F, et al. Time series analysis of dengue surveillance data in two Brazilian cities. Acta Tropica. 2018;182:190–197. doi: 10.1016/j.actatropica.2018.03.006. [DOI] [PubMed] [Google Scholar]
  • 48.Arruda AG, Vilalta C, Puig P, Perez A, Alba A. Time-series analysis for porcine reproductive and respiratory syndrome in the United States. PloS One. 2018;13:e0195282. doi: 10.1371/journal.pone.0195282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Wang Chunli, Li Yongdong, Feng Wei, Liu Kui, Zhang Shu, Hu Fengjiao, Jiao Suli, Lao Xuying, Ni Hongxia, Xu Guozhang. Epidemiological Features and Forecast Model Analysis for the Morbidity of Influenza in Ningbo, China, 2006–2014. International Journal of Environmental Research and Public Health. 2017;14(6):559. doi: 10.3390/ijerph14060559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Xu Q, et al. Forecasting the Incidence of Mumps in Zibo City Based on a SARIMA Model. International Journal of Environmental Research and Public Health. 2017;14:925. doi: 10.3390/ijerph14080925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Wang KW, et al. Hybrid methodology for tuberculosis incidence time-series forecasting based on ARIMA and a NAR neural network. Epidemiology and Infection. 2017;145:1118–1129. doi: 10.1017/S0950268816003216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Adeboye A, Davies O, Akinwumi O, James N, Ruffin M. Seasonality and Trend Forecasting of Tuberculosis Prevalence Data in Eastern Cape, South Africa, Using a Hybrid Model. International Journal of Environmental Research and Public Health. 2016;13:757. doi: 10.3390/ijerph13080757.. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Ansari M, Othman F, Abunama T, El-Shafie A. Analysing the accuracy of machine learning techniques to develop an integrated influent time series model: case study of a sewage treatment plant, Malaysia. Environmental Science and Pollution Research International. 2018;25:12139–12149. doi: 10.1007/s11356-018-1438-z. [DOI] [PubMed] [Google Scholar]
  • 54.Zhang Juan, Qiu Han, Li Xiaoyu, Niu Jie, Nevers Meredith B., Hu Xiaonong, Phanikumar Mantha S. Real-Time Nowcasting of Microbiological Water Quality at Recreational Beaches: A Wavelet and Artificial Neural Network-Based Hybrid Modeling Approach. Environmental Science & Technology. 2018;52(15):8446–8455. doi: 10.1021/acs.est.8b01022. [DOI] [PubMed] [Google Scholar]
  • 55.Wu W, et al. Comparison of Two Hybrid Models for Forecasting the Incidence of Hemorrhagic Fever with Renal Syndrome in Jiangsu Province, China. PloS One. 2015;10:e0135492. doi: 10.1371/journal.pone.0135492. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Shafaei M, Adamowski J, Fakherifard A, Dinpashoh Y, Adamowski K. A wavelet-SARIMA-ANN hybrid model for precipitation forecasting. Journal of Water & Land Development. 2016;28:27–36. doi: 10.1515/jwld-2016-0003. [DOI] [Google Scholar]
  • 57.Jimeno Yepes A. Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation. J Biomed Inform. 2017;73:137–147. doi: 10.1016/j.jbi.2017.08.001. [DOI] [PubMed] [Google Scholar]
  • 58.Volkova S, Ayton E, Porterfield K, Corley CD. Forecasting influenza-like illness dynamics for military populations using neural networks and social media. PloS One. 2017;12:e0188941. doi: 10.1371/journal.pone.0188941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Bengio Y, Simard P, Frasconi P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netw. 2002;5:157–166. doi: 10.1109/72.279181. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S12 (34.5KB, xlsx)

Data Availability Statement

These data can be extracted as presented in the website of data collection or please contact the first author on reasonable request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES