Short-Term Infectious Diarrhea Prediction Using Weather and Search Data in Xiamen, China

Infectious diarrhea has high morbidity and mortality around the world. For this reason, diarrhea prediction has emerged as an important problem to prevent and control outbreaks. Numerous studies have built disease prediction models using large-scale data. However, these methods perform poorly on diarrhea data. To address this issue, this paper proposes a parsimonious model (PM), which takes historical outpatient visit counts, meteorological factors (MFs) and Baidu search indices (BSIs) as inputs to perform prediction. An experimental evaluation was done to compare the short-term prediction performance of ten algorithms for four groups of inputs, using data collected in Xiamen, China. Results show that the proposed method is effective in improving the prediction accuracy.

[1]  Junzhong Gu,et al.  Diarrhoea outpatient visits prediction based on time series decomposition and multi-local predictor fusion , 2015, Knowl. Based Syst..

[2]  Jiujun Cheng,et al.  Dendritic Neuron Model With Effective Learning Algorithms for Classification, Approximation, and Prediction , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Giancarlo Fortino,et al.  A Smartphone-Enabled Fall Detection Framework for Elderly People in Connected Home Healthcare , 2019, IEEE Network.

[4]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[5]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[6]  S. Rajaram,et al.  A survey on forecasting of time series data , 2016, 2016 International Conference on Computing Technologies and Intelligent Data Engineering (ICCTIDE'16).

[7]  Guokun Lai,et al.  Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks , 2017, SIGIR.

[8]  Junzhong Gu,et al.  Artificial neural networks for infectious diarrhea prediction using meteorological factors in Shanghai (China) , 2015, Appl. Soft Comput..

[9]  Lianmei Jin,et al.  Chapter 2 – Infectious Disease Surveillance in China , 2017 .

[10]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[11]  Diego Reforgiato Recupero,et al.  Deep learning and time series-to-image encoding for financial forecasting , 2020, IEEE/CAA Journal of Automatica Sinica.

[12]  Yiming Yang,et al.  Deep Learning for Epidemiological Predictions , 2018, SIGIR.

[13]  Xiangnan He,et al.  Modeling Extreme Events in Time Series Prediction , 2019, KDD.

[14]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[15]  Wenjun Ma,et al.  Dengue Baidu Search Index data can improve the prediction of local dengue epidemic: A case study in Guangzhou, China , 2017, PLoS neglected tropical diseases.

[16]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[17]  S. Nie,et al.  Application of multiple seasonal ARIMA model in forecasting incidence of HFMD in Wuhan, China , 2014 .

[18]  MengChu Zhou,et al.  An embedded feature selection method for imbalanced data classification , 2019, IEEE/CAA Journal of Automatica Sinica.

[19]  Cristina C. R. Sady,et al.  Symbolic features and classification via support vector machine for predicting death in patients with Chagas disease , 2016, Comput. Biol. Medicine.

[20]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[21]  Xuerui Tan,et al.  The application of meteorological data and search index data in improving the prediction of HFMD: A study of two cities in Guangdong Province, China. , 2019, The Science of the total environment.

[22]  Hung-yi Lee,et al.  Temporal pattern attention for multivariate time series forecasting , 2018, Machine Learning.

[23]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[24]  Modesto Castrillón Santana,et al.  Deep learning for source camera identification on mobile devices , 2017, Pattern Recognit. Lett..

[25]  Yajia Lan,et al.  Development of Early Warning Models , 2017 .

[26]  P. Daszak,et al.  Economic growth, urbanization, globalization, and the risks of emerging infectious diseases in China: A review , 2016, Ambio.

[27]  Tao Chen,et al.  Effective tourist volume forecasting supported by PCA and improved BPNN using Baidu index , 2018, Tourism Management.

[28]  Garrison W. Cottrell,et al.  A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction , 2017, IJCAI.

[29]  MengChu Zhou,et al.  An online fault detection model and strategies based on SVM-grid in clouds , 2018, IEEE/CAA Journal of Automatica Sinica.

[30]  Yang Yang,et al.  Using Baidu Search Index to Predict Dengue Outbreak in China , 2016, Scientific Reports.

[31]  Kai Xu,et al.  Prediction and analysis of net ecosystem carbon exchange based on gradient boosting regression and random forest , 2020 .

[32]  Rosa Oppenheim Forecasting via the Box-Jenkins method , 1978 .

[33]  Ireneous N. Soyiri,et al.  An overview of health forecasting , 2012, Environmental Health and Preventive Medicine.