Intelligent Dengue Infoveillance Using Gated Recurrent Neural Learning and Cross-Label Frequencies

With dengue becoming a major concern in tropical countries such as the Philippines, it is important that public health officials are able to accurately determine the presence and magnitude of dengue activity as quickly as possible to facilitate fast emergency response. The prevalence of massive streams of publicly available data from social media make this possible through infoveillance. Infoveillance involves observing and analyzing online interactions to gather health-related data for informing decisions on public health. In this paper, we present a public health agent model that performs dengue infoveillance using a gated recurrent neural network classification model incorporated with pre-trained word embeddings and cross-label frequency calculation. We setup the agent to work on the Philippine Twitter stream as its primary environment. Further, we evaluate the agents classification ability using a holdout set of human-labeled tweets. Afterwards, we run a historical simulation where the trained agent works with a stream of six months worth of tweets from the Philippines and we correlate its infoveillance results with actual dengue morbidity data of that time period. Experiments show that the agent is capable of accurately identifying dengue-related tweets with low loss. Moreover, we confirm that the agent model can be used for determining actual dengue activity and can serve as an early warning system with high confidence.

[1]  Paola Velardi,et al.  Twitter mining for fine-grained syndromic surveillance , 2014, Artif. Intell. Medicine.

[2]  Klaus-Robert Müller,et al.  Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.

[3]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[4]  Geoffrey A. Solano,et al.  A disease outbreak detection system using autoregressive moving average in time series analysis , 2015, 2015 6th International Conference on Information, Intelligence, Systems and Applications (IISA).

[5]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[6]  Jaime Caro,et al.  Forecasting and data visualization of dengue spread in the Philippine Visayas island group , 2016, 2016 7th International Conference on Information, Intelligence, Systems & Applications (IISA).

[7]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[8]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[9]  Soon Ae Chun,et al.  Monitoring Public Health Concerns Using Twitter Sentiment Classifications , 2013, 2013 IEEE International Conference on Healthcare Informatics.

[10]  B. Lewis,et al.  Methods of using real-time social media technologies for detection and remote monitoring of HIV outcomes. , 2014, Preventive medicine.

[11]  Ivica Dimitrovski,et al.  Emotion identification in FIFA world cup tweets using convolutional neural network , 2015, 2015 11th International Conference on Innovations in Information Technology (IIT).

[12]  G. Eysenbach Infodemiology and Infoveillance: Framework for an Emerging Set of Public Health Informatics Methods to Analyze Search, Communication and Publication Behavior on the Internet , 2009, Journal of medical Internet research.

[13]  Kennedy Espina,et al.  Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets , 2016 .

[14]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[15]  Kurt Junshean Espinosa,et al.  An agent-based epidemic model for dengue simulation in the Philippines , 2015, 2015 Winter Simulation Conference (WSC).

[16]  Clayton Fink,et al.  Tweeting Fever: Can Twitter Be Used to Monitor the Incidence of Dengue-Like Illness in the Philippines? , 2014 .