Enhancement of Epidemiological Models for Dengue Fever Based on Twi er Data

Epidemiological early warning systems for dengue fever rely on up-to-date epidemiological data to forecast future incidence. However, epidemiological data typically requires time to be available, due to the application of time-consuming laboratorial tests. This implies that epidemiological models need to issue predictions with larger antecedence, making their task even more difficult. On the other hand, online platforms, such as Twitter or Google, allow us to obtain samples of users' interaction in near real-time and can be used as sensors to monitor current incidence. In this work, we propose a framework to exploit online data sources to mitigate the lack of up-to-date epidemiological data by obtaining estimates of current incidence, which are then explored by traditional epidemiological models. We show that the proposed framework obtains more accurate predictions than alternative approaches, with statistically better results for delays greater or equal to 4 weeks.

[1]  S. Cassadou,et al.  Time series analysis of dengue incidence in Guadeloupe, French West Indies: Forecasting models using climate variables as predictors , 2011, BMC infectious diseases.

[2]  Wagner Meira,et al.  A latent shared-component generative model for real-time disease surveillance using Twitter data , 2015, ArXiv.

[3]  D. Cummings,et al.  Prediction of Dengue Incidence Using Search Query Surveillance , 2011, PLoS neglected tropical diseases.

[4]  Gisele L. Pappa,et al.  An Accurate Gaussian Process-Based Early Warning System for Dengue Fever , 2016, 2016 5th Brazilian Conference on Intelligent Systems (BRACIS).

[5]  Virgílio A. F. Almeida,et al.  Dengue surveillance based on a computational model of spatio-temporal locality of Twitter , 2011, WebSci '11.

[6]  Irene Casas,et al.  Intra- and Interseasonal Autoregressive Prediction of Dengue Outbreaks Using Local Weather and Regional Climate for a Tropical Environment in Colombia , 2014, The American journal of tropical medicine and hygiene.

[7]  K. Jaroensutasinee,et al.  Forecasting Dengue Haemorrhagic Fever Cases in Southern Thailand using ARIMA Models , 2006 .

[8]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[9]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[10]  Michael J. Paul,et al.  National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic , 2013, PloS one.

[11]  John S. Brownstein,et al.  The global distribution and burden of dengue , 2013, Nature.

[12]  Nazri Che Dom,et al.  Generating temporal model using climate variables for the prediction of dengue cases in Subang Jaya, Malaysia , 2013 .

[13]  Alina Deshpande,et al.  Global Disease Monitoring and Forecasting with Wikipedia , 2014, PLoS Comput. Biol..

[14]  E. Martinez,et al.  Predicting the number of cases of dengue infection in Ribeirão Preto, São Paulo State, Brazil, using a SARIMA model. , 2011, Cadernos de saude publica.

[15]  Gisele L. Pappa,et al.  An Evolutionary Methodology for Handling Data Scarcity and Noise in Monitoring Real Events from Social Media Data , 2014, IBERAMIA.

[16]  Alicia Karspeck,et al.  Real-Time Influenza Forecasts during the 2012–2013 Season , 2013, Nature Communications.