A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data

BackgroundDengue is the most common arboviral disease of humans, with more than one third of the world’s population at risk. Accurate prediction of dengue outbreaks may lead to public health interventions that mitigate the effect of the disease. Predicting infectious disease outbreaks is a challenging task; truly predictive methods are still in their infancy.MethodsWe describe a novel prediction method utilizing Fuzzy Association Rule Mining to extract relationships between clinical, meteorological, climatic, and socio-political data from Peru. These relationships are in the form of rules. The best set of rules is automatically chosen and forms a classifier. That classifier is then used to predict future dengue incidence as either HIGH (outbreak) or LOW (no outbreak), where these values are defined as being above and below the mean previous dengue incidence plus two standard deviations, respectively.ResultsOur automated method built three different fuzzy association rule models. Using the first two weekly models, we predicted dengue incidence three and four weeks in advance, respectively. The third prediction encompassed a four-week period, specifically four to seven weeks from time of prediction. Using previously unused test data for the period 4–7 weeks from time of prediction yielded a positive predictive value of 0.686, a negative predictive value of 0.976, a sensitivity of 0.615, and a specificity of 0.982.ConclusionsWe have developed a novel approach for dengue outbreak prediction. The method is general, could be extended for use in any geographical region, and has the potential to be extended to other environmentally influenced infections. The variables used in our method are widely available for most, if not all countries, enhancing the generalizability of our method.

[1]  M. Szczur,et al.  Surveillance of Arthropod Vector-Borne Infectious Diseases Using Remote Sensing Techniques: A Review , 2007, PLoS pathogens.

[2]  M. Guzmán,et al.  Dengue: an update. , 2002, The Lancet. Infectious diseases.

[3]  D. Focks,et al.  A simulation model of the epidemiology of urban dengue fever: literature analysis, model development, preliminary validation, and samples of simulation results. , 1995, The American journal of tropical medicine and hygiene.

[4]  Atlanta,et al.  Locally acquired Dengue--Key West, Florida, 2009-2010. , 2010, MMWR. Morbidity and mortality weekly report.

[5]  Harold Soh,et al.  Time-series infectious disease data analysis using SVM and genetic algorithm , 2007, 2007 IEEE Congress on Evolutionary Computation.

[6]  S. Halstead,et al.  Dengue , 1872, The Lancet.

[7]  L. Stark,et al.  Locally acquired dengue - Key West, Florida, 2009-2010. , 2010 .

[8]  S. Frolking,et al.  Satellite-based modeling of gross primary production in a seasonally moist tropical evergreen forest , 2005 .

[9]  Niranjan Kissoon,et al.  Dengue hemorrhagic fever and shock syndromes* , 2011, Pediatric critical care medicine : a journal of the Society of Critical Care Medicine and the World Federation of Pediatric Intensive and Critical Care Societies.

[10]  D. Vaughn,et al.  Dengue: an escalating problem , 2002, BMJ : British Medical Journal.

[11]  P. Raju,et al.  APPLICATION OF GIS MODELING FOR DENGUE FEVER PRONE AREA BASED ON SOCIO-CULTURAL AND ENVIRONMENTAL FACTORS – A CASE STUDY OF DELHI CITY ZONE , 2008 .

[12]  R. Irizarry,et al.  Travelling waves in the occurrence of dengue haemorrhagic fever in Thailand , 2004, Nature.

[13]  N.A. Husin,et al.  Modeling of dengue outbreak prediction in Malaysia: A comparison of Neural Network and Nonlinear Regression Model , 2008, 2008 International Symposium on Information Technology.

[14]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[15]  D. Fuller,et al.  El Niño Southern Oscillation and vegetation dynamics as predictors of dengue fever cases in Costa Rica , 2009, Environmental research letters : ERL [Web site].

[16]  Murat Sari,et al.  RECOGNITION OF DENGUE DISEASE PATTERNS USING ARTIFICIAL NEURAL NETWORKS , 2009 .

[17]  Christopher M. Gifford,et al.  Fuzzy association rule mining for community crime pattern discovery , 2010, ISI-KDD '10.

[18]  P Weinstein,et al.  El Niño and the dynamics of vectorborne disease transmission. , 1999, Environmental health perspectives.

[19]  Man Hon Wong,et al.  Mining fuzzy association rules in databases , 1998, SGMD.

[20]  C. Kummerow,et al.  The Tropical Rainfall Measuring Mission (TRMM) Sensor Package , 1998 .

[21]  Cláudia Torres Codeço,et al.  Spatial Evaluation and Modeling of Dengue Seroprevalence and Vector Density in Rio de Janeiro, Brazil , 2009, PLoS neglected tropical diseases.

[22]  María Cristina Riff,et al.  Towards an immune system that solves CSP , 2007, 2007 IEEE Congress on Evolutionary Computation.

[23]  A. Anyamba,et al.  Mapping Potential Risk of Rift Valley Fever Outbreaks in African Savannas Using Vegetation Index Time Series Data , 2002 .

[24]  D. Heymann Control of Communicable Diseases Manual , 2004 .

[25]  J. Gonzalez,et al.  Modelling the effect of temperature on transmission of dengue , 2010, Medical and veterinary entomology.

[26]  M. Islam,et al.  Forecasting dengue incidence in Dhaka, Bangladesh: A time series analysis , 2008 .

[27]  Halmar Halide,et al.  A predictive model for Dengue Hemorrhagic Fever epidemics , 2008, International journal of environmental health research.

[28]  K. Jaroensutasinee,et al.  Predicting DHF Incidence in Northern Thailand using Time Series Analysis Technique , 2007 .

[29]  Michael A. Johansson,et al.  Multiyear Climate Variability and Dengue—El Niño Southern Oscillation, Weather, and Dengue Incidence in Puerto Rico, Mexico, and Thailand: A Longitudinal Data Analysis , 2009, PLoS medicine.

[30]  John C. Daucsavage,et al.  Land processes distributed active archive center product lifecycle plan , 2014 .

[31]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[32]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[33]  G. K. Tan,et al.  Pathogenesis and prevention of dengue virus infection: state-of-the-art , 2009, Current opinion in infectious diseases.

[34]  Alfredo Huete,et al.  Assessing the response of the MODIS vegetation indices to landscape disturbance in the forested areas of the legal Brazilian Amazon , 2010 .

[35]  K. Higuchi,et al.  Automatic Prediction System of Dengue Haemorrhagic-Fever Outbreak Risk by Using Entropy and Artificial Neural Network , 2008, 2008 International Symposium on Communications and Information Technologies.

[36]  Richard K. Kiang,et al.  Modeling and Predicting Seasonal Influenza Transmission in Warm Regions Using Climatological Parameters , 2010, PloS one.

[37]  Gavin C. Cawley,et al.  On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation , 2010, J. Mach. Learn. Res..

[38]  Shilu Tong,et al.  Dengue fever and El Niño/Southern Oscillation in Queensland, Australia: a time series predictive model , 2009, Occupational and Environmental Medicine.

[39]  Philip S. Brachman,et al.  Control of Communicable Diseases Manual, 17th Edition , 2001 .