Real-Time Monitoring of Flu Epidemics through Linguistic and Statistical Analysis of Twitter Messages

The recent rise in popularity of Twitter and its open API provides developers the opportunity to extract amounts of data which can be a thesaurus of information. This opportunity led to the development of an open source and open API system called Flu track (http://flutrack.org) that monitors influenza epidemics, based on geo-located self-reports on Twitter. In particular, we detect words such as sore throat, cough, fever etc. Moreover, we detect the aggravation of a patient's clinical condition when a user posts a second flu related tweet that contains words indicating further symptoms such as: worse, deteriorating. Finally, we present flu-positives with real time anonymous visualizations using maps (mapping), which might be helpful for authorities and sensitive populations to plan upcoming events or activities. In order to further aid the surveillance of the spreading of the disease, a classification experiment has been conducted for automatically identifying Tweets that describe cases with acute and more critical symptoms from those referring to milder cases. We found that making use of mereley very small n-gram keyword lexica, the automatic identification of critical cases reaches an accuracy of 92%.

[1]  Mark Dredze,et al.  Separating Fact from Fear: Tracking Flu Infections on Twitter , 2013, NAACL.

[2]  J. Taubenberger,et al.  Influenza: The Once and Future Pandemic , 2010, Public health reports.

[3]  Henry A. Kautz,et al.  Predicting Disease Transmission from Geo-Tagged Micro-Blog Data , 2012, AAAI.

[4]  Aron Culotta,et al.  Towards detecting influenza epidemics by analyzing Twitter messages , 2010, SOMA '10.

[5]  R. Eccles,et al.  Understanding the symptoms of the common cold and influenza , 2005, The Lancet Infectious Diseases.

[6]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[7]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[8]  Y. Ghendon Introduction to pandemic influenza through history , 1994, European Journal of Epidemiology.

[9]  Beveridge Wi The chronicle of influenza epidemics. , 1991 .

[10]  Armin R. Mikler,et al.  Text and Structural Data Mining of Influenza Mentions in Web and Social Media , 2010, International journal of environmental research and public health.

[11]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[12]  Emmanouil Magkos,et al.  A spatial stochastic model for worm propagation: scale effects , 2007, Journal in Computer Virology.

[13]  Henry A. Kautz,et al.  Modeling the impact of lifestyle on health at scale , 2013, WSDM.

[14]  Benyuan Liu,et al.  Twitter Improves Seasonal Influenza Prediction , 2018, HEALTHINF.

[15]  Karin Thursky,et al.  Working towards a simple case definition for influenza surveillance. , 2003, Journal of clinical virology : the official publication of the Pan American Society for Clinical Virology.

[16]  Nigel Collier,et al.  Uncovering text mining: A survey of current work on web-based epidemic intelligence , 2012, Global public health.

[17]  Mizuki Morita,et al.  Twitter Catches The Flu: Detecting Influenza Epidemics using Twitter , 2011, EMNLP.

[18]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .