Using Twitter Data and Sentiment Analysis to Study Diseases Dynamics

Twitter has been recently used to predict and/or monitor real world outcomes, and this is also true for health related topic. In this work, we extract information about diseases from Twitter with spatio-temporal constraints, i.e. considering a specific geographic area during a given period. We exploit the SNOMED-CT terminology to correctly detect medical terms, using sentiment analysis to assess to what extent each disease is perceived by persons. We show our first results for a monitoring tool that allow to study the dynamic of diseases.

[1]  Michael J. Paul,et al.  Discovering Health Topics in Social Media Using Topic Models , 2014, PloS one.

[2]  Wolfgang Nejdl,et al.  Understanding the diversity of tweets in the time of outbreaks , 2013, WWW.

[3]  Sunmoo Yoon,et al.  A practical approach for content mining of Tweets. , 2013, American journal of preventive medicine.

[4]  Declan Butler,et al.  When Google got flu wrong , 2013, Nature.

[5]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[6]  Peter Jackson,et al.  Natural language processing for online applications : text retrieval, extraction and categorization , 2002 .

[7]  Gunther Eysenbach,et al.  Infodemiology and infoveillance tracking online health information and cyberbehavior for public health. , 2011, American journal of preventive medicine.

[8]  Jennifer L. Fisher,et al.  Who gives a tweet: assessing patients' interest in the use of social media for health care. , 2012, Worldviews on evidence-based nursing.

[9]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[10]  David A. Shamma,et al.  Characterizing debate performance via aggregated twitter sentiment , 2010, CHI.

[11]  Benyuan Liu,et al.  Predicting Flu Trends using Twitter data , 2011, 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[12]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[13]  Heng Ji,et al.  The Nature of Communications and Emerging Communities on Twitter Following the 2013 Syria Sarin Gas Attacks , 2014, 2014 IEEE Joint Intelligence and Security Informatics Conference.

[14]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[15]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[16]  John A. Bateman Review of “Natural Language Processing for online applications: Text retrieval, extraction and categorization” by Peter Jackson and Isabelle Moulinier , 2004 .

[17]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[18]  Gunther Eysenbach Infodemiology and Infoveillance: Innovative Methods and Tools to Measure, Track, and Analyze Population Health-Relevant Unstructured Data from the Internet and Social Media , 2011 .

[19]  Huan Liu,et al.  Twitter Data Analytics , 2013, SpringerBriefs in Computer Science.

[20]  Nicolette de Keizer,et al.  A survey of SNOMED CT implementations , 2012, Journal of Biomedical Informatics.

[21]  Fabrício Benevenuto,et al.  Comparing and combining sentiment analysis methods , 2013, COSN '13.

[22]  Krzysztof J. Cios,et al.  Uniqueness of medical data mining , 2002, Artif. Intell. Medicine.

[23]  Wael Khreich,et al.  A Survey of Techniques for Event Detection in Twitter , 2015, Comput. Intell..

[24]  Alok N. Choudhary,et al.  Real-time disease surveillance using Twitter data: demonstration on flu and cancer , 2013, KDD.