Sentiment Analysis on Tweets related to infectious diseases in South America

Infectious diseases have a huge social and economic impact. They are caused by pathogenic microorganisms such as bacteria, viruses, parasites or fungi and they can be transmitted, directly or indirectly, from one person to another or from animals to humans (Zoonoses). Nowadays it is very important to detect the infectious diseases as soon as possible to prevent critical problems for the society. In this work we propose an approach for the sentiment classification of tweets related to infectious diseases. This kind of systems could help health professionals to know how society respond to advances in the treatment of these diseases. In addition, a comparison was made of the performance of three classification algorithms (J48, BayesNet, and SMO). The results showed that SMO provides better results than BayesNet and J48 algorithms, obtaining an F-measure of 84.4%.

[1]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[2]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[3]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[4]  Daniel Neagu,et al.  Social media analysis for product safety using text mining and sentiment analysis , 2014, 2014 14th UK Workshop on Computational Intelligence (UKCI).

[5]  M. Woolhouse,et al.  Emerging pathogens: the epidemiology and evolution of species jumps , 2005, Trends in Ecology & Evolution.

[6]  Vincenza Carchiolo,et al.  Using Twitter Data and Sentiment Analysis to Study Diseases Dynamics , 2015, ITBAM.

[7]  Mario Andrés Paredes-Valverde,et al.  Sentiment Classification of Spanish Reviews: An Approach based on Feature Selection and Machine Learning Methods , 2016, J. Univers. Comput. Sci..

[8]  Jianqiang Hao,et al.  Social media content and sentiment analysis on consumer security breaches , 2016 .

[9]  Miguel Ángel Rodríguez-García,et al.  Automatic detection of satire in Twitter: A psycholinguistic-based approach , 2017, Knowl. Based Syst..

[10]  Raymond Hsu,et al.  Machine Learning for Sentiment Analysis on the Experience Project , 2010 .

[11]  Aparup Khatua,et al.  Immediate and long-term effects of 2016 Zika Outbreak: A Twitter-based study , 2016, 2016 IEEE 18th International Conference on e-Health Networking, Applications and Services (Healthcom).

[12]  D. Jamison,et al.  Disease Control Priorities in Developing Countries , 1993 .

[13]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[14]  Nathalie Aussenac-Gilles,et al.  A study on LIWC categories for opinion mining in Spanish reviews , 2014, J. Inf. Sci..

[15]  G. Eysenbach,et al.  Pandemics in the Age of Twitter: Content Analysis of Tweets during the 2009 H1N1 Outbreak , 2010, PloS one.

[16]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[17]  Karin M. Verspoor,et al.  Towards Early Discovery of Salient Health Threats: A Social Media Emotion Classification Technique , 2016, PSB.