Analyzing Algorithms to Detect Disaster Events using Social Media

Disasters are instabilities that occur on the interface between society and the environment. During disasters, people communicate to inform and request for support for themselves or their community. Social media is used as a medium for communication due to its wide reach and global audience. During disasters, people communicate via messages regarding similar or different types of emergencies in the same general location. Interpreting and validating these messages during the occurrence of a disaster costs a significant time and loss. Therefore, this study presents a comparison between three models, K-Nearest Neighbor (KNN), Naive Bayes (NB), and Support Vector Machine (SVM), to classify and label a message as a disaster event. In order to simulate the examining process further, a categorization system is introduced to categorize the severity of a disaster as described in each message in a disaster environment. performances are compared for each of the models using classification scores of supervised learning.

[1]  Khairullah Khan,et al.  A Review of Machine Learning Algorithms for Text-Documents Classification , 2010 .

[2]  P. Yuan,et al.  A new emergency management approach in disaster operation management based on the activity network technology , 2017, 2017 Chinese Automation Congress (CAC).

[3]  Marcelo G. Manzato,et al.  Mining unstructured content for recommender systems: an ensemble approach , 2016, Information Retrieval Journal.

[4]  Y. Lin,et al.  Tracking and Analyzing Individual Distress Following Terrorist Attacks Using Social Media Streams , 2017, Risk analysis : an official publication of the Society for Risk Analysis.

[5]  María del Rocío Martínez-Torres,et al.  A machine learning approach for the identification of the deceptive reviews in the hospitality sector using unique attributes and sentiment orientation , 2019, Tourism Management.

[6]  Niranjan N. Chiplunkar,et al.  A new big data approach for topic classification and sentiment analysis of Twitter data , 2019, Evolutionary Intelligence.

[7]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[8]  Shishir Kumar,et al.  An Effective Approach to Track Levels of Influenza-A (H1N1) Pandemic in India Using Twitter , 2015 .

[9]  Vijayan Sugumaran,et al.  Building knowledge base of urban emergency events based on crowdsourcing of social media , 2016, Concurr. Comput. Pract. Exp..

[10]  Bo Song,et al.  Cross-border e-commerce commodity risk assessment using text mining and fuzzy rule-based reasoning , 2019, Adv. Eng. Informatics.

[11]  Ali Selamat,et al.  Hybrid sentiment classification on twitter aspect-based sentiment analysis , 2018, Applied Intelligence.

[12]  Oscar Castillo,et al.  A state of the art review of intelligent scheduling , 2018, Artificial Intelligence Review.

[13]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[14]  Kyung Sup Kwak,et al.  Transportation sentiment analysis using word embedding and ontology-based topic modeling , 2019, Knowl. Based Syst..

[15]  Yau-Hwang Kuo,et al.  Integrated microblog sentiment analysis from users’ social interaction patterns and textual opinions , 2015, Applied Intelligence.

[16]  S. Abdallah,et al.  Mining government tweets to identify and predict citizens engagement , 2020 .

[17]  Azuraliza Abu Bakar,et al.  Hybrid N-gram model using Naïve Bayes for classification of political sentiments on Twitter , 2019, Neural Computing and Applications.

[18]  Dan Grigoras,et al.  Using Twitter and the mobile cloud for delivering medical help in emergencies , 2017, Concurr. Comput. Pract. Exp..

[19]  P. I. Banokin,et al.  Natural language text parsing for social network user sentiment analysis based on fuzzy sets , 2015, 2015 International Conference on Mechanical Engineering, Automation and Control Systems (MEACS).

[20]  Khaled Shaalan,et al.  An Arabic social media based framework for incidents and events monitoring in smart cities , 2019, Journal of Cleaner Production.

[21]  Guandong Xu,et al.  What’s Happening Around the World? A Survey and Framework on Event Detection Techniques on Twitter , 2019, Journal of Grid Computing.

[22]  Bijan Raahemi,et al.  Detecting financial restatements using data mining techniques , 2017, Expert Syst. Appl..

[23]  Hermann Szymczak,et al.  Social media in emergencies: How useful can they be , 2016, 2016 3rd International Conference on Information and Communication Technologies for Disaster Management (ICT-DM).

[24]  D. Jahed Armaghani,et al.  Random Forest and Bayesian Network Techniques for Probabilistic Prediction of Flyrock Induced by Blasting in Quarry Sites , 2020, Natural Resources Research.

[25]  S. Selva Brunda,et al.  Sentiment analysis by POS and joint sentiment topic features using SVM and ANN , 2018, Soft Comput..

[26]  Y. Hu,et al.  Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China , 2020, The Lancet.

[27]  Hossein Karami,et al.  A novel framework to generate clustering algorithms based on a particular classification structure , 2017, 2017 Artificial Intelligence and Signal Processing Conference (AISP).

[28]  Christopher F. Beaulieu,et al.  Artificial Intelligence and Machine Learning Applications in Musculoskeletal Imaging , 2020 .

[29]  Ahmet Ertugan,et al.  Applying fuzzy logic for sentiment analysis of social media network data in marketing , 2017 .

[30]  Juan M. Corchado,et al.  A polarity analysis framework for Twitter messages , 2015, Appl. Math. Comput..

[31]  Gonzalo A. Ruz,et al.  Sentiment analysis of Twitter data during critical events through Bayesian networks classifiers , 2020, Future Gener. Comput. Syst..

[32]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[33]  Christina B. Azodi,et al.  Opening the Black Box: Interpretable Machine Learning for Geneticists. , 2020, Trends in genetics : TIG.

[34]  Matthew Leighton Williams,et al.  Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making , 2015 .

[35]  Ammar Ismael Kadhim Term Weighting for Feature Extraction on Twitter: A Comparison Between BM25 and TF-IDF , 2019, 2019 International Conference on Advanced Science and Engineering (ICOASE).

[36]  Padraig Cunningham,et al.  k-Nearest Neighbour Classifiers - A Tutorial , 2020, ACM Comput. Surv..

[37]  Seba Susan,et al.  Fuzzy rule based unsupervised sentiment analysis from social media posts , 2019, Expert Syst. Appl..

[38]  Yogesh Kumar Dwivedi,et al.  Measuring social media influencer index- insights from facebook, Twitter and Instagram , 2019, Journal of Retailing and Consumer Services.

[39]  Mohamed Moussaoui,et al.  A possibilistic framework for the detection of terrorism‐related Twitter communities in social media , 2018, Concurr. Comput. Pract. Exp..

[40]  Natalia Sadovnikova,et al.  Analysis of Comments of Users of Social Networks to Assess the Level of Social Tension , 2017 .

[41]  Mark Heitmann,et al.  Comparing automated text classification methods , 2019, International Journal of Research in Marketing.

[42]  Tianzhen Hong,et al.  State-of-the-art on research and applications of machine learning in the building life cycle , 2020, Energy and Buildings.

[43]  Gilles Louppe,et al.  Independent consultant , 2013 .