Decoding Linguistic Ambiguity in Times of Emergency based on Twitter Disaster Datasets

As of November 2020, the massively popular social media site Twitter boasts 1.3 billion accounts, 330 million monthly users, and 500 million tweets per day. The sheer volume of data shared on this platform has legitimized Twitter as a real-time information sharing monolith. Millions of people can share information via Twitter making it's power in this space undeniable. In this project, we'd like to use that power for good. Twitter's real-time information sharing prowess has become the platform of interest when attempting to learn information on events in progress. This paper identifies Tweets related to disasters, accidents, or emergencies and determines such statements' sincerity. We introduce a new model that can detect real emergency tweets with a high classification accuracy by using a transfer learning model BERT for emotion classification and behavior modeling approach to detect tweets in Twitter disaster datasets. In doing so, we hope to provide a great service in identifying relevant, real-time information to first responders.

[1]  Rafael Valencia-García,et al.  Review of English literature on figurative language applied to social networks , 2019, Knowledge and Information Systems.

[2]  Jixing Yang,et al.  Application Methodology of Big Data for Emergency Management , 2020, 2020 IEEE 11th International Conference on Software Engineering and Service Science (ICSESS).

[3]  Thanasis Daradoumis,et al.  Towards the Construction of a Multi-agent Approach for Discovering the Meaning of Natural Language Collaborative Conversations , 2008, 2008 International Conference on Complex, Intelligent and Software Intensive Systems.

[4]  Guoliang Chen The analysis of ambiguity in English language based on structural grammar , 2012, 2012 2nd International Conference on Consumer Electronics, Communications and Networks (CECNet).

[5]  Yi-Shin Chen,et al.  EmotionX-IDEA: Emotion BERT - an Affectional Model for Conversation , 2019, ArXiv.

[6]  P. Saktel,et al.  Context based domain identification for resolving ambiguity , 2012, 2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT'12).

[7]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[8]  James H. Jones,et al.  Using BERT to Extract Topic-Independent Sentiment Features for Social Media Bot Detection , 2020, 2020 11th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON).

[9]  David R. Traum,et al.  "yeah Right": Sarcasm Recognition for Spoken Dialogue Systems , 2006, INTERSPEECH.

[10]  Hemant Purohit,et al.  Ranking and grouping social media requests for emergency services using serviceability model , 2020, Social Network Analysis and Mining.

[11]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[12]  Darwin Alulema,et al.  System for monitoring natural disasters using natural language processing in the social network Twitter , 2016, 2016 IEEE International Carnahan Conference on Security Technology (ICCST).

[13]  Cristina Bosco,et al.  Developing Corpora for Sentiment Analysis: The Case of Irony and Senti-TUT , 2013, IEEE Intelligent Systems.

[14]  Hidehiko Tanaka,et al.  Semantic ambiguity resolution by parallel distributed, associative inference and contradiction detection , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[15]  Weijian Ni,et al.  Text Quality Analysis of Emergency Response Plans , 2020, IEEE Access.

[16]  Andrea Esuli,et al.  Identification of Cross-Domain Ambiguity with Language Models , 2018, 2018 5th International Workshop on Artificial Intelligence for Requirements Engineering (AIRE).

[17]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[18]  Zhao Qian,et al.  Research on cross-language text similarity calculation , 2015, 2015 IEEE 5th International Conference on Electronics Information and Emergency Communication.

[19]  Byron C. Wallace,et al.  Modelling Context with User Embeddings for Sarcasm Detection in Social Media , 2016, CoNLL.

[20]  K. Raja,et al.  A fuzzy ERM for extracting and modeling uncertain spatial expressions in text , 2014, International Conference on Information Communication and Embedded Systems (ICICES2014).

[21]  Ronaldo C. Prati,et al.  The Comprehension of Figurative Language: What Is the Influence of Irony and Sarcasm on NLP Techniques? , 2016, Sentiment Analysis and Ontology Engineering.

[22]  Setareh Rafatirad,et al.  Using Transfer Learning Approach to Implement Convolutional Neural Network model to Recommend Airline Tickets by Using Online Reviews , 2020, 2020 15th International Workshop on Semantic and Social Media Adaptation and Personalization (SMA.

[23]  Paolo Rosso,et al.  On the difficulty of automatically detecting irony: beyond a simple case of negation , 2014, Knowledge and Information Systems.

[24]  Srinath Perera,et al.  Real-time natural language processing for crowdsourced road traffic alerts , 2015, 2015 Fifteenth International Conference on Advances in ICT for Emerging Regions (ICTer).

[25]  L. Javier García-Villalba,et al.  Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation , 2019, Sensors.

[26]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[27]  James H. Jones,et al.  Deep Contextualized Word Embedding for Text-based Online User Profiling to Detect Social Bots on Twitter , 2020, 2020 International Conference on Data Mining Workshops (ICDMW).

[28]  Véronique Hoste,et al.  Exploring the fine-grained analysis and automatic detection of irony on Twitter , 2018, Lang. Resour. Evaluation.

[29]  Maryam Heidari,et al.  Bidirectional Transformer based on online Text-based information to Implement Convolutional Neural Network Model For Secure Business Investment , 2020, 2020 IEEE International Symposium on Technology and Society (ISTAS).

[30]  Sun Yuqiang,et al.  The Comparative Research of the Ambiguity of Comprehension of Human-Machine Language , 2009, 2009 First International Workshop on Education Technology and Computer Science.

[31]  Yonas Woldemariam,et al.  Sentiment analysis in a cross-media analysis framework , 2016, 2016 IEEE International Conference on Big Data Analysis (ICBDA).

[32]  Maryam Heidari,et al.  Semantic Convolutional Neural Network model for Safe Business Investment by Using BERT , 2020, 2020 Seventh International Conference on Social Networks Analysis, Management and Security (SNAMS).