Learning to classify emotional content in crisis-related tweets

Social media is increasingly being used during crises. This makes it possible for crisis responders to collect and process crisis-related user generated content to allow for improved situational awareness. We describe a methodology for collecting a large number of relevant tweets and annotating them with emotional labels. This methodology has been used for creating a training data set consisting of manually annotated tweets from the Sandy hurricane. Those tweets have been utilized for building machine learning classifiers able to automatically classify new tweets. Results show that a support vector machine achieves the best results (60% accuracy on the multi-classification problem).

[1]  Jiri Trnka,et al.  Making use of new media for pan-European crisis communication , 2012, ISCRAM.

[2]  Björn Johansson,et al.  Dialogical emergency management and strategic awareness in emergency communication , 2011, ISCRAM.

[3]  Robert Thomson,et al.  Trusting tweets: The Fukushima disaster and information source credibility on Twitter , 2012, ISCRAM.

[4]  Andrea Zielinski,et al.  Multilingual analysis of twitter news in support of mass emergency events , 2012, ISCRAM.

[5]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[6]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[7]  Richard Colbaugh,et al.  Estimating the sentiment of social media content for security informatics applications , 2011, Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics.

[8]  Jeannie A. Stamberger,et al.  Crowd sentiment detection during disasters and crises , 2012, ISCRAM.

[9]  Fredrik Johansson,et al.  Using Video Prototyping as a Means to Involve Crisis Communication Personnel in the Design Process: Innovating Crisis Management by Creating a Social Media Awareness Tool , 2013, HCI.

[10]  Leysia Palen,et al.  Natural Language Processing to the Rescue? Extracting "Situational Awareness" Tweets During Mass Emergency , 2011, ICWSM.

[11]  Fredrik Johansson,et al.  Detecting Emergent Conflicts through Web Mining and Visualization , 2011, 2011 European Intelligence and Security Informatics Conference.

[12]  Nitin Indurkhya,et al.  Handbook of Natural Language Processing , 2010 .

[13]  Jie Yin,et al.  Using Social Media to Enhance Emergency Situation Awareness , 2012, IEEE Intelligent Systems.

[14]  Christoph Dosch,et al.  Alert4All: an Integrated Concept for Effective Population Alerting in Crisis Situations , 2011 .

[15]  Ahmed Abbasi,et al.  Affect Intensity Analysis of Dark Web Forums , 2007, 2007 IEEE Intelligence and Security Informatics.

[16]  Monika Büscher,et al.  Peripheral response: Microblogging during the 22/7/2011 Norway attacks , 2012, ISCRAM.

[17]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[18]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[19]  Carlo Strapparava,et al.  WordNet Affect: an Affective Extension of WordNet , 2004, LREC.

[20]  Fredrik Johansson,et al.  Estimating Citizen Alertness in Crises Using Social Media Monitoring and Analysis , 2012, 2012 European Intelligence and Security Informatics Conference.

[21]  Carlo Strapparava,et al.  Learning to identify emotions in text , 2008, SAC '08.

[22]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[23]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[24]  Hsinchun Chen,et al.  Affect Analysis of Web Forums and Blogs Using Correlation Ensembles , 2008, IEEE Transactions on Knowledge and Data Engineering.