Event relatedness assessment of Twitter messages for emergency response

Abstract The ubiquity of smartphones and social media such as Twitter is clearly blurring traditional boundaries between producers and consumers of information. This is especially the case in emergency situations where people in the scene create and share on-the-spot information about the incident in real time. However, despite the proven importance of such platforms, finding event-related information from the real-time feeds of thousands of tweets is a significant challenge. This paper introduces a novel method for detecting event-specific and informative tweets that are likely to be beneficial for emergency response. The method investigates a sample dataset of tweets which was collected during a storm event passing over a specific area. The sample is manually labelled by three emergency management experts who annotated the sample dataset to obtain the ground truth through identification of the event-related tweets. A selected number of representative event-related tweets are used to extract the common patterns and to define event related term-classes based on term frequency analysis. The term-classes are used to evaluate the event relatedness of a sample dataset through a relationship scoring process. Consequently, each sample tweet is given an event-relatedness score which indicates how related a tweet is to the storm event. The results are compared with the ground truth to determine the cut-off relatedness score and to evaluate the performance of the method. The results of the evaluation indicate that the proposed method is able to detect event-related tweets with about 87% accuracy in a timely manner.

[1]  Danah Boyd,et al.  Detecting Spam in a Twitter Network , 2009, First Monday.

[2]  Meera Narvekar,et al.  A review of techniques for sentiment analysis Of Twitter data , 2014, 2014 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT).

[3]  H. Rao,et al.  Twitter as a Rapid Response News Service: An Exploration in the Context of the 2008 China Earthquake , 2010, Electron. J. Inf. Syst. Dev. Ctries..

[4]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[5]  Khairullah Khan,et al.  A Review of Machine Learning Algorithms for Text-Documents Classification , 2010 .

[6]  Virgílio A. F. Almeida,et al.  Detecting Spammers on Twitter , 2010 .

[7]  Philip C. Treleaven,et al.  Social media analytics: a survey of techniques, tools and platforms , 2014, AI & SOCIETY.

[8]  Michiaki Tatsubori,et al.  Social web in disaster archives , 2012, WWW.

[9]  Tim Hess,et al.  Implementing precision irrigation in a humid climate – Recent experiences and on-going challenges , 2015 .

[10]  Hila Becker,et al.  Beyond Trending Topics: Real-World Event Identification on Twitter , 2011, ICWSM.

[11]  Feng Li,et al.  Listen to me - Evaluating the influence of micro-blogs , 2014, Decis. Support Syst..

[12]  Alexander Zipf,et al.  An Advanced Systematic Literature Review on Spatiotemporal Analyses of Twitter Data , 2015, Trans. GIS.

[13]  Chris Moore,et al.  Sharing music files: Tactics of a challenge to the industry , 2010, First Monday.

[14]  Bertrand De Longueville,et al.  "OMG, from here, I can see the flames!": a use case of mining location based social networks to acquire spatio-temporal data on forest fires , 2009, LBSN '09.

[15]  Scott A. Longwell,et al.  TWITTER AND DISASTERS , 2013 .

[16]  Sebastián A. Ríos,et al.  An automatic apnea screening algorithm for children , 2016, Expert Syst. Appl..

[17]  Heidi Kreibich,et al.  Social media as an information source for rapid flood inundation mapping , 2015 .

[18]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[19]  Yiming Yang,et al.  Learning approaches for detecting and tracking news events , 1999, IEEE Intell. Syst..

[20]  David A Asch,et al.  Decoding twitter: Surveillance and trends for cardiac arrest and resuscitation communication. , 2013, Resuscitation.

[21]  Serkan Günal,et al.  The impact of preprocessing on text classification , 2014, Inf. Process. Manag..

[22]  Nong Ye,et al.  Naïve Bayes Classifier , 2013 .

[23]  Rui Li,et al.  TEDAS: A Twitter-based Event Detection and Analysis System , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[24]  Jie Yin,et al.  Using Social Media to Enhance Emergency Situation Awareness , 2012, IEEE Intelligent Systems.

[25]  Jennifer Duke,et al.  Methodological considerations in analyzing Twitter data. , 2013, Journal of the National Cancer Institute. Monographs.

[26]  Kamal Kant Bharadwaj,et al.  Exploring graph-based global similarity estimates for quality recommendations , 2014, Int. J. Comput. Sci. Eng..

[27]  Omaima Almatrafi,et al.  Application of location-based sentiment analysis using Twitter for identifying trends towards Indian general elections 2014 , 2015, IMCOM.

[28]  Matthew A. North,et al.  A Method for Implementing a Statistically Significant Number of Data Classes in the Jenks Algorithm , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[29]  Shirley Williams,et al.  What do people study when they study Twitter? Classifying Twitter related academic papers , 2013, J. Documentation.

[30]  Letha H. Etzkorn,et al.  Towards a semantic-based approach for software reusable component classification and retrieval , 2004, ACM-SE 42.

[31]  S. Winter,et al.  Towards credibility of micro-blogs: characterising witness accounts , 2015 .

[32]  Ponnurangam Kumaraguru,et al.  Credibility ranking of tweets during high impact events , 2012, PSOSM '12.

[33]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[34]  R. V. Krejcie,et al.  Determining Sample Size for Research Activities , 1970 .

[35]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[36]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[37]  George F. Jenks,et al.  ERROR ON CHOROPLETHIC MAPS: DEFINITION, MEASUREMENT, REDUCTION , 1971 .

[38]  Daniel Gayo-Avello,et al.  Nepotistic relationships in Twitter and their impact on rank prestige algorithms , 2010, Inf. Process. Manag..

[39]  Geert-Jan Houben,et al.  Semantics + filtering + search = twitcident. exploring information in social web streams , 2012, HT '12.

[40]  Fernando Diaz,et al.  Extracting information nuggets from disaster- Related messages in social media , 2013, ISCRAM.