TAQE: Tweet Retrieval-Based Infrastructure Damage Assessment During Disasters

Twitter is an active communication channel for the spreading of updated information in emergency situations. Retrieving specific information related to infrastructure damage offers the situational views to the concerned authorities, who can take necessary action to disburse help. However, such usages of Twitter demand significant accuracy of the retrieved information. Previous techniques on IR have not been able to capture the semantic variations satisfactorily in the tweets, due to low content quality and vocabulary gap, and consequently have failed to yield considerable performance. This has left ample scope for further improvement in this area of research. There are two major contributions of our work: 1) developing a relevant tweet retrieval framework that provides information about infrastructure damage and 2) assignment of a relative damage score to the affected regions so that the severity of the damage can be assessed. Our proposed technique involves a novel split-query-based mechanism with topic aligned query expansion (TAQE) to retrieve relevant tweets that are subsequently used for measuring the infrastructure damage across different locations. We report empirical results on multiple-crisis-related data sets to establish the efficacy of our approach to these events at different locations. Empirical validation of our proposed approach on manually annotated ground-truth data reveals considerably better performance metrics in terms of precision, recall, Bpref, and MAP over several state-of-the-art techniques.

[1]  Kripabandhu Ghosh,et al.  Extracting Resource Needs and Availabilities From Microblogs for Aiding Post-Disaster Relief Operations , 2019, IEEE Transactions on Computational Social Systems.

[2]  Jinglei Zhao,et al.  A proximity language model for information retrieval , 2009, SIGIR.

[3]  Hassan Sajjad,et al.  Robust Classification of Crisis-Related Data on Social Networks Using Convolutional Neural Networks , 2017, ICWSM.

[4]  Kripabandhu Ghosh,et al.  Automatic Matching of Resource Needs and Availabilities in Microblogs for Post-Disaster Relief , 2018, WWW.

[5]  Irina P. Temnikova,et al.  EMTerms 1.0: A Terminological Resource for Crisis Tweets , 2015, ISCRAM.

[6]  Muhammad Imran,et al.  Classifying and Summarizing Information from Microblogs During Epidemics , 2018, Information Systems Frontiers.

[7]  Ross Maciejewski,et al.  Understanding Twitter data with TweetXplorer , 2013, KDD.

[8]  Arkaitz Zubiaga,et al.  All-in-one: Multi-task Learning for Rumour Verification , 2018, COLING.

[9]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[10]  Firoj Alam,et al.  Domain Adaptation with Adversarial Training and Graph Embeddings , 2018, ACL.

[11]  Reza Zafarani,et al.  Whom should I follow?: identifying relevant users during crises , 2013, HT.

[12]  Carlos Castillo,et al.  What to Expect When the Unexpected Happens: Social Media Communications Across Crises , 2015, CSCW.

[13]  Qingpeng Zhang,et al.  Information Diffusion on Social Media During Natural Disasters , 2018, IEEE Transactions on Computational Social Systems.

[14]  Reza Zafarani,et al.  Real-Time Crisis Mapping Using Language Distribution , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[15]  Hassan Sajjad,et al.  Rapid Classification of Crisis-Related Data on Social Networks using Convolutional Neural Networks , 2016, ICWSM 2016.

[16]  Girish Keshav Palshikar,et al.  Weakly Supervised and Online Learning of Word Models for Classification to Detect Disaster Reporting Tweets , 2018, Information Systems Frontiers.

[17]  Marie-Francine Moens,et al.  WWW'18 Workshop on Exploitation of Social Media for Emergency Relief and Preparedness: Chairs' Welcome & Organization , 2018, WWW.

[18]  S. Sitharama Iyengar,et al.  Data-Driven Techniques in Disaster Information Management , 2017, ACM Comput. Surv..

[19]  Xiao Zhang,et al.  SensePlace2: GeoTwitter analytics support for situational awareness , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).

[20]  Shalini Priya,et al.  Identifying Infrastructure Damage during Earthquake using Deep Active Learning , 2019, 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[21]  Kazuhiro Seki,et al.  Improving pseudo-relevance feedback via tweet selection , 2013, CIKM.

[22]  Yusheng Ji,et al.  Intelligent Disaster Response via Social Media Analysis A Survey , 2017, SKDD.

[23]  Prasenjit Majumder,et al.  Information Extraction from Microblog for Disaster Related Event , 2017, SMERP@ECIR.

[24]  L. Thapa,et al.  SPATIAL-TEMPORAL ANALYSIS OF SOCIAL MEDIA DATA RELATED TO NEPAL EARTHQUAKE 2015 , 2016 .

[25]  Fabrício Benevenuto,et al.  A Benchmark Comparison of State-of-the-Practice Sentiment Analysis Methods , 2015, ArXiv.

[26]  Barbara Poblete,et al.  Nowcasting earthquake damages with Twitter , 2019, EPJ Data Science.

[27]  Birgit Kirsch,et al.  E2mC: Improving Emergency Management Service Practice through Social Media and Crowdsourcing Analysis in Near Real Time , 2017, Sensors.

[28]  Maurizio Tesconi,et al.  Impromptu Crisis Mapping to Prioritize Emergency Response , 2016, Computer.

[29]  Liang Yang,et al.  Improving Pseudo-Relevance Feedback With Neural Network-Based Word Representations , 2018, IEEE Access.

[30]  Kripabandhu Ghosh,et al.  Microblog Retrieval for Post-Disaster Relief: Applying and Comparing Neural IR Models , 2017, ArXiv.

[31]  Huan Liu,et al.  A behavior analytics approach to identifying tweets from crisis regions , 2014, HT.

[32]  Craig MacDonald,et al.  Regional Sentiment Bias in Social Media Reporting During Crises , 2018, Inf. Syst. Frontiers.

[33]  Firoj Alam,et al.  CrisisMMD: Multimodal Twitter Datasets from Natural Disasters , 2018, ICWSM.

[34]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[35]  Fernando Diaz,et al.  Emergency-relief coordination on social media: Automatically matching resource requests and offers , 2013, First Monday.

[36]  Quan Z. Sheng,et al.  SNAF: Observation filtering and location inference for event monitoring on twitter , 2018, World Wide Web.

[37]  Carlos Castillo,et al.  AIDR: artificial intelligence for disaster response , 2014, WWW.

[38]  W. Bruce Croft,et al.  A Language Modeling Approach to Information Retrieval , 1998, SIGIR Forum.

[39]  J. Fowler,et al.  Rapid assessment of disaster damage using social media activity , 2016, Science Advances.

[40]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[41]  Iryna Gurevych,et al.  Cross-Genre and Cross-Domain Detection of Semantic Uncertainty , 2012, CL.

[42]  Vincze Veronika,et al.  Uncertainty Detection in Natural Language Texts , 2015 .

[43]  Davide Buscaldi,et al.  Sentiment Analysis on Microblogs for Natural Disasters Management: a Study on the 2014 Genoa Floodings , 2015, WWW.

[44]  Maurizio Tesconi,et al.  CrisMap: a Big Data Crisis Mapping System Based on Damage Detection and Geoparsing , 2018, Information Systems Frontiers.

[45]  Stuart E. Middleton,et al.  Real-Time Crisis Mapping of Natural Disasters Using Social Media , 2014, IEEE Intelligent Systems.

[46]  István Hegedüs,et al.  Research Paper: Semi-automated Construction of Decision Rules to Predict Morbidities from Clinical Texts , 2009, J. Am. Medical Informatics Assoc..

[47]  Joydeep Chandra,et al.  Characterizing Infrastructure Damage After Earthquake: A Split-Query Based IR Approach , 2018, 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[48]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[49]  Fernando Diaz,et al.  Processing Social Media Messages in Mass Emergency: Survey Summary , 2018, WWW.

[50]  Yutaka Matsuo,et al.  Tweet Analysis for Real-Time Event Detection and Earthquake Reporting System Development , 2013, IEEE Transactions on Knowledge and Data Engineering.

[51]  Joydeep Chandra,et al.  Where should one get news updates: Twitter or Reddit , 2019, Online Soc. Networks Media.

[52]  Haim Levkowitz,et al.  Introduction to information retrieval (IR) , 2008 .

[53]  Muhammad Imran,et al.  Summarizing Situational and Topical Information During Crises , 2016, ArXiv.

[54]  Somprakash Bandyopadhyay,et al.  Identifying Post-Disaster Resource Needs and Availabilities from Microblogs , 2017, 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).