Summarizing Situational Tweets in Crisis Scenarios: An Extractive-Abstractive Approach

Microblogging platforms such as Twitter are widely used by eyewitnesses and affected people to post situational updates during mass convergence events such as natural and man-made disasters. These crisis-related messages disperse among multiple classes/categories such as infrastructure damage, shelter needs, information about missing, injured, and dead people. Moreover, we observe that sometimes people post information about their missing relatives and friends with personal details such as names and last seen location. The information requirements of different stakeholders (government, NGOs, and rescue workers) also vary a lot. This brings twofold challenges: 1) extracting important high-level situational updates from these messages, assigning them appropriate categories, and finally summarizing big trove of information in each category and 2) extracting small-scale time-critical sparse updates related to missing or trapped people. In this article, we propose a classification-summarization framework which first assigns tweets into different situational classes and then summarizes those tweets. In the summarization phase, we propose a two-step extractive-abstractive summarization framework. In the first step, it extracts a set of important tweets from the whole set of information, develops a bigram-based word-graph from those tweets, and generates paths by traversing the word-graph. Next, it uses an optimization technique based on integer linear programming (ILP) to select the most important tweets and paths based on different optimization parameters such as informativeness and coverage of content words. Apart from general classwise summarization, we also show the customization of our summarization model to address time-critical sparse information needs (e.g., missing relatives). Our proposed method is time- and memory-efficient and shows better performance than state-of-the-art methods in terms of both quantitative and qualitative judgment.

[1]  Zhenhua Wang,et al.  Sumblr: continuous summarization of evolving tweet streams , 2013, SIGIR.

[2]  Niloy Ganguly,et al.  Extracting and Summarizing Situational Information from the Twitter Social Media during Disasters , 2018, ACM Trans. Web.

[3]  Qingpeng Zhang,et al.  Information Diffusion on Social Media During Natural Disasters , 2018, IEEE Transactions on Computational Social Systems.

[4]  Fernando Diaz,et al.  Predicting Salient Updates for Disaster Summarization , 2015, ACL.

[5]  Noah A. Smith,et al.  A Dependency Parser for Tweets , 2014, EMNLP.

[6]  Roshni Chakraborty,et al.  Tweet Summarization of News Articles: An Objective Ordering-Based Perspective , 2019, IEEE Transactions on Computational Social Systems.

[7]  Andrei Olariu Efficient Online Summarization of Microblogging Streams , 2014, EACL.

[8]  Diego López-de-Ipiña,et al.  Detection and Extracting of Emergency Knowledge from Twitter Streams , 2012, UCAmI.

[9]  Katja Filippova,et al.  Multi-Sentence Compression: Finding Shortest Paths in Word Graphs , 2010, COLING.

[10]  Kripabandhu Ghosh,et al.  Extracting Resource Needs and Availabilities From Microblogs for Aiding Post-Disaster Relief Operations , 2019, IEEE Transactions on Computational Social Systems.

[11]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[12]  Leysia Palen,et al.  Natural Language Processing to the Rescue? Extracting "Situational Awareness" Tweets During Mass Emergency , 2011, ICWSM.

[13]  Sarah Vieweg,et al.  Processing Social Media Messages in Mass Emergency , 2014, ACM Comput. Surv..

[14]  Sune Lehmann,et al.  Tweetin' in the Rain: Exploring Societal-Scale Effects of Weather on Mood , 2012, ICWSM.

[15]  Wei Xu,et al.  A Preliminary Study of Tweet Summarization using Information Extraction , 2013 .

[16]  Minh-Tien Nguyen,et al.  TSum4act: A Framework for Retrieving and Summarizing Actionable Tweets During a Disaster for Reaction , 2015, PAKDD.

[17]  Geert-Jan Houben,et al.  Groundhog day: near-duplicate detection on Twitter , 2013, WWW.

[18]  Niloy Ganguly,et al.  Characterizing communal microblogs during disaster events , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[19]  Niloy Ganguly,et al.  Extracting Situational Information from Microblogs during Disaster Events: a Classification-Summarization Approach , 2015, CIKM.

[20]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[21]  Marilyn A. Walker,et al.  SPoT: A Trainable Sentence Planner , 2001, NAACL.

[22]  Ashish Sharma,et al.  Characterizing and Countering Communal Microblogs During Disaster Events , 2018, IEEE Transactions on Computational Social Systems.

[23]  Dan Klein,et al.  Named Entity Recognition with Character-Level Models , 2003, CoNLL.

[24]  Gurpreet Singh Lehal,et al.  A Survey of Text Summarization Extractive Techniques , 2010 .

[25]  C. Castillo,et al.  Big Crisis Data: Social Media in Disasters and Time-Critical Situations , 2019 .

[26]  Prasenjit Mitra,et al.  Multi-Document Abstractive Summarization Using ILP Based Multi-Sentence Compression , 2015, IJCAI.

[27]  Carlos Castillo,et al.  AIDR: artificial intelligence for disaster response , 2014, WWW.

[28]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[29]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[30]  Bowen Zhou,et al.  SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents , 2016, AAAI.

[31]  Tiejun Zhao,et al.  Neural Document Summarization by Jointly Learning to Score and Select Sentences , 2018, ACL.

[32]  Muhammad Imran,et al.  Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages , 2016, LREC.

[33]  U. Ajay,et al.  On Summarization and Timeline Generation for Evolutionary Tweet Streams , 2016 .

[34]  Alireza Louni,et al.  Who Spread That Rumor: Finding the Source of Information in Large Online Social Networks With Probabilistically Varying Internode Relationship Strengths , 2018, IEEE Transactions on Computational Social Systems.

[35]  Muhammad Imran,et al.  Identifying Sub-events and Summarizing Disaster-Related Information from Microblogs , 2018, SIGIR.

[36]  Muhammad Imran,et al.  Integrating Social Media Communications into the Rapid Assessment of Sudden Onset Disasters , 2014, SocInfo.

[37]  Huiji Gao,et al.  Harnessing the Crowdsourcing Power of Social Media for Disaster Relief , 2011, IEEE Intelligent Systems.

[38]  Iadh Ounis,et al.  Real-Time Detection, Tracking, and Monitoring of Automatically Discovered Events in Social Media , 2014, ACL.

[39]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[40]  Muhammad Imran,et al.  Summarizing Situational Tweets in Crisis Scenario , 2016, HT.