Going Beyond Content Richness: Verified Information Aware Summarization of Crisis-Related Microblogs

High-impact catastrophic events (bomb attacks, shootings) trigger posting of large volume of information on social media platforms such as Twitter. Recent works have proposed content-aware systems for summarizing this information, thereby facilitating post-disaster services. However, a significant proportion of the posted content is unverified, which restricts the practical usage of the existing summarization systems. In this paper, we work on the novel task of generating verified summaries of information posted on Twitter during disasters. We first jointly learn representations of content-classes and expression-classes of tweets posted during disasters using a novel LDA-based generative model. These representations of content & expression classes are used in conjunction with pre-disaster user behavior and temporal signals (replies) for training a Tree-LSTM based tweet-verification model. The model infers tweet verification probabilities which are used, besides information content of tweets, in an Integer Linear Programming (ILP) framework for generating the desired verified summaries. The summaries are fine-tuned using the class information of the tweets as obtained from the LDA-based generative model. Extensive experiments are performed on a publicly-available labeled dataset of man-made disasters which demonstrate the effectiveness of our tweet-verification (3-13% gain over baselines) and summarization (12-48% gain in verified content proportion, 8-13% gain in ROUGE-score over state-of-the-art) systems. We make implementations of our various modules available online.

[1]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[2]  Kyomin Jung,et al.  Prominent Features of Rumor Propagation in Online Social Media , 2013, 2013 IEEE 13th International Conference on Data Mining.

[3]  Carlos Castillo,et al.  AIDR: artificial intelligence for disaster response , 2014, WWW.

[4]  Anupam Joshi,et al.  Faking Sandy: characterizing and identifying fake images on Twitter during Hurricane Sandy , 2013, WWW.

[5]  Arkaitz Zubiaga,et al.  Crowdsourcing the Annotation of Rumourous Conversations in Social Media , 2015, WWW.

[6]  Kyumin Lee,et al.  The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News , 2018, SIGIR.

[7]  Jacob Ratkiewicz,et al.  Political Polarization on Twitter , 2011, ICWSM.

[8]  Minh-Tien Nguyen,et al.  TSum4act: A Framework for Retrieving and Summarizing Actionable Tweets During a Disaster for Reaction , 2015, PAKDD.

[9]  Ingmar Weber,et al.  Get Back! You Don't Know Me Like That: The Social Mediation of Fact Checking Interventions in Twitter Conversations , 2014, ICWSM.

[10]  Fernando Diaz,et al.  Predicting Salient Updates for Disaster Summarization , 2015, ACL.

[11]  Kate Starbird,et al.  A Closer Look at the Self-Correcting Crowd: Examining Corrections in Online Rumors , 2017, CSCW.

[12]  Li Zeng,et al.  #Unconfirmed: Classifying Rumor Stance in Crisis-Related Social Media Messages , 2016, ICWSM.

[13]  Muhammad Imran,et al.  Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages , 2016, LREC.

[14]  Sarah Vieweg,et al.  Processing Social Media Messages in Mass Emergency , 2014, ACM Comput. Surv..

[15]  Wei Gao,et al.  Detecting Rumors from Microblogs with Recurrent Neural Networks , 2016, IJCAI.

[16]  Huan Liu,et al.  Tracing Fake-News Footprints: Characterizing Social Media Messages by How They Propagate , 2018, WSDM.

[17]  James Allan,et al.  Explaining Controversy on Social Media via Stance Summarization , 2018, SIGIR.

[18]  Chen Huang,et al.  Microblogging after a major disaster in China: a case study of the 2010 Yushu earthquake , 2011, CSCW.

[19]  Justin Cheng,et al.  Rumor Cascades , 2014, ICWSM.

[20]  Sungyong Seo,et al.  CSI: A Hybrid Deep Model for Fake News Detection , 2017, CIKM.

[21]  C. Castillo,et al.  Big Crisis Data: Social Media in Disasters and Time-Critical Situations , 2019 .

[22]  Wei Gao,et al.  Detect Rumors Using Time Series of Social Context Information on Microblogging Websites , 2015, CIKM.

[23]  Niloy Ganguly,et al.  Extracting Situational Information from Microblogs during Disaster Events: a Classification-Summarization Approach , 2015, CIKM.

[24]  Christopher D. Manning,et al.  Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.

[25]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[26]  Muhammad Imran,et al.  Identifying Sub-events and Summarizing Disaster-Related Information from Microblogs , 2018, SIGIR.

[27]  Danushka Bollegala,et al.  Multi-tweet Summarization of Real-Time Events , 2013, 2013 International Conference on Social Computing.

[28]  Hongyan Liu,et al.  Detecting Event Rumors on Sina Weibo Automatically , 2013, APWeb.

[29]  Kate Starbird,et al.  Engage Early, Correct More: How Journalists Participate in False Rumors Online during Crisis Events , 2018, CHI.

[30]  Ee-Peng Lim,et al.  Finding Bursty Topics from Microblogs , 2012, ACL.

[31]  Qiaozhu Mei,et al.  Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts , 2015, WWW.

[32]  Pankaj K. Agarwal,et al.  Toward Computational Fact-Checking , 2014, Proc. VLDB Endow..

[33]  Jie Yin,et al.  Emergency situation awareness from twitter for crisis management , 2012, WWW.

[34]  Muhammad Imran,et al.  Summarizing Situational Tweets in Crisis Scenario , 2016, HT.

[35]  Wei Gao,et al.  Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning , 2017, ACL.

[36]  Andreas Vlachos,et al.  Identification and Verification of Simple Claims about Statistical Properties , 2015, EMNLP.

[37]  Graham Neubig,et al.  Safety Information Mining — What can NLP do in a disaster— , 2011, IJCNLP.

[38]  Deepayan Chakrabarti,et al.  Event Summarization Using Tweets , 2011, ICWSM.

[39]  Arkaitz Zubiaga,et al.  Exploiting Context for Rumour Detection in Social Media , 2017, SocInfo.

[40]  Li Zeng,et al.  Rumors at the Speed of Light? Modeling the Rate of Rumor Transmission During Crisis , 2016, 2016 49th Hawaii International Conference on System Sciences (HICSS).

[41]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[42]  Fan Yang,et al.  Automatic detection of rumor on Sina Weibo , 2012, MDS '12.

[43]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[44]  Kalina Bontcheva,et al.  TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text , 2013, RANLP.

[45]  Xiaomo Liu,et al.  Real-time Rumor Debunking on Twitter , 2015, CIKM.

[46]  Jun Zhang,et al.  Call Attention to Rumors: Deep Attention Based Recurrent Neural Networks for Early Rumor Detection , 2017, ArXiv.