Tracking Disaster Footprints with Social Streaming Data

Social media has become an indispensable tool in the face of natural disasters due to its broad appeal and ability to quickly disseminate information. For instance, Twitter is an important source for disaster responders to search for (1) topics that have been identified as being of particular interest over time, i.e., common topics such as “disaster rescue”; (2) new emerging themes of disaster-related discussions that are fast gathering in social media streams (Saha and Sindhwani 2012), i.e., distinct topics such as “the latest tsunami destruction”. To understand the status quo and allocate limited resources to most urgent areas, emergency managers need to quickly sift through relevant topics generated over time and investigate their commonness and distinctiveness. A major obstacle to the effective usage of social media, however, is its massive amount of noisy and undesired data. Hence, a naive method, such as set intersection/difference to find common/distinct topics, is often not practical. To address this challenge, this paper studies a new topic tracking problem that seeks to effectively identify the common and distinct topics with social streaming data. The problem is important as it presents a promising new way to efficiently search for accurate information during emergency response. This is achieved by an online Nonnegative Matrix Factorization (NMF) scheme that conducts a faster update of latent factors, and a joint NMF technique that seeks the balance between the reconstruction error of topic identification and the losses induced by discovering common and distinct topics. Extensive experimental results on real-world datasets collected during Hurricane Harvey and Florence reveal the effectiveness of our framework.

[1]  Vikas Sindhwani,et al.  Learning evolving and emerging topics in social media: a dynamic nmf approach with temporal regularization , 2012, WSDM '12.

[2]  Francis R. Bach,et al.  Online Learning for Latent Dirichlet Allocation , 2010, NIPS.

[3]  Jure Leskovec,et al.  QUOTUS: The Structure of Political Media Coverage as Revealed by Quoting Patterns , 2015, WWW.

[4]  Anushree Dave,et al.  Digital Humanitarians: How Big Data Is Changing the Face of Humanitarian Response , 2017, Journal of Bioethical Inquiry.

[5]  Hui Zhang,et al.  Experimental explorations on short text topic mining between LDA and NMF based Schemes , 2019, Knowl. Based Syst..

[6]  Huiji Gao,et al.  Harnessing the Crowdsourcing Power of Social Media for Disaster Relief , 2011, IEEE Intelligent Systems.

[7]  Vincent Yan Fu Tan,et al.  Online Nonnegative Matrix Factorization with General Divergences , 2016, AISTATS.

[8]  Fei Wang,et al.  Efficient Document Clustering via Online Nonnegative Matrix Factorizations , 2011, SDM.

[9]  K. Selçuk Candan,et al.  GI-NMF: Group Incremental Non-Negative Matrix Factorization on Data Streams , 2014, CIKM.

[10]  J Brian Houston,et al.  Social media and disasters: a functional framework for social media use in disaster planning, response, and research. , 2015, Disasters.

[11]  Jaegul Choo,et al.  Short-Text Topic Modeling via Non-negative Matrix Factorization Enriched with Local Word-Context Correlations , 2018, WWW.

[12]  Jérôme Idier,et al.  Algorithms for Nonnegative Matrix Factorization with the β-Divergence , 2010, Neural Computation.

[13]  Qiang Yang,et al.  Detect and Track Latent Factors with Online Nonnegative Matrix Factorization , 2007, IJCAI.

[14]  Chaomei Chen,et al.  Dynamic topic detection and tracking: A comparison of HDP, C‐word, and cocitation methods , 2014, J. Assoc. Inf. Sci. Technol..

[15]  Mohammad Ali Abbasi,et al.  TweetTracker: An Analysis Tool for Humanitarian and Disaster Relief , 2011, ICWSM.

[16]  Ciro Cattuto,et al.  Dynamical classes of collective attention in twitter , 2011, WWW.

[17]  Ling Chen,et al.  Hierarchical online NMF for detecting and tracking topic hierarchies in a text stream , 2018, Pattern Recognit..

[18]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[19]  Jaegul Choo,et al.  Simultaneous Discovery of Common and Discriminative Topics via Joint Nonnegative Matrix Factorization , 2015, KDD.

[20]  Nick Koudas,et al.  TwitterMonitor: trend detection over the twitter stream , 2010, SIGMOD Conference.

[21]  K. Selçuk Candan,et al.  IMS-DTM: Incremental Multi-Scale Dynamic Topic Models , 2018, AAAI.

[22]  Yusheng Ji,et al.  Intelligent Disaster Response via Social Media Analysis A Survey , 2017, SKDD.

[23]  Gert R. G. Lanckriet,et al.  Leveraging Social Context for Modeling Topic Evolution , 2015, KDD.

[24]  Marco Saerens,et al.  A time-based collective factorization for topic discovery and monitoring in news , 2014, WWW.

[25]  Eugene Agichtein,et al.  TM-LDA: efficient online modeling of latent topic transitions in social media , 2012, KDD.

[26]  Chris H. Q. Ding,et al.  Orthogonal nonnegative matrix t-factorizations for clustering , 2006, KDD '06.