Generating textual storyline to improve situation awareness in disaster management

Hurricane Sandy affected the east coast of U.S. in 2012 and posed immense threats to businesses, human lives and properties. In order to minimize the consequent loss of a catastrophe like this, a critical task in disaster management is to understand situation updates about the disaster from a large number of disaster-related documents, and obtain a big picture of the disaster's trends and how it affects different areas. In this paper, we present a two-layer storyline generation framework which generates an overall or a global storyline of the disaster events in the first layer, and provides condensed information about specific regions affected by the disaster (i.e., a location-specific storyline) in the second layer. To generate the overall storyline of a disaster, we consider both temporal and spatial factors, which are encoded using integer linear programming. While for location-specific storylines, we employ a Steiner tree based method. Compared with the previous work of storyline generation, which generates flat storylines without considering spatial information, our framework is more suitable for large-scale disaster events. We further demonstrate the efficacy of our proposed framework through the evaluation on the datasets of three major hurricane disasters.

[1]  Chris H. Q. Ding,et al.  Integrating Clustering and Multi-Document Summarization by Bi-Mixture Probabilistic Latent Semantic Analysis (PLSA) with Sentence Bases , 2011, AAAI.

[2]  Ani Nenkova,et al.  Automatic Summarization , 2011, ACL.

[3]  Sudipto Guha,et al.  Approximation algorithms for directed Steiner problems , 1999, SODA '98.

[4]  Angel X. Chang,et al.  SUTime: A library for recognizing and normalizing time expressions , 2012, LREC.

[5]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[6]  Tao Li,et al.  Multi-document summarization via submodularity , 2012, Applied Intelligence.

[7]  Dragomir R. Radev,et al.  Introduction to the Special Issue on Summarization , 2002, CL.

[8]  David S. Johnson,et al.  Approximation algorithms for combinatorial problems , 1973, STOC.

[9]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[10]  James Allan,et al.  Topic detection and tracking: event-based information organization , 2002 .

[11]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies , 2000, ArXiv.

[12]  Chris H. Q. Ding,et al.  Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization , 2008, SIGIR '08.

[13]  Tao Li,et al.  Natural event summarization , 2011, CIKM '11.

[14]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[15]  Furu Wei,et al.  Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization , 2008, SIGIR '08.

[16]  James Allan,et al.  Relevance models for topic detection and tracking , 2002 .

[17]  Dafna Shahaf,et al.  Trains of thought: generating information maps , 2012, WWW.

[18]  Tao Li,et al.  Generating Pictorial Storylines Via Minimum-Weight Connected Dominating Set Approximation in Multi-View Graphs , 2012, AAAI.

[19]  Dragomir R. Radev,et al.  LexPageRank: Prestige in Multi-Document Text Summarization , 2004, EMNLP.

[20]  Liang Tang,et al.  Data Mining Meets the Needs of Disaster Information Management , 2013, IEEE Transactions on Human-Machine Systems.

[21]  Mark T. Maybury,et al.  Automatic Summarization , 2002, Computational Linguistics.

[22]  Kalina Bontcheva,et al.  Robust Generic and Query-based Summarization , 2003, EACL.

[23]  Tao Li,et al.  Ontology-enriched multi-document summarization in disaster management , 2010, SIGIR.

[24]  Ricardo Baeza-Yates,et al.  Effectiveness of Temporal Snippets , 2009 .

[25]  Tao Li,et al.  Multi-Document Summarization via the Minimum Dominating Set , 2010, COLING.

[26]  Tao Li,et al.  An Empirical Study of Ontology-Based Multi-Document Summarization in Disaster Management , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[27]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[28]  Helena Ahonen-Myka,et al.  Simple Semantics in Topic Detection and Tracking , 2004, Information Retrieval.

[29]  Chen Lin,et al.  Generating event storylines from microblogs , 2012, CIKM.