dTexSL: A dynamic disaster textual storyline generating framework

Effectively capturing the status information and improving situational awareness is the most important task in disaster information management. Due to the rapid increase of online information, this task becomes very challenging. Existing information retrieval and text summarization methods can solve information overload problem to some extent, however, they suffer from some limitations: lacking theme structure, ignoring spatial information, and unable to update information on the real time events. In this paper, we propose a dynamic disaster storyline generation framework, which generates a global storyline describing the evolution of the disaster events in the high-level layer and provides condensed information about specific regions affected by the disaster in the local-level layer. The proposed framework considers both uniqueness and relevance for representative document selection, uses Maximal Marginal Relevance to generate summaries from each local document set, and utilizes dynamic Steiner tree to implement the information update. Comprehensive experiments on typhoons data sets demonstrate the effectiveness of the proposed methods in each level and the overall framework.

[1]  Makoto Imase,et al.  Dynamic Steiner Tree Problem , 1991, SIAM J. Discret. Math..

[2]  Yan Zhang,et al.  Evolutionary timeline summarization: a balanced optimization framework via iterative substitution , 2011, SIGIR.

[3]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[4]  Tao Li,et al.  Ontology-enriched multi-document summarization in disaster management , 2010, SIGIR.

[5]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies , 2000, ArXiv.

[6]  Chen Lin,et al.  Generating event storylines from microblogs , 2012, CIKM.

[7]  Tao Li,et al.  An improved textual storyline generating framework for disaster information management , 2017, 2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE).

[8]  Arunima Jaiswal,et al.  Trends in Extractive and Abstractive Techniques in Text Summarization , 2015 .

[9]  Tao Li,et al.  Generating Pictorial Storylines Via Minimum-Weight Connected Dominating Set Approximation in Multi-View Graphs , 2012, AAAI.

[10]  Sudipto Guha,et al.  Approximation algorithms for directed Steiner problems , 1999, SODA '98.

[11]  Xiuzhen Zhang,et al.  A probabilistic method for emerging topic tracking in Microblog stream , 2016, World Wide Web.

[12]  Min Chen,et al.  Modeling Methodology for Component Reuse and System Integration for Hurricane Loss Projection Application , 2006, 2006 IEEE International Conference on Information Reuse & Integration.

[13]  James Allan,et al.  Text classification and named entities for new event detection , 2004, SIGIR '04.

[14]  Jackie Ck Cheung Comparing Abstractive and Extractive Summarization of Evaluative Text: Controversiality and Content Selection , 2008 .

[15]  T. V. Geetha,et al.  Abstractive Summarization: A Hybrid Approach for the Compression of Semantic Graphs , 2016, Int. J. Semantic Web Inf. Syst..

[16]  Wang Bing-Hong,et al.  Node importance measurement based on the degree and clustering coefficient information , 2013 .

[17]  Wubai Zhou,et al.  Generating textual storyline to improve situation awareness in disaster management , 2014, Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014).

[18]  Tao Li,et al.  An Empirical Study of Ontology-Based Multi-Document Summarization in Disaster Management , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[19]  ChengXiang Zhai,et al.  Discovering evolutionary theme patterns from text: an exploration of temporal text mining , 2005, KDD '05.

[20]  Dragomir R. Radev,et al.  LexPageRank: Prestige in Multi-Document Text Summarization , 2004, EMNLP.