Generating timeline summaries with social media attention

Timeline generation is an important research task that helps users quickly understand the overall evolution of a given topic. Previous methods simply split the time span into fixed, equal intervals without studying the role that the evolutionary patterns of the underlying topic play in timeline generation. In addition, few of these methods take users’ collective interests into consideration when generating timelines. We propose using social media attention to address these two problems, for two reasons: 1) social media is an important pool of real users’ collective interests; 2) the information cascades generated in it might be good indicators of the boundaries of topic phases. Using Twitter as a basis, we incorporate topic phases and users’ collective interests, both learnt from social media, into a unified timeline generation algorithm. We construct one informativeness-oriented and three interestingness-oriented evaluation sets over five topics. We demonstrate that the proposed approach is effective at generating timelines that are both informative and interesting. In addition, our idea naturally leads to a novel presentation of timelines, i.e., phase-based timelines, which can potentially improve user experience.
