Event detection and popularity prediction in microblogging

Abstract As one of the most influential social media platforms, microblogging is becoming increasingly popular in the last decades. Each day a large amount of events appear and spread in microblogging. The spreading of events and corresponding comments on them can greatly influence the public opinion. It is practical important to discover new emerging events in microblogging and predict their future popularity. Traditional event detection and information diffusion models cannot effectively handle our studied problem, because most existing methods focus only on event detection but ignore to predict their future trend. In this paper, we propose a new approach to detect burst novel events and predict their future popularity simultaneously. Specifically, we first detect events from online microblogging stream by utilizing multiple types of information, i.e., term frequency, and user׳s social relation. Meanwhile, the popularity of detected event is predicted through a proposed diffusion model which takes both the content and user information of the event into account. Extensive evaluations on two real-world datasets demonstrate the effectiveness of our approach on both event detection and their popularity prediction.

[1]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[2]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[4]  D. Watts,et al.  A generalized model of social and biological contagion. , 2005, Journal of theoretical biology.

[5]  Philip S. Yu,et al.  Parameter Free Bursty Events Detection in Text Streams , 2005, VLDB.

[6]  Susan T. Dumais,et al.  Characterizing Microblogs with Topic Models , 2010, ICWSM.

[7]  Jure Leskovec,et al.  Modeling Information Diffusion in Implicit Networks , 2010, 2010 IEEE International Conference on Data Mining.

[8]  Ee-Peng Lim,et al.  Analyzing feature trajectories for event detection , 2007, SIGIR.

[9]  Ari Rappoport,et al.  What's in a hashtag?: content based prediction of the spread of ideas in microblogging communities , 2012, WSDM '12.

[10]  Michael Gertz,et al.  EvenTweet: Online Localized Event Detection from Twitter , 2013, Proc. VLDB Endow..

[11]  Masahiro Kimura,et al.  Selecting Information Diffusion Models over Social Networks for Behavioral Analysis , 2010, ECML/PKDD.

[12]  SzaboGabor,et al.  Predicting the popularity of online content , 2010 .

[13]  Chenliang Li,et al.  Twevent: segment-based event detection from tweets , 2012, CIKM.

[14]  Yi-Shin Chen,et al.  Event identification for social streams using keyword-based evolving graph sequences , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[15]  Prasenjit Mitra,et al.  Event detection with spatial latent Dirichlet allocation , 2011, JCDL '11.

[16]  Masahiro Kimura,et al.  Learning Information Diffusion Model for Extracting Influential Nodes in a Social Network , 2010 .

[17]  John Yen,et al.  Probabilistic Community Discovery Using Hierarchical Latent Gaussian Mixture Model , 2007, AAAI.

[18]  Charles L. A. Clarke,et al.  Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval , 2007, SIGIR 2007.

[19]  Miles Osborne,et al.  RT to Win! Predicting Message Propagation in Twitter , 2011, ICWSM.

[20]  Ling Chen,et al.  Event detection from flickr data through wavelet-based spatial analysis , 2009, CIKM.

[21]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[22]  Mario Cataldi,et al.  Emerging topic detection on Twitter based on temporal and social terms evaluation , 2010, MDMKDD '10.

[23]  Miles Osborne,et al.  Streaming First Story Detection with application to Twitter , 2010, NAACL.

[24]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[25]  Christos Faloutsos,et al.  Patterns of Cascading Behavior in Large Blog Graphs , 2007, SDM.

[26]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[27]  N. Meade,et al.  Modelling and forecasting the diffusion of innovation – A 25-year review , 2006 .

[28]  Haewoon Kwak,et al.  Finding influentials based on the temporal order of information adoption in twitter , 2010, WWW '10.

[29]  Jiawei Han,et al.  The Joint Inference of Topic Diffusion and Evolution in Social Communities , 2011, 2011 IEEE 11th International Conference on Data Mining.

[30]  Susan T. Dumais,et al.  Mark my words!: linguistic style accommodation in social media , 2011, WWW.

[31]  Junjie Yao,et al.  Temporal and Social Context Based Burst Detection from Folksonomies , 2010, AAAI.

[32]  Vahab S. Mirrokni,et al.  Optimal marketing strategies over social networks , 2008, WWW.

[33]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[34]  A. Banerjee,et al.  Social Topic Models for Community Extraction , 2008 .

[35]  Gautam Shroff,et al.  Catching the Long-Tail: Extracting Local News Events from Twitter , 2012, ICWSM.

[36]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[37]  Ee-Peng Lim,et al.  Who is Retweeting the Tweeters? Modeling, Originating, and Promoting Behaviors in the Twitter Network , 2012, TMIS.

[38]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[39]  Mounia Lalmas,et al.  A survey on the use of relevance feedback for information access systems , 2003, The Knowledge Engineering Review.

[40]  Hector Garcia-Molina,et al.  Overview of multidatabase transaction management , 2005, The VLDB Journal.

[41]  Masahiro Kimura,et al.  Minimizing the Spread of Contamination by Blocking Links in a Network , 2008, AAAI.

[42]  Jiebo Luo,et al.  The wisdom of social multimedia: using flickr for prediction and forecast , 2010, ACM Multimedia.

[43]  Thomas F. Coleman,et al.  A Reflective Newton Method for Minimizing a Quadratic Function Subject to Bounds on Some of the Variables , 1992, SIAM J. Optim..

[44]  Gao Cong,et al.  On predicting the popularity of newly emerging hashtags in Twitter , 2013, J. Assoc. Inf. Sci. Technol..

[45]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[46]  Hengchang Liu,et al.  Spatio-temporal Spread of Events in Social Networks: A Gas Shortage Case Study , 2013, MILCOM 2013 - 2013 IEEE Military Communications Conference.

[47]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.