Predicting future popularity trend of events in microblogging platforms

The fast information sharing on Twitter from millions of users all over the world leads to almost real-time reporting of events. It is extremely important for business and administrative decision makers to learn events’ popularity as quickly as possible, as it can buy extra precious time for them to make informed decisions. Therefore, we introduce the problem of predicting future popularity trend of events on microblogging platforms. Traditionally, trend prediction has been performed by using time series analysis of past popularity to forecast the future popularity changes. As we can encode the rich Twitter dynamics using a rich variety of features from microblogging data, we explore regression, classification and hybrid approaches, using a large set of popularity, social and event features, to predict event popularity. Experimental results on two real datasets of 18382 events extracted from ~133 million tweets show the effectiveness of the extracted features and learning approaches. The predicted popularity trend of events can be directly used for a variety of applications including recommendation systems, ad keywords bidding price decisions, stock trading decisions, dynamic ticket pricing for sports events, etc.

[1]  James Allan,et al.  Automatic generation of overview timelines , 2000, SIGIR '00.

[2]  Jugal K. Kalita,et al.  Summarizing Microblogs Automatically , 2010, NAACL.

[3]  Bo Zhao,et al.  PET: a statistical model for popular events tracking in social communities , 2010, KDD.

[4]  Miles Osborne,et al.  RT to Win! Predicting Message Propagation in Twitter , 2011, ICWSM.

[5]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[6]  Tad Hogg,et al.  Using a model of social dynamics to predict popularity of news , 2010, WWW '10.

[7]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[8]  Leysia Palen,et al.  Chatter on the red: what hazards threat reveals about the social life of microblogged information , 2010, CSCW '10.

[9]  Nick Koudas,et al.  TwitterMonitor: trend detection over the twitter stream , 2010, SIGMOD Conference.

[10]  Matthew Hurst,et al.  Event Detection and Tracking in Social Streams , 2009, ICWSM.

[11]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[12]  Brian D. Davison,et al.  Predicting popular messages in Twitter , 2011, WWW.

[13]  Nello Cristianini,et al.  Flu Detector - Tracking Epidemics on Twitter , 2010, ECML/PKDD.

[14]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[15]  Scott Counts,et al.  Predicting the Speed, Scale, and Range of Information Diffusion in Twitter , 2010, ICWSM.

[16]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[17]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[18]  Ralf Herbrich,et al.  Predicting Information Spreading in Twitter , 2010 .

[19]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[20]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.