Predicting popularity via a generative model with adaptive peeking window

Abstract Predicting the popularity of online content is an important issue for uncovering rules governing collective human behaviors in networks. Also, content popularity prediction finds application in an array of areas. In this paper, we propose a generative approach using a Hawkes process, the purpose of which is to predict the eventual popularity of online content from the observed initial period of the information cascade. Our approach is distinguished from existing approaches via its ability to explore an adaptive peeking window and its added remarkable predictive power that was inherited from the feature-driven method. Further, we proposed a procedure for exploring an adaptive peeking window, which allows us to answer the question, “How can we obtain the most effective part of the history to make an accurate prediction?” And the added predictive layer bridges the gap between the generative method and the feature-driven method. Empirical studies on real world datasets demonstrate that the proposed method significantly outperformed the existing approaches.

[1]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[2]  Fei Wang,et al.  Uncovering and predicting the dynamic process of information cascades with survival model , 2017, Knowledge and Information Systems.

[3]  A. Hawkes,et al.  A cluster process representation of a self-exciting process , 1974, Journal of Applied Probability.

[4]  T. E. Harris,et al.  The Theory of Branching Processes. , 1963 .

[5]  Xiaohua Hu,et al.  Video Popularity Prediction by Sentiment Propagation via Implicit Network , 2015, CIKM.

[6]  Bernardo A. Huberman,et al.  The Pulse of News in Social Media: Forecasting Popularity , 2012, ICWSM.

[7]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[8]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[9]  Ke Xu,et al.  On popularity prediction of videos shared in online social networks , 2013, CIKM.

[10]  Jake M. Hofman,et al.  Prediction and explanation in social systems , 2017, Science.

[11]  Gleb Gusev,et al.  Prediction of retweet cascade size over time , 2012, CIKM.

[12]  Dan Cosley,et al.  Predictability of Popularity: Gaps between Prediction and Understanding , 2016, ICWSM.

[13]  Le Song,et al.  Learning Social Infectivity in Sparse Low-rank Networks Using Multi-dimensional Hawkes Processes , 2013, AISTATS.

[14]  Serge Fdida,et al.  A survey on predicting the popularity of web content , 2014, Journal of Internet Services and Applications.

[15]  Flavio Figueiredo,et al.  TrendLearner: Early prediction of popularity trends of user generated content , 2014, Inf. Sci..

[16]  Shuang-Hong Yang,et al.  Mixture of Mutually Exciting Processes for Viral Diffusion , 2013, ICML.

[17]  George E. Tita,et al.  Self-Exciting Point Process Modeling of Crime , 2011 .

[18]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[19]  Donald L. Snyder,et al.  Random Point Processes in Time and Space , 1991 .

[20]  Emily B. Fox,et al.  A Bayesian Approach for Predicting the Popularity of Tweets , 2013, ArXiv.

[21]  A. Vespignani,et al.  Competition among memes in a world with limited attention , 2012, Scientific Reports.

[22]  Didier Sornette,et al.  Robust dynamic classes revealed by measuring the response function of a social system , 2008, Proceedings of the National Academy of Sciences.

[23]  A. Hawkes Spectra of some self-exciting and mutually exciting point processes , 1971 .

[24]  Kavé Salamatian,et al.  Modeling and predicting the popularity of online contents with Cox proportional hazard regression model , 2012, Neurocomputing.

[25]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[26]  D. Marsan,et al.  Extending Earthquakes' Reach Through Cascading , 2008, Science.

[27]  Miles Osborne,et al.  RT to Win! Predicting Message Propagation in Twitter , 2011, ICWSM.

[28]  James Caverlee,et al.  Spatio-temporal meme prediction: learning what hashtags will be popular where , 2013, CIKM.

[29]  Jure Leskovec,et al.  Can cascades be predicted? , 2014, WWW.

[30]  Huzefa Rangwala,et al.  Digging Digg: Comment Mining, Popularity Prediction, and Social Network Analysis , 2009, 2009 International Conference on Web Information Systems and Mining.

[31]  Hui Xiong,et al.  Predicting the Popularity of Online Serials with Autoregressive Models , 2014, CIKM.