LARM: A Lifetime Aware Regression Model for Predicting YouTube Video Popularity

Online content popularity prediction provides substantial value to a broad range of applications in the end-to-end social media systems, from network resource allocation to targeted advertising. While using historical popularity can predict the near-term popularity with a reasonable accuracy, the bursty nature of online content popularity evolution makes it difficult to capture the correlation between historical data and future data in the long term. Although various existing efforts have been made toward long-term prediction, they need to accumulate a long enough historical data before the prediction and their model assumptions cannot be applied to the complex YouTube networks with inherent unpredictability. In this paper, we aim to achieve fast prediction of long-term video popularity in the complex YouTube networks. We propose LARM, a lifetime aware regression model, representing the first work that leverages content lifetime to compensate the insufficiency of historical data without assumptions of network structure. The proposed LARM is empowered by a lifetime metric that is both predictable via early-accessible features and adaptable to different observation intervals, as well as a set of specialized regression models to handle different classes of videos with different lifetime. We validate LARM on two YouTube data sets with hourly and daily observation intervals. Experimental results indicate that LARM outperforms several non-trivial baselines from the literature by up to 20% and 18% of prediction error reduction in the two data sets.

[1]  Jiqiang Wu,et al.  Modeling Dynamics of Online Video Popularity , 2016, IEEE Transactions on Multimedia.

[2]  Anirban Mahanti,et al.  Characterizing and Predicting Viral-and-Popular Video Content , 2015, CIKM.

[3]  Christos Faloutsos,et al.  Rise and fall patterns of information diffusion: model and implications , 2012, KDD.

[4]  Jussara M. Almeida,et al.  Using early view patterns to predict the popularity of youtube videos , 2013, WSDM.

[5]  Christos Faloutsos,et al.  Vote-and-Comment: Modeling the Coevolution of User Interactions in Social Voting Web Sites , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[6]  Yiqun Liu,et al.  Predicting the popularity of web 2.0 items based on user comments , 2014, SIGIR.

[7]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[8]  Scott Sanner,et al.  The Lifecyle of a Youtube Video: Phases, Content and Popularity , 2015, ICWSM.

[9]  Flavio Figueiredo,et al.  On the prediction of popularity of trends and hits for user generated videos , 2013, WSDM.

[10]  Duncan J. Watts,et al.  Exploring Limits to Prediction in Complex Social Systems , 2016, WWW.

[11]  Niklas Carlsson,et al.  The untold story of the clones: content-agnostic factors that impact YouTube video popularity , 2012, KDD.

[12]  Jilin Zhang,et al.  How YouTube videos are discovered and its impact on video views , 2015, Multimedia Tools and Applications.

[13]  Scott Sanner,et al.  Can this video be promoted? - Endogenous and exogenous popularity processes in social media , 2016, ArXiv.

[14]  Albert-László Barabási,et al.  Modeling and Predicting Popularity Dynamics via Reinforced Poisson Processes , 2014, AAAI.

[15]  Chang Wen Chen,et al.  Forecasting initial popularity of just-uploaded user-generated videos , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[16]  Saverio Niccolini,et al.  A peek into the future: predicting the evolution of popularity in user generated content , 2013, WSDM.

[17]  Pablo Rodriguez,et al.  I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system , 2007, IMC '07.

[18]  Rediet Abebe Can Cascades be Predicted? , 2014 .

[19]  Ibrahim Matta,et al.  Describing and forecasting video access patterns , 2011, 2011 Proceedings IEEE INFOCOM.

[20]  Lifeng Sun,et al.  Propagation-based social-aware replication for social video contents , 2012, ACM Multimedia.

[21]  Lada A. Adamic,et al.  Detecting Large Reshare Cascades in Social Networks , 2017, WWW.

[22]  Flavio Figueiredo,et al.  On the Dynamics of Social Media Popularity: A YouTube Case Study , 2014, TOIT.

[23]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[24]  Tao Mei,et al.  Towards Cross-Domain Learning for Social Video Popularity Prediction , 2013, IEEE Transactions on Multimedia.

[25]  Zhoujun Li,et al.  Burst Time Prediction in Cascades , 2015, AAAI.

[26]  Hong Cheng,et al.  A Model-Free Approach to Infer the Diffusion Network from Event Cascade , 2016, CIKM.

[27]  Swapnil Mishra,et al.  Feature Driven and Point Process Approaches for Popularity Prediction , 2016, CIKM.

[28]  Jun Zhou,et al.  A Novel Time Series Approach for Predicting the Long-Term Popularity of Online Videos , 2016, IEEE Transactions on Broadcasting.

[29]  Jure Leskovec,et al.  SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity , 2015, KDD.