TrendLearner: Early prediction of popularity trends of user generated content

We here focus on the problem of predicting the popularity trend of user generated content (UGC) as early as possible. Taking YouTube videos as case study, we propose a novel two-step learning approach that: (1) extracts popularity trends from previously uploaded objects, and (2) predicts trends for new content. Unlike previous work, our solution explicitly addresses the inherent tradeoff between prediction accuracy and remaining interest in the content after prediction, solving it on a per-object basis. Our experimental results show great improvements of our solution over alternatives, and its applicability to improve the accuracy of state-of-the-art popularity prediction methods.

[1]  Flavio Figueiredo,et al.  Improving the Effectiveness of Content Popularity Prediction Methods using Time Series Trends , 2014, ArXiv.

[2]  Andrew Y. Ng,et al.  Learning Feature Representations with K-Means , 2012, Neural Networks: Tricks of the Trade.

[3]  D.,et al.  Regression Models and Life-Tables , 2022 .

[4]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[5]  Virgílio A. F. Almeida,et al.  Capacity Planning for Web Services: Metrics, Models, and Methods , 2001 .

[6]  Flavio Figueiredo,et al.  The tube over time: characterizing popularity growth of youtube videos , 2011, WSDM '11.

[7]  Balachander Krishnamurthy,et al.  Best paper -- Follow the money: understanding economics of online aggregation and advertising , 2013, Internet Measurement Conference.

[8]  Athena Vakali,et al.  Social networking trends and dynamics detection via a cloud-based framework design , 2012, WWW.

[9]  Tad Hogg,et al.  Using a model of social dynamics to predict popularity of news , 2010, WWW '10.

[10]  Devavrat Shah,et al.  A Latent Source Model for Nonparametric Time Series Classification , 2013, NIPS.

[11]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[12]  Raj Jain,et al.  The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.

[13]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[14]  Milad Shokouhi,et al.  Behavioral dynamics on the web: Learning, modeling, and prediction , 2013, TOIS.

[15]  Ke Xu,et al.  On popularity prediction of videos shared in online social networks , 2013, CIKM.

[16]  Sergei Vassilvitskii,et al.  Sharding social networks , 2013, WSDM.

[17]  Eamonn J. Keogh,et al.  Time series shapelets: a novel technique that allows accurate, interpretable and fast classification , 2010, Data Mining and Knowledge Discovery.

[18]  Virgílio A. F. Almeida,et al.  Finding trendsetters in information networks , 2012, KDD.

[19]  Bernard Zenko,et al.  Is Combining Classifiers with Stacking Better than Selecting the Best One? , 2004, Machine Learning.

[20]  Christos Faloutsos,et al.  Rise and fall patterns of information diffusion: model and implications , 2012, KDD.

[21]  Hsinchun Chen,et al.  Social Media Analytics and Intelligence , 2010, IEEE Intell. Syst..

[22]  A. Gilles,et al.  The Art of Computer Systems Performance Analysis (Techniques for Experimental Design, Measurement, Simulation, and Modeling) , 1992 .

[23]  Yi Yang,et al.  Viral Video Style: A Closer Look at Viral Videos on YouTube , 2014, ICMR.

[24]  Krishna P. Gummadi,et al.  A measurement-driven analysis of information propagation in the flickr social network , 2009, WWW '09.

[25]  Stefan Stieglitz,et al.  Social Media Analytics , 2014, Business & Information Systems Engineering.

[26]  Kavé Salamatian,et al.  An Approach to Model and Predict the Popularity of Online Contents with Explanatory Factors , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[27]  Noah A. Smith,et al.  What's Worthy of Comment? Content and Comment Volume in Political Blogs , 2010, ICWSM.

[28]  Didier Sornette,et al.  Robust dynamic classes revealed by measuring the response function of a social system , 2008, Proceedings of the National Academy of Sciences.

[29]  MahantiAnirban,et al.  Characterizing and modelling popularity of user-generated videos , 2011 .

[30]  Stanislav Nikolov Trend or no trend : a novel nonparametric method for classifying time series , 2012 .

[31]  Jussara M. Almeida,et al.  Using early view patterns to predict the popularity of youtube videos , 2013, WSDM.

[32]  Philip S. Yu,et al.  Deriving latent social impulses to determine longevous videos , 2014, WWW '14 Companion.

[33]  Rayid Ghani,et al.  Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[34]  Saverio Niccolini,et al.  A peek into the future: predicting the evolution of popularity in user generated content , 2013, WSDM.

[35]  Yehuda Koren,et al.  Expediting search trend detection via prediction of query counts , 2013, WSDM.

[36]  Jure Leskovec,et al.  Modeling Information Diffusion in Implicit Networks , 2010, 2010 IEEE International Conference on Data Mining.

[37]  Jürgen Pfeffer,et al.  Characterizing the life cycle of online news stories using social media reactions , 2013, CSCW.

[38]  Flavio Figueiredo,et al.  On the Dynamics of Social Media Popularity: A YouTube Case Study , 2014, TOIT.

[39]  Yong-Yeol Ahn,et al.  Analyzing the Video Popularity Characteristics of Large-Scale User Generated Content Systems , 2009, IEEE/ACM Transactions on Networking.

[40]  Wang-Chien Lee,et al.  A straw shows which way the wind blows: ranking potentially popular items from early votes , 2012, WSDM '12.