Bio-inspired models for characterizing YouTube viewcout

The goal of this paper is to study the behaviour of viewcount in YouTube. We first propose several bio-inspired models for the evolution of the viewcount of YouTube videos. We show, using a large set of empirical data, that the viewcount for 90% of videos in YouTube can indeed be associated to at least one of these models, with a Mean Error which does not exceed 5%. We derive automatic ways of classifying the viewcount curve into one of these models and of extracting the most suitable parameters of the model. We study empirically the impact of videos' popularity and category on the evolution of its viewcount. We finally use the above classification along with the automatic parameters extraction in order to predict the evolution of videos' viewcount.

[1]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[2]  Niklas Carlsson,et al.  Characterizing web-based video sharing workloads , 2009, WWW '09.

[3]  W. Edwards Deming The Chi-Test and Curve Fitting , 1934 .

[4]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[5]  Jiangchuan Liu,et al.  Statistics and Social Network of YouTube Videos , 2008, 2008 16th Interntional Workshop on Quality of Service.

[6]  Vijay Mahajan,et al.  Chapter 8 New-product diffusion models , 1993, Marketing.

[7]  Christos Faloutsos,et al.  Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining , 2013, ASONAM 2013.

[8]  Zongpeng Li,et al.  Youtube traffic characterization: a view from the edge , 2007, IMC '07.

[9]  Alexander Grey,et al.  The Mathematical Theory of Infectious Diseases and Its Applications , 1977 .

[10]  Pablo Rodriguez,et al.  I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system , 2007, IMC '07.

[11]  Donald F. Towsley,et al.  The effect of network topology on the spread of epidemics , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[12]  Georges Linarès,et al.  Modelling View-count Dynamics in YouTube , 2014, ArXiv.

[13]  Christos Faloutsos,et al.  Epidemic thresholds in real networks , 2008, TSEC.

[14]  Michalis Faloutsos,et al.  A First Step Towards Understanding Popularity in YouTube , 2010, 2010 INFOCOM IEEE Conference on Computer Communications Workshops.

[15]  Francesco De Pellegrini,et al.  YOUStatAnalyzer: a tool for analysing the dynamics of YouTube content popularity , 2013, VALUETOOLS.

[16]  L. Meyers Contact network epidemiology: Bond percolation applied to infectious disease prediction and control , 2006 .

[17]  Santo Fortunato,et al.  Traffic in Social Media II: Modeling Bursty Popularity , 2010, 2010 IEEE Second International Conference on Social Computing.

[18]  Yong-Yeol Ahn,et al.  Analyzing the Video Popularity Characteristics of Large-Scale User Generated Content Systems , 2009, IEEE/ACM Transactions on Networking.

[19]  F. Bass The Relationship between Diffusion Rates, Experience Curves, and Demand Elasticities for Consumer Durable Technological Innovations , 1980 .

[20]  N. Ling The Mathematical Theory of Infectious Diseases and its applications , 1978 .

[21]  Didier Sornette,et al.  Viral, Quality, and Junk Videos on YouTube: Separating Content from Noise in an Information-Rich Environment , 2008, AAAI Spring Symposium: Social Information Processing.