A comprehensive data-driven approach to evaluating quality of experience on large-scale internet video service

Internet video is currently one of the most popular Internet services, accounting for more than 50% of the overall Internet traffic. Various access patterns of users worldwide contribute to increasing complex scenarios of Internet video. Under the circumstances, there emerges a valuable yet challenging topic that industry and academia keep focusing on: how to evaluate quality of experience (QoE) of Internet video in the wild? In this paper, we propose a comprehensive data-driven approach to evaluate QoE based on massive dataset extracted from a large-scale Internet video service provider. With the methods of feature engineering and fuzzy theory, we exploit user's situational features and session's quality features. Then we conceive an appropriate QoE evaluating model called bagging-based Bayesian factorization machine to correlate the aforementioned features with user's QoE. The experimental results demonstrate that our approach is adaptive for QoE evaluation on Internet video both in efficiency and effectiveness. Moreover, our approach achieves higher degree of accuracy compared with baseline methods, including what associated works present.

[1]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[2]  Shou-De Lin,et al.  Feature Engineering and Classifier Ensemble for KDD Cup 2010 , 2010, KDD 2010.

[3]  Vyas Sekar,et al.  Shedding light on the structure of internet video quality problems in the wild , 2013, CoNEXT.

[4]  Walter Willinger,et al.  Proceedings of the 2011 ACM SIGCOMM conference on Internet measurement conference , 2011, IMC 2011.

[5]  Yin Zhang,et al.  Q-score: proactive service quality assessment in a large IPTV system , 2011, IMC '11.

[6]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[7]  Stephen R. Gulliver,et al.  Defining user perception of distributed multimedia quality , 2006, TOMCCAP.

[8]  Ramesh K. Sitaraman,et al.  Understanding the effectiveness of video ads: a measurement study , 2013, Internet Measurement Conference.

[9]  Bruce M. Maggs,et al.  An analysis of live streaming workloads on the internet , 2004, IMC '04.

[10]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[11]  Srinivasan Seshan,et al.  Analyzing the potential benefits of CDN augmentation strategies for internet video workloads , 2013, Internet Measurement Conference.

[12]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[13]  Hervé Abdi,et al.  Correspondence Analysis , 2014, Encyclopedia of Social Network Analysis and Mining.

[14]  Christoph Freudenthaler,et al.  Bayesian Factorization Machines , 2011 .

[15]  Balachander Krishnamurthy,et al.  Best paper -- Follow the money: understanding economics of online aggregation and advertising , 2013, Internet Measurement Conference.

[16]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[17]  Ramesh K. Sitaraman,et al.  Video Stream Quality Impacts Viewer Behavior: Inferring Causality Using Quasi-Experimental Designs , 2012, IEEE/ACM Transactions on Networking.

[18]  Srinivasan Seshan,et al.  Developing a predictive model of quality of experience for internet video , 2013, SIGCOMM.

[19]  Vyas Sekar,et al.  Understanding the impact of video quality on user engagement , 2011, SIGCOMM.