On Migratory Behavior in Video Consumption

Today's video streaming market is crowded with various content providers (CPs). For individual CPs, understanding user behavior, in particular how users migrate among different CPs, is crucial for improving users' on-site experience and the CP's chance of success. In this paper, we take a data-driven approach to analyze and model user migration behavior in video streaming, i.e., users switching content provider during active sessions. Based on a large ISP dataset over two months (6 major content providers, 3.8 million users, and 315 million video requests), we study common migration patterns and reasons of migration. We find that migratory behavior is prevalent: 66% of users switch CPs with an average switching frequency of 13%. In addition, migration behaviors are highly diverse: regardless large or small CPs, they all have dedicated groups of users who like to switch to them for certain types of videos. Regarding reasons of migration, we find CP service quality rarely causes migration, while a few popular videos play a bigger role. Nearly 60% of cross-site migrations are landed to 0.14% top videos. Finally, we validate our findings by building an accurate regression model to predict user migration frequency, and discuss the implications of our results to CPs.

[1]  Reza Zafarani,et al.  Understanding User Migration Patterns in Social Media , 2011, AAAI.

[2]  Derek Ruths,et al.  User Migration in Online Social Networks: A Case Study on Reddit During a Period of Community Unrest , 2016, ICWSM.

[3]  Ruixi Yuan,et al.  Measurement and analysis of a large scale commercial mobile internet TV system , 2011, IMC '11.

[4]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[5]  Ben Y. Zhao,et al.  Understanding user behavior in large-scale video-on-demand systems , 2006, EuroSys.

[6]  Mor Naaman,et al.  A Data-Driven Study of View Duration on YouTube , 2016, ICWSM.

[7]  Gang Wang,et al.  A First Look at User Switching Behaviors Over Multiple Video Content Providers , 2017, ICWSM.

[8]  Anirban Mahanti,et al.  Characterizing and Predicting Viral-and-Popular Video Content , 2015, CIKM.

[9]  Vyas Sekar,et al.  Understanding the impact of video quality on user engagement , 2011, SIGCOMM.

[10]  Robert H. Kewley,et al.  Data strip mining for the virtual design of pharmaceuticals with neural networks , 2000, IEEE Trans. Neural Networks Learn. Syst..

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Gaogang Xie,et al.  On the geographic patterns of a large-scale mobile video-on-demand system , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[13]  Gang Wang,et al.  Anatomy of a Personalized Livestreaming System , 2016, Internet Measurement Conference.

[14]  Ke Xu,et al.  On popularity prediction of videos shared in online social networks , 2013, CIKM.

[15]  Haiyi Zhu,et al.  Selecting an effective niche: an ecological view of the success of online communities , 2014, CHI.

[16]  Virgílio A. F. Almeida,et al.  Characterizing user behavior in online social networks , 2009, IMC '09.

[17]  Cecilia Mascolo,et al.  Track globally, deliver locally: improving content delivery networks by tracking geographic social cascades , 2011, WWW.

[18]  Gaogang Xie,et al.  Watching videos from everywhere: a study of the PPTV mobile VoD system , 2012, IMC '12.

[19]  Mirjam Wattenhofer,et al.  YouTube around the world: geographic popularity of videos , 2012, WWW.

[20]  Pablo Rodriguez,et al.  Watching television over an IP network , 2008, IMC '08.

[21]  Songqing Chen,et al.  The stretched exponential distribution of internet media access patterns , 2008, PODC '08.

[22]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[23]  Inderjit S. Dhillon,et al.  A Divisive Information-Theoretic Feature Clustering Algorithm for Text Classification , 2003, J. Mach. Learn. Res..

[24]  Robert N. M. Watson,et al.  Ignoring the Great Firewall of China , 2006, Privacy Enhancing Technologies.

[25]  Cheng Huang,et al.  Challenges, design and analysis of a large-scale p2p-vod system , 2008, SIGCOMM '08.

[26]  Aleksandar Kuzmanovic,et al.  Mosaic: quantifying privacy leakage in mobile networks , 2013, SIGCOMM.

[27]  Anne-Marie Kermarrec,et al.  Content and geographical locality in user-generated content sharing systems , 2012, NOSSDAV '12.

[28]  Gaogang Xie,et al.  User Behavior Characterization of a Large-scale Mobile Live Streaming System , 2015, WWW.

[29]  Aniket Kittur,et al.  The impact of membership overlap on the survival of online communities , 2013, ICIS.

[30]  Gang Wang,et al.  Unsupervised Clickstream Clustering for User Behavior Analysis , 2016, CHI.

[31]  Ramesh K. Sitaraman,et al.  Video Stream Quality Impacts Viewer Behavior: Inferring Causality Using Quasi-Experimental Designs , 2012, IEEE/ACM Transactions on Networking.

[32]  Nishanth R. Sastry,et al.  On factors affecting the usage and adoption of a nation-wide TV streaming service , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[33]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[34]  K. K. Ramakrishnan,et al.  Understanding couch potatoes: measurement and modeling of interactive usage of IPTV at large scale , 2011, IMC '11.

[35]  Ning Xia,et al.  Inside the bird's nest: measurements of large-scale live VoD from the 2008 olympics , 2009, IMC '09.