YouTube live and Twitch: a tour of user-generated live streaming systems

User-Generated live video streaming systems are services that allow anybody to broadcast a video stream over the Internet. These Over-The-Top services have recently gained popularity, in particular with e-sport, and can now be seen as competitors of the traditional cable TV. In this paper, we present a dataset for further works on these systems. This dataset contains data on the two main user-generated live streaming systems: Twitch and the live service of YouTube. We got three months of traces of these services from January to April 2014. Our dataset includes, at every five minutes, the identifier of the online broadcaster, the number of people watching the stream, and various other media information. In this paper, we introduce the dataset and we make a preliminary study to show the size of the dataset and its potentials. We first show that both systems generate a significant traffic with frequent peaks at more than 1 Tbps. Thanks to more than a million unique uploaders, Twitch is in particular able to offer a rich service at anytime. Our second main observation is that the popularity of these channels is more heterogeneous than what have been observed in other services gathering user-generated content.

[1]  Yong-Yeol Ahn,et al.  Analyzing the Video Popularity Characteristics of Large-Scale User Generated Content Systems , 2009, IEEE/ACM Transactions on Networking.

[2]  Alberto Blanc,et al.  Transcoding live adaptive video streams at a massive scale in the cloud , 2015, MMSys.

[3]  Zhi-Li Zhang,et al.  YouTube traffic dynamics and its interplay with a tier-1 ISP: an ISP perspective , 2010, IMC '10.

[4]  Gwendal Simon,et al.  DASH in Twitch: Adaptive Bitrate Streaming in Live Game Streaming Platforms , 2014, VideoNext '14.

[5]  Marco Mellia,et al.  YouTube everywhere: impact of device and infrastructure synergies on user experience , 2011, IMC '11.

[6]  Hossam S. Hassanein,et al.  A survey of peer-to-peer live video streaming schemes - An algorithmic perspective , 2012, Comput. Networks.

[7]  Siqi Shen,et al.  The XFire online meta-gaming network: observation and high-level analysis , 2011, 2011 IEEE International Workshop on Haptic Audio Visual Environments and Games.

[8]  Keith W. Ross,et al.  A Measurement Study of a Large-Scale P2P IPTV System , 2007, IEEE Transactions on Multimedia.

[9]  Alex Borges Vieira,et al.  Characterizing SopCast client behavior , 2012, Comput. Commun..

[10]  Loïc Cerf,et al.  Watch me playing, i am a professional: a first study on video game live streaming , 2012, WWW.

[11]  Vivek S. Pai,et al.  Towards understanding modern web traffic , 2011, SIGMETRICS '11.

[12]  Fabrice Guillemin,et al.  Experimental analysis of caching efficiency for YouTube traffic in an ISP network , 2013, Proceedings of the 2013 25th International Teletraffic Congress (ITC).

[13]  Michalis Faloutsos,et al.  A First Step Towards Understanding Popularity in YouTube , 2010, 2010 INFOCOM IEEE Conference on Computer Communications Workshops.

[14]  Zhi-Li Zhang,et al.  Counting YouTube videos via random prefix sampling , 2011, IMC '11.

[15]  Gwendal Simon,et al.  How Neutral is a CDN? An economic approach , 2014, 10th International Conference on Network and Service Management (CNSM) and Workshop.

[16]  Virgílio A. F. Almeida,et al.  A hierarchical characterization of a live streaming media workload , 2006, TNET.