Three-year Trends in YouTube Video Content and Encoding

Despite the dominance of YouTube streaming traffic, there have been few studies focusing on characterizing YouTube videos over time. Given the sheer volume of YouTube videos, we created a custom crawler which took snapshots of popular YouTube channels and ran the crawler daily for the past 3 years. This provides YouTube video trends from 2018–2020 for over 160k videos, considering media type, duration, bit rate, resolution, codec, encoding format, and popularity. Analysis of the data shows YouTube videos have increased frame rates, resolutions and durations over this time, with the biggest clips consuming over 200 Mb/s and being over 3 hours long, accompanied by corresponding changes in encoding rates and codecs. Our analysis and the resulting dataset we make public should be beneficial for traffic shaping or CDN deployment strategies.

[1]  Jiangchuan Liu,et al.  Statistics and Social Network of YouTube Videos , 2008, 2008 16th Interntional Workshop on Quality of Service.

[2]  Roger Wattenhofer,et al.  The YouTube Social Network , 2012, ICWSM.

[3]  Konstantina Papagiannaki,et al.  Measuring Video QoE from Encrypted Traffic , 2016, Internet Measurement Conference.

[4]  Feng Li,et al.  Silhouette: Identifying YouTube Video Flows from Encrypted Traffic , 2018, NOSSDAV.

[5]  Fan Yang,et al.  The QUIC Transport Protocol: Design and Internet-Scale Deployment , 2017, SIGCOMM.

[6]  Stefan Valentin,et al.  Classifying flows and buffer state for youtube's HTTP adaptive streaming service in mobile networks , 2018, MMSys.

[7]  Lea Skorin-Kapov,et al.  A machine learning approach to classifying YouTube QoE based on encrypted network traffic , 2017, Multimedia Tools and Applications.

[8]  Ethan Katz-Bassett,et al.  BingeOn Under the Microscope: Understanding T-Mobiles Zero-Rating Implementation , 2016, Internet-QoE '16.

[9]  Mark Claypool,et al.  Characteristics of streaming media stored on the Web , 2005, TOIT.

[10]  Feng Li,et al.  Who is the King of the Hill? Traffic Analysis over a 4G Network , 2018, 2018 IEEE International Conference on Communications (ICC).

[11]  Dilip Kumar Krishnappa,et al.  DASHing YouTube: An analysis of using DASH in YouTube video service , 2013, 38th Annual IEEE Conference on Local Computer Networks.

[12]  Mirjam Wattenhofer,et al.  YouTube around the world: geographic popularity of videos , 2012, WWW.