Learning to Predict Streaming Video QoE: Distortions, Rebuffering and Memory

Mobile streaming video data accounts for a large and increasing percentage of wireless network traffic. The available bandwidths of modern wireless networks are often unstable, leading to difficulties in delivering smooth, high-quality video. Streaming service providers such as Netflix and YouTube attempt to adapt their systems to adjust in response to these bandwidth limitations by changing the video bitrate or, failing that, allowing playback interruptions (rebuffering). Being able to predict end user' quality of experience (QoE) resulting from these adjustments could lead to perceptually-driven network resource allocation strategies that would deliver streaming content of higher quality to clients, while being cost effective for providers. Existing objective QoE models only consider the effects on user QoE of video quality changes or playback interruptions. For streaming applications, adaptive network strategies may involve a combination of dynamic bitrate allocation along with playback interruptions when the available bandwidth reaches a very low value. Towards effectively predicting user QoE, we propose Video Assessment of TemporaL Artifacts and Stalls (Video ATLAS): a machine learning framework where we combine a number of QoE-related features, including objective quality features, rebuffering-aware features and memory-driven features to make QoE predictions. We evaluated our learning-based QoE prediction model on the recently designed LIVE-Netflix Video QoE Database which consists of practical playout patterns, where the videos are afflicted by both quality changes and rebuffering events, and found that it provides improved performance over state-of-the-art video quality metrics while generalizing well on different datasets. The proposed algorithm is made publicly available at this http URL release_v2.rar.

[1]  Alan C. Bovik,et al.  Study of the effects of stalling events on the quality of experience of mobile streaming videos , 2014, 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[2]  Phuoc Tran-Gia,et al.  Quantification of YouTube QoE via Crowdsourcing , 2011, 2011 IEEE International Symposium on Multimedia.

[3]  Alan C. Bovik,et al.  A Completely Blind Video Integrity Oracle , 2016, IEEE Transactions on Image Processing.

[4]  Rajiv Soundararajan,et al.  Video Quality Assessment by Reduced Reference Spatio-Temporal Entropic Differencing , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Graça Bressan,et al.  Quality metric to assess video streaming service over TCP considering temporal location of pauses , 2012, IEEE Transactions on Consumer Electronics.

[6]  Alan C. Bovik,et al.  Delivery quality score model for Internet video , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[7]  Khaled El-Maleh,et al.  Perceptual Temporal Quality Metric for Compressed Video , 2007, IEEE Transactions on Multimedia.

[8]  Bernd Girod,et al.  Analysis of Packet Loss for Compressed Video: Effect of Burst Losses and Correlation Between Error Frames , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Alan C. Bovik,et al.  Automatic Prediction of Perceptual Image and Video Quality , 2013, Proceedings of the IEEE.

[10]  Alan C. Bovik,et al.  Motion silencing of flicker distortions on naturalistic videos , 2015, Signal Process. Image Commun..

[11]  Srinivasan Seshan,et al.  Developing a predictive model of quality of experience for internet video , 2013, SIGCOMM.

[12]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[13]  J. Astola,et al.  ON BETWEEN-COEFFICIENT CONTRAST MASKING OF DCT BASIS FUNCTIONS , 2007 .

[14]  Gustavo de Veciana,et al.  Modeling the Time—Varying Subjective Quality of HTTP Video Streams With Rate Adaptations , 2013, IEEE Transactions on Image Processing.

[15]  Zhou Wang,et al.  Video quality assessment based on structural distortion measurement , 2004, Signal Process. Image Commun..

[16]  Margaret H. Pinson,et al.  Temporal Video Quality Model Accounting for Variable Frame Delay Distortions , 2014, IEEE Transactions on Broadcasting.

[17]  A. Stuart,et al.  Non-Parametric Statistics for the Behavioral Sciences. , 1957 .

[18]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[19]  Martin Slanina,et al.  “To pool or not to pool”: A comparison of temporal pooling methods for HTTP adaptive video streaming , 2013, 2013 Fifth International Workshop on Quality of Multimedia Experience (QoMEX).

[20]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[21]  Alan Conrad Bovik,et al.  Study of Temporal Effects on Subjective Video Quality of Experience , 2017, IEEE Transactions on Image Processing.

[22]  Gerardo Rubino,et al.  Quality of experience estimation for adaptive HTTP/TCP video streaming using H.264/AVC , 2012, 2012 IEEE Consumer Communications and Networking Conference (CCNC).

[23]  Alan C. Bovik,et al.  Making a “Completely Blind” Image Quality Analyzer , 2013, IEEE Signal Processing Letters.

[24]  Sugato Chakravarty,et al.  Methodology for the subjective assessment of the quality of television pictures , 1995 .

[25]  Stefano Tubaro,et al.  A H.264/AVC video database for the evaluation of quality metrics , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  A. Bovik A VISUAL INFORMATION FIDELITY APPROACH TO VIDEO QUALITY ASSESSMENT , 2005 .

[27]  Alan C. Bovik,et al.  Temporal hysteresis model of time varying subjective video quality , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[28]  Alan C. Bovik,et al.  A time-varying subjective quality model for mobile streaming videos with stalling events , 2015, SPIE Optical Engineering + Applications.

[29]  Alan C. Bovik,et al.  Video Quality Pooling Adaptive to Perceptual Distortion Severity , 2013, IEEE Transactions on Image Processing.

[30]  Stefan Winkler,et al.  Analysis of Public Image and Video Databases for Quality Assessment , 2012, IEEE Journal of Selected Topics in Signal Processing.

[31]  Alan C. Bovik,et al.  Image information and visual quality , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  Alan C. Bovik,et al.  Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos , 2010, IEEE Transactions on Image Processing.

[33]  Kai Zeng,et al.  Display device-adapted video quality-of-experience assessment , 2015, Electronic Imaging.

[34]  Christophe Charrier,et al.  Blind Prediction of Natural Video Quality , 2014, IEEE Transactions on Image Processing.

[35]  Lei Zhang,et al.  Gradient Magnitude Similarity Deviation: A Highly Efficient Perceptual Image Quality Index , 2013, IEEE Transactions on Image Processing.

[36]  Terence C. Mills,et al.  Time series techniques for economists , 1990 .

[37]  Yining Qi,et al.  The Effect of Frame Freezing and Frame Skipping on Video Quality , 2006, 2006 International Conference on Intelligent Information Hiding and Multimedia.

[38]  Yuukou Horita,et al.  NR objective continuous video quality assessment model based on frame quality measure , 2008, 2008 15th IEEE International Conference on Image Processing.

[39]  Rajiv Soundararajan,et al.  Study of Subjective and Objective Quality Assessment of Video , 2010, IEEE Transactions on Image Processing.

[40]  Alan C. Bovik,et al.  A Structural Similarity Metric for Video Based on Motion Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[41]  Gustavo de Veciana,et al.  Video Quality Assessment on Mobile Devices: Subjective, Behavioral and Objective Studies , 2012, IEEE Journal of Selected Topics in Signal Processing.

[42]  Zhengfang Duanmu,et al.  A Quality-of-Experience Index for Streaming Video , 2017, IEEE Journal of Selected Topics in Signal Processing.

[43]  Hocine Cherifi,et al.  Sporadic frame dropping impact on quality perception , 2004, IS&T/SPIE Electronic Imaging.

[44]  Alan C. Bovik,et al.  Visual quality assessment algorithms: what does the future hold? , 2010, Multimedia Tools and Applications.

[45]  S. E. Avons,et al.  Recency and duration neglect in subjective assessment of television picture quality , 2001 .