Referenceless Rate-Distortion Modeling with Learning from Bitstream and Pixel Features

Generally, adaptive bitrates for variable Internet bandwidths can be obtained through multi-pass coding. Referenceless prediction-based methods show practical benefits compared with multi-pass coding to avoid excessive computational resource consumption, especially in low-latency circumstances. However, most of them fail to predict precisely due to the complex inner structure of modern codecs. Therefore, to improve the fidelity of prediction, we propose a referenceless prediction-based R-QP modeling (PmR-QP) method to estimate bitrate by leveraging a deep learning algorithm with only one-pass coding. It refines the global rate-control paradigm in modern codecs on flexibility and applicability with few adjustments as possible. By exploring the potentials of bitstream and pixel features from the prerequisite of one-pass coding, it can reach the expectation of bitrate estimation in terms of precision. To be more specific, we first describe the R-QP relationship curve as a robust quadratic R-QP modeling function derived from the Cauchy-based distribution. Second, we simplify the modeling function by fastening one operational point of the relationship curve received from the coding process. Third, we learn the model parameters from bitstream and pixel features, named them hybrid referenceless features, comprising texture information, hierarchical coding structure, and selected modes in intra-prediction. Extensive experiments demonstrate the proposed method significantly decreases the proportion of samples' bitrate estimation error within 10% by 24.60% on average over the state-of-the-art.

[1]  Jörn Ostermann,et al.  Deep learning-based intra prediction mode decision for HEVC , 2016, 2016 Picture Coding Symposium (PCS).

[2]  Ming-Ting Sun,et al.  Digital Video Transcoding , 2005, Proceedings of the IEEE.

[3]  Wen Gao,et al.  Quadratic ρ-domain based rate control algorithm for HEVC , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  N. R. Goodman Statistical analysis based on a certain multivariate complex Gaussian distribution , 1963 .

[5]  Mandyam D. Srinath,et al.  Statistical distributions of image DCT coefficients , 1986 .

[6]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Shan Liu,et al.  Yoco: Light-Weight Rate Control Model Learning , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[8]  Yücel Altunbasak,et al.  Frame bit allocation for the H.264/AVC video coder via Cauchy-density-based rate and distortion models , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[10]  Jeong-Hoon Park,et al.  Block Partitioning Structure in the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Izquierdo Ebroul,et al.  Estimation of Rate Control Parameters for Video Coding Using CNN , 2018 .

[12]  Hua Yang,et al.  Advances in Recursive Per-Pixel End-to-End Distortion Estimation for Robust Video Coding in H.264/AVC , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Lifeng Sun,et al.  Generalizing Rate Control Strategies for Realtime Video Streaming via Learning from Deep Learning , 2019, MMAsia.

[14]  Bin Xu,et al.  CNN-based rate-distortion modeling for H.265/HEVC , 2017, 2017 IEEE Visual Communications and Image Processing (VCIP).

[15]  Anil C. Kokaram,et al.  Optimizing Transcoder Quality Targets Using a Neural Network with an Embedded Bitrate Model , 2016, Visual Information Processing and Communication.

[16]  F. Muller Distribution shape of two-dimensional DCT coefficients of natural images , 1993 .

[17]  E. Mansard,et al.  The Measurement of Incident and Reflected Spectra Using a Least squares Method , 1980 .

[18]  Chin-Hui Lee,et al.  Video segmentation using spatial and temporal statistical analysis method , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[19]  Lifeng Sun,et al.  QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning , 2018, ACM Multimedia.

[20]  Eirikur Agustsson,et al.  NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[21]  Luis Nero Alves,et al.  Fast Motion Estimation Algorithm for HEVC , 2012, 2012 IEEE Second International Conference on Consumer Electronics - Berlin (ICCE-Berlin).

[22]  Homer H. Chen,et al.  SSIM-Based Perceptual Rate Control for Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Tihao Chiang,et al.  A new rate control scheme using quadratic rate distortion model , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[24]  Xinfeng Zhang,et al.  Spatial-temporal residue network based in-loop filter for video coding , 2017, 2017 IEEE Visual Communications and Image Processing (VCIP).

[25]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[26]  Yu Zhou,et al.  SSIM-Based Game Theory Approach for Rate-Distortion Optimized Intra Frame CTU-Level Bit Allocation , 2016, IEEE Transactions on Multimedia.

[27]  Zhu Li,et al.  A Machine Learning Approach to Accurate Sequence-Level Rate Control Scheme for Video Coding , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[28]  Ming Yang,et al.  Using Deep Learning Neural Network for Block Partitioning in H.265/HEVC , 2018 .

[29]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[30]  M. Angela Sasse,et al.  Sharp or smooth?: comparing the effects of quantization vs. frame rate for streamed video , 2004, CHI '04.

[31]  Tihao Chiang,et al.  A new rate control scheme using quadratic rate distortion model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[32]  Gary J. Sullivan,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Marta Mrak,et al.  Estimation of Rate Control Parameters for Video Coding Using CNN , 2018, 2018 IEEE Visual Communications and Image Processing (VCIP).

[34]  Bin Li,et al.  QP refinement according to Lagrange multiplier for High Efficiency Video Coding , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[35]  David Flynn,et al.  HEVC Complexity and Implementation Analysis , 2012, IEEE Transactions on Circuits and Systems for Video Technology.