Data-Driven Rate Control for Rate-Distortion Optimization in HEVC Based on Simplified Effective Initial QP Learning

Different from the conventional calculative methods, a learning-based initial quantization parameter (LIQP) method is proposed in this paper to improve rate control of high efficiency video coding (H.265). First, the framework for initial quantization parameter (QP) learning is proposed, where a novel equivalent approach to build the benchmark labels is proposed using the single rate-distortion (R-D) pair in each initial QP testing. With the criterion of maximizing the prediction accuracy of initial QPs, features and parameters of the learning model are refined. Instead of the traditionally used target bits per pixel (bpp) for intraframe, the target bpp for all remaining frames is proposed to avoid the empirical setting on intracoding bits, and thus the related inaccuracy can be prevented. We clearly present the motivations of the proposed LIQP method, as well as the reasons for the extracted features and model parameters. The proposed LIQP method outperforms the latest HM-16.14 by achieving significant gains on R-D performance (−15.48% BD-BR and 0.782 dB BD-PSNR gains), quality smoothness (1.581 dB versus 2.598 dB), and more stable buffer occupancy control, with similar high bit rate accuracy (99.84% versus 99.87%), and can also work well for scene change cases.

[1]  NebutaFestival,et al.  Fast HEVC Encoding Decisions Using Data Mining , 2022 .

[2]  Kurt Hornik,et al.  The support vector machine under test , 2003, Neurocomputing.

[3]  T. Wiegand,et al.  Text Description of Joint Model Reference Encoding Methods and Decoding Concealment Methods , 2004 .

[4]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[5]  Sanjit K. Mitra,et al.  . Optimum bit allocation and accurate rate control for video coding via ρ-domain source modeling , 2002, IEEE Trans. Circuits Syst. Video Technol..

[6]  Christos Grecos,et al.  MPEG-7 Descriptors Based Shot Detection and Adaptive Initial Quantization Parameter Estimation for the H.264/AVC , 2009, IEEE Transactions on Broadcasting.

[7]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[8]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[9]  Lu Yu,et al.  CU splitting early termination based on weighted SVM , 2013, EURASIP Journal on Image and Video Processing.

[10]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[11]  Gang Hua,et al.  Multimedia Big Data Computing , 2015, IEEE Multim..

[12]  Stefanie Eberhardt Support Vector Machines For Pattern Recognition , 2006 .

[13]  Shahaboddin Shamshirband,et al.  Sensor Data Fusion by Support Vector Regression Methodology—A Comparative Study , 2015, IEEE Sensors Journal.

[14]  Tihao Chiang,et al.  A new rate control scheme using quadratic rate distortion model , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[15]  C.-C. Jay Kuo,et al.  Novel Rate-Quantization Model-Based Rate Control With Adaptive Initialization for Spatial Scalable Video Coding , 2012, IEEE Transactions on Industrial Electronics.

[16]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[18]  Yongdong Zhang,et al.  High Efficiency Video Coding: High Efficiency Video Coding , 2014 .

[19]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[20]  Gary J. Sullivan,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Zhan Ma,et al.  Fast Mode and Partition Decision Using Machine Learning for Intra-Frame Coding in HEVC Screen Content Coding Extension , 2016, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[22]  Sam Kwong,et al.  Rate-Distortion Optimization of Rate Control for H.264 With Adaptive Initial Quantization Parameter Determination , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Detlev Marpe,et al.  Block Merging for Quadtree-Based Partitioning in HEVC , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Yu Zhou,et al.  SSIM-Based Game Theory Approach for Rate-Distortion Optimized Intra Frame CTU-Level Bit Allocation , 2016, IEEE Transactions on Multimedia.

[25]  Rae-Hong Park,et al.  Fast CU Partitioning Algorithm for HEVC Using an Online-Learning-Based Bayesian Decision Rule , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  Xuan Jing,et al.  Frame Complexity-Based Rate-Quantization Model for H.264/AVC Intraframe Rate Control , 2008, IEEE Signal Processing Letters.

[27]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[28]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[29]  Pedro Cuenca,et al.  Low-Complexity Heterogeneous Video Transcoding Using Data Mining , 2008, IEEE Transactions on Multimedia.

[30]  Dong-Gyu Sim,et al.  Pixel-Wise Unified Rate-Quantization Model for Multi-Level Rate Control , 2013, IEEE Journal of Selected Topics in Signal Processing.

[31]  Shang-Hong Lai,et al.  Fast H.264 Encoding Based on Statistical Learning , 2010, PCM.

[32]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[33]  Gao Wei,et al.  Phase Congruency based edge saliency detection and rate control for perceptual image and video coding , 2016 .

[34]  Sam Kwong,et al.  DCT Coefficient Distribution Modeling and Quality Dependency Analysis Based Frame-Level Bit Allocation for HEVC , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[36]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[37]  David L. Neuhoff,et al.  Quantization , 2022, IEEE Trans. Inf. Theory.

[38]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[39]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[40]  Ramesh C. Jain,et al.  Guest Editorial Multimedia: The Biggest Big Data , 2015, IEEE Trans. Multim..

[41]  Sam Kwong,et al.  Joint Machine Learning and Game Theory for Rate Control in High Efficiency Video Coding. , 2017, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.