Binary and Multi-Class Learning Based Low Complexity Optimization for HEVC Encoding

High Efficiency Video Coding (HEVC) improves the compression efficiency at the cost of high computational complexity by using the quad-tree coding unit (CU) structure and variable prediction unit (PU) modes. To minimize the HEVC encoding complexity while maintaining its compression efficiency, a binary and multi-class support vector machine (SVM)-based fast HEVC encoding algorithm is presented in this paper. First, the processes of recursive CU decision and PU selection in HEVC are modeled as hierarchical binary classification and multi-class classification structures. Second, according to the two classification structures, the CU decision and PU selection are optimized by binary and multi-class SVM, i.e., the CU and PU can be predicted directly via classifiers without intensive rate distortion (RD) cost calculation. In particular, to achieve better prediction performance, a learning method is proposed to combine the off-line machine learning (ML) mode and on-line ML mode for classifiers based on a multiple reviewers system. Additionally, the optimal parameters determination scheme is adopted for flexible complexity allocation under a given RD constraint. Experimental results show that the proposed method can achieve 68.3%, 67.3%, and 65.6% time saving on average while the values of Bjøntegaard delta peak signal-to-noise ratio are −0.093dB, −0.091dB, and −0.094dB and the values of Bjøntegaard delta bit rate are 4.191%, 3.842%, and 3.665% under low delay $P$ main, low delay main, and random access configurations, respectively, when compared with the HEVC test model version HM 16.5. Meanwhile, the proposed method outperforms the state-of-the-art fast coding algorithms in terms of complexity reduction and RD performance.

[1]  Yongdong Zhang,et al.  High Efficiency Video Coding: High Efficiency Video Coding , 2014 .

[2]  Anil Fernando,et al.  Content-Adaptive Feature-Based CU Size Prediction for Fast Low-Delay Video Encoding in HEVC , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Xinpeng Zhang,et al.  Fast TU size decision algorithm for HEVC encoders using Bayesian theorem detection , 2015, Signal Process. Image Commun..

[4]  NebutaFestival,et al.  Fast HEVC Encoding Decisions Using Data Mining , 2022 .

[5]  Kiho Choi,et al.  Early TU decision method for fast video encoding in high efficiency video coding , 2012 .

[6]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[7]  Chih-Jen Lin,et al.  Probability Estimates for Multi-class Classification by Pairwise Coupling , 2003, J. Mach. Learn. Res..

[8]  Long Xu,et al.  Machine Learning-Based Coding Unit Depth Decisions for Flexible Complexity Allocation in High Efficiency Video Coding , 2015, IEEE Transactions on Image Processing.

[9]  Jeong-Hoon Park,et al.  Block Partitioning Structure in the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Bin Li,et al.  An Efficient Fast Mode Decision Method for Inter Prediction in HEVC , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  King Ngi Ngan,et al.  Fast HEVC Inter CU Decision Based on Latent SAD Estimation , 2015, IEEE Transactions on Multimedia.

[12]  Munchurl Kim,et al.  A Novel Fast CU Encoding Scheme Based on Spatiotemporal Encoding Parameters for HEVC Inter Coding , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Gangyi Jiang,et al.  Machine learning based fast H.264/AVC to HEVC transcoding exploiting block partition similarity , 2016, J. Vis. Commun. Image Represent..

[14]  Robert N. Kantor,et al.  ANALYTIC SCORING OF TOEFL® CBT ESSAYS: SCORES FROM HUMANS AND E‐RATER® , 2008 .

[15]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[16]  Xinpeng Zhang,et al.  An Effective CU Size Decision Method for HEVC Encoders , 2013, IEEE Transactions on Multimedia.

[17]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[18]  Eduardo Peixoto,et al.  Inter-Prediction Optimizations for Video Coding Using Adaptive Coding Unit Visiting Order , 2016, IEEE Transactions on Multimedia.

[19]  Marcelo Alencar,et al.  Online learning early skip decision method for the HEVC Inter process using the SVM-based Pegasos algorithm , 2016 .

[20]  Panos Nasiopoulos,et al.  Online-Learning-Based Mode Prediction Method for Quality Scalable Extension of the High Efficiency Video Coding (HEVC) Standard , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Hyun Wook Park,et al.  A Fast Mode Decision Method in HEVC Using Adaptive Ordering of Modes , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Rae-Hong Park,et al.  Fast CU Partitioning Algorithm for HEVC Using an Online-Learning-Based Bayesian Decision Rule , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Jaeho Lee,et al.  A Fast CU Size Decision Algorithm for HEVC , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Eduardo Peixoto,et al.  H.264/AVC to HEVC Video Transcoder Based on Dynamic Thresholding and Content Modeling , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Byeungwoo Jeon,et al.  Early Skip Mode Decision for HEVC Encoder With Emphasis on Coding Quality , 2015, IEEE Transactions on Broadcasting.

[26]  Xingming Sun,et al.  Fast Motion Estimation Based on Content Property for Low-Complexity H.265/HEVC Encoder , 2016, IEEE Transactions on Broadcasting.

[27]  Marko Viitanen,et al.  Efficient Mode Decision Schemes for HEVC Inter Prediction , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.