Tunable VVC Frame Partitioning Based on Lightweight Machine Learning

The block partitioning structure is a critical module in video coding schemes for achieving significant gains in compression performance. During the exploration of the future video coding standard, Versatile Video Coding (VVC), a new Quad Tree Binary Tree (QTBT) block partition structure was introduced. In addition to the QT block partitioning defined in the High Efficiency Video Coding (HEVC) standard, new horizontal and vertical BT partitions are enabled, which drastically increases the encoding time compared to HEVC. In this paper, we propose a lightweight and tunable QTBT partitioning scheme based on a Machine Learning (ML) approach. The proposed solution uses Random Forest classifiers to determine, for each coding block, the most probable partition modes. To minimize the encoding loss induced by misclassification, risk intervals for classifier decisions are introduced. By varying the size of the risk intervals, a tunable trade-off between encoding complexity reduction and coding loss is achieved. Implemented in the JEM-7.0 software, the proposed solution offers encoding complexity reductions ranging from 30% to 70% on average for a Bjøntegaard Delta Rate (BD-BR) increase of only 0.7% to 3.0% in the Random Access (RA) coding configuration, with very little overhead induced by the Random Forest classifiers. The proposed solution is also efficient at reducing the complexity of the Multi-Type Tree (MTT) partitioning scheme under the VTM-5.0 software, with complexity reductions ranging from 25% to 61% on average for a BD-BR increase of only 0.4% to 2.2%.
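The role of the risk intervals can be illustrated with a minimal sketch. The abstract does not give the exact decision rule, so the following is an assumption: the classifier outputs a probability that a block should be split, and when that probability falls inside an interval centred on the 0.5 decision boundary, the encoder keeps both options for its usual rate-distortion search instead of trusting the classifier. The function name `candidate_modes`, the mode labels, and the `risk` parameter are hypothetical and are not taken from the paper or from the JEM/VTM software.

```python
from typing import List


def candidate_modes(p_split: float, risk: float) -> List[str]:
    """Return the partition options the encoder would still evaluate.

    p_split: assumed classifier output, the estimated probability that the
             block should be split (for a forest, e.g. the share of trees
             voting "split").
    risk:    assumed half-width of the risk interval around 0.5, in [0, 0.5].
             A larger value means fewer blocks are decided by the classifier
             alone.
    """
    if not 0.0 <= p_split <= 1.0:
        raise ValueError("p_split must lie in [0, 1]")
    if not 0.0 <= risk <= 0.5:
        raise ValueError("risk must lie in [0, 0.5]")

    low, high = 0.5 - risk, 0.5 + risk
    if low <= p_split <= high:
        # Uncertain prediction: fall back to testing both options.
        return ["no_split", "split"]
    # Confident prediction: test only the predicted option.
    return ["split"] if p_split > high else ["no_split"]


def modes_tested(probabilities: List[float], risk: float) -> int:
    """Count the options evaluated over a set of blocks (a crude complexity proxy)."""
    return sum(len(candidate_modes(p, risk)) for p in probabilities)


if __name__ == "__main__":
    sample = [0.05, 0.30, 0.45, 0.55, 0.70, 0.95]  # made-up probabilities
    for r in (0.0, 0.1, 0.25, 0.5):
        print(f"risk={r:.2f}: {modes_tested(sample, r)} options tested")
```

Under these assumptions, widening the interval moves the encoder toward the exhaustive search (lower coding loss, less complexity reduction), while narrowing it relies more heavily on the classifier, which is the tunable trade-off described above.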
