Fast CU partition-based machine learning approach for reducing HEVC complexity

With the development of video coding technology, the high efficiency video coding (HEVC) provides better coding efficiency compared to its predecessors H.264/AVC. HEVC improves rate distortion (RD) performance significantly with increased encoding complexity. Due to the adoption of a large variety of coding unit (CU) sizes, at RD optimization level, the quadtree partition of the CU consumes a large proportion of the encoding complexity. Hence, the computational complexity cost remains a critical issue that must be properly considered in the optimization task. In this paper, two machine learning-based fast CU partition method for inter-mode HEVC are proposed, to optimize the complexity allocation at CU level. First, we propose an online support vector machine (SVM)-based fast CU algorithm for reducing HEVC complexity. The later was trained in an online way. Second, a deep convolutional neural network (CNN) is designed to predict the CU partition, in which large-scale training database including substantial CU partition data is considered. Experimental results demonstrate that the proposed online SVM can achieve a time saving of 52.28% with a degradation of 1.928% in the bitrate (BR). However, the proposed deep CNN can reduce the encoding time by 53.99% with 0.195% BR degradation. Compared to the state-of-the art, the two proposed approaches outperform the related works in terms of both RD performance and complexity reduction at inter-mode.

[1]  Munchurl Kim,et al.  Fast CU Splitting and Pruning for Suboptimal CU Partitioning in HEVC Intra Coding , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Yang Wang,et al.  Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. , 2018, Cancer genomics & proteomics.

[3]  Jörn Ostermann,et al.  Deep learning-based intra prediction mode decision for HEVC , 2016, 2016 Picture Coding Symposium (PCS).

[4]  Hongliang Li,et al.  A Fast HEVC Inter CU Selection Method Based on Pyramid Motion Divergence , 2014, IEEE Transactions on Multimedia.

[5]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[6]  Xu Wang,et al.  Fuzzy SVM-Based Coding Unit Decision in HEVC , 2018, IEEE Transactions on Broadcasting.

[7]  NebutaFestival,et al.  Fast HEVC Encoding Decisions Using Data Mining , 2022 .

[8]  Nouri Masmoudi,et al.  Fast Motion Estimation’s Configuration Using Diamond Pattern and ECU, CFM, and ESD Modes for Reducing HEVC Computational Complexity , 2019 .

[9]  Zhenyu Liu,et al.  CU Partition Mode Decision for HEVC Hardwired Intra Encoder Using Convolution Neural Network , 2016, IEEE Transactions on Image Processing.

[10]  Long Xu,et al.  Machine Learning-Based Coding Unit Depth Decisions for Flexible Complexity Allocation in High Efficiency Video Coding , 2015, IEEE Transactions on Image Processing.

[11]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[12]  Guillermo Botella Juan,et al.  Fast and effective CU size decision based on spatial and temporal homogeneity detection , 2018, Multimedia Tools and Applications.

[13]  Zhiyong Gao,et al.  Fast HEVC intra mode decision based on logistic regression classification , 2016, 2016 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[14]  Anil Fernando,et al.  Content-Adaptive Feature-Based CU Size Prediction for Fast Low-Delay Video Encoding in HEVC , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Wen Gao,et al.  Low complexity encoder optimization for HEVC , 2016, J. Vis. Commun. Image Represent..

[16]  En-Hui Yang,et al.  Fully Connected Network for HEVC CU Split Decision equipped with Laplacian Transparent Composite Model , 2018, 2018 Picture Coding Symposium (PCS).

[17]  Zongju Peng,et al.  Binary and Multi-Class Learning Based Low Complexity Optimization for HEVC Encoding , 2017, IEEE Transactions on Broadcasting.

[18]  José Luis Martínez,et al.  Adaptive inter CU partitioning based on a look-ahead stage for HEVC , 2019, Signal Process. Image Commun..

[19]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Shengxi Li,et al.  Region-of-Interest Based Conversational HEVC Coding with Hierarchical Perception Model of Face , 2014, IEEE Journal of Selected Topics in Signal Processing.

[21]  Sergio Bampi,et al.  Fast Coding Unit Partition Decision for HEVC Using Support Vector Machines , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Nouri Masmoudi,et al.  Optimisation of HEVC motion estimation exploiting SAD and SSD GPU-based implementation , 2018, IET Image Process..

[23]  Mai Xu,et al.  A deep convolutional neural network approach for complexity reduction on intra-mode HEVC , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[24]  Yong-Jo Ahn,et al.  Square-type-first inter-CU tree search algorithm for acceleration of HEVC encoder , 2015, Journal of Real-Time Image Processing.

[25]  Jie Chen,et al.  Fast coding unit size selection for HEVC based on Bayesian decision rule , 2012, 2012 Picture Coding Symposium.

[26]  Jong-Hyeok Lee,et al.  Fast encoding algorithm for high-efficiency video coding (HEVC) system based on spatio-temporal correlation , 2015, Journal of Real-Time Image Processing.

[27]  Nouri Masmoudi,et al.  Real-time motion estimation diamond search algorithm for the new high efficiency video coding on FPGA , 2018 .

[28]  Wen Gao,et al.  Neural Network Based Inter Prediction for HEVC , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[29]  Xingming Sun,et al.  Adaptive Inter CU Depth Decision for HEVC Using Optimal Selection Model and Encoding Parameters , 2017, IEEE Transactions on Broadcasting.

[30]  Gary J. Sullivan,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[31]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .