Fast QTBT Partition Algorithm for JVET Intra Coding Based on CNN

The latest Joint Video Exploration Team (JVET) employs quad-tree plus binary-tree (QTBT) block partitioning structure, which can improve coding performance significantly than HEVC with hugely increased encoding complexity. Aiming at alleviating the encoding complexity, we propose a novel fast coding unit (CU) depth decision method based on convolution neural networks (CNN) in this paper. Specially, the QTBT partition depth range is modeled as a 5-class classification problem, and design a single CNN classifier try to predict the depth range of CU directly, rather than to judge split or not at each depth level. The proposed fast scheme has the advantage that it can make the trade-off between the RD performance and time saving by setting different partition method; our “RD Maintaining” setting can achieve 43.69\(\%\) complexity reduction with only 0.77\(\%\) Bjontegaard Delta bitrate (BD-rate) increase. And our “Low Complexity” setting can achieve 62.96\(\%\) complexity reduction with 2.06\(\%\) BD-rate increase.

[1]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Zhan Ma,et al.  Fast Intra Mode Decision for High Efficiency Video Coding (HEVC) , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Lu Yu,et al.  CU splitting early termination based on weighted SVM , 2013, EURASIP Journal on Image and Video Processing.

[4]  Zhenyu Liu,et al.  CU Partition Mode Decision for HEVC Hardwired Intra Encoder Using Convolution Neural Network , 2016, IEEE Transactions on Image Processing.

[5]  Ping An,et al.  Fast CU size decision and mode decision algorithm for HEVC intra coding , 2013, IEEE Transactions on Consumer Electronics.

[6]  Zhiyong Gao,et al.  Neyman-Pearson-Based Early Mode Decision for HEVC Encoding , 2016, IEEE Transactions on Multimedia.

[7]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[8]  Jianjun Lei,et al.  Fast Intra Prediction Based on Content Property Analysis for Low Complexity HEVC-Based Screen Content Coding , 2017, IEEE Transactions on Broadcasting.

[9]  Long Xu,et al.  Machine Learning-Based Coding Unit Depth Decisions for Flexible Complexity Allocation in High Efficiency Video Coding , 2015, IEEE Transactions on Image Processing.

[10]  Biao Min,et al.  A Fast CU Size Decision Algorithm for the HEVC Intra Encoder , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.