Learned Fast HEVC Intra Coding

In High Efficiency Video Coding (HEVC), excellent rate-distortion (RD) performance is achieved in part by having a flexible quadtree coding unit (CU) partition and a large number of intra-prediction modes. Such an excellent RD performance is achieved at the expense of much higher computational complexity. In this paper, we propose a learned fast HEVC intra coding (LFHI) framework taking into account the comprehensive factors of fast intra coding to reach an improved configurable tradeoff between coding performance and computational complexity. First, we design a low-complex shallow asymmetric-kernel CNN (AK-CNN) to efficiently extract the local directional texture features of each block for both fast CU partition and fast intra-mode decision. Second, we introduce the concept of the minimum number of RDO candidates (MNRC) into fast mode decision, which utilizes AK-CNN to predict the minimum number of best candidates for RDO calculation to further reduce the computation of intra-mode selection. Third, an evolution optimized threshold decision (EOTD) scheme is designed to achieve configurable complexity-efficiency tradeoffs. Finally, we propose an interpolation-based prediction scheme that allows for our framework to be generalized to all quantization parameters (QPs) without the need for training the network on each QP. The experimental results demonstrate that the LFHI framework has a high degree of parallelism and achieves a much better complexity-efficiency tradeoff, achieving up to 75.2% intra-mode encoding complexity reduction with negligible rate-distortion performance degradation, superior to the existing fast intra-coding schemes.

[1]  Yui-Lam Chan,et al.  Online-Learning-Based Bayesian Decision Rule for Fast Intra Mode and CU Partitioning Algorithm in HEVC Screen Content Coding , 2020, IEEE Transactions on Image Processing.

[2]  Kais Rouis,et al.  Clustering-based fast intra prediction mode algorithm for HEVC , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[3]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[4]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[5]  Xingang Liu,et al.  Fast CU Size Decisions for HEVC Intra Frame Coding Based on Support Vector Machines , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[6]  Giulia Boato,et al.  RAISE: a raw images dataset for digital image forensics , 2015, MMSys.

[7]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[8]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[9]  Marta Mrak,et al.  Decision Trees for Complexity Reduction in Video Compression , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[10]  Munchurl Kim,et al.  Fast CU Splitting and Pruning for Suboptimal CU Partitioning in HEVC Intra Coding , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Gangyi Jiang,et al.  Statistical Early Termination and Early Skip Models for Fast Mode Decision in HEVC INTRA Coding , 2019, ACM Trans. Multim. Comput. Commun. Appl..

[12]  Dongdong Zhang,et al.  Fast intra mode decision for HEVC based on texture characteristic from RMD and MPM , 2014, 2014 IEEE Visual Communications and Image Processing Conference.

[13]  Kemal Ugur,et al.  Intra Coding of the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Mengmeng Zhang,et al.  An adaptive fast intra mode decision in HEVC , 2012, 2012 19th IEEE International Conference on Image Processing.

[15]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[16]  Mai Xu,et al.  A deep convolutional neural network approach for complexity reduction on intra-mode HEVC , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[17]  H. Hochman,et al.  Pareto Optimal Redistribution , 1969 .

[18]  Zulin Wang,et al.  Reducing Complexity of HEVC: A Deep Learning Approach , 2017, IEEE Transactions on Image Processing.

[19]  Zhan Ma,et al.  Fast Intra Mode Decision for High Efficiency Video Coding (HEVC) , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Yi Li,et al.  Deformable Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[22]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[23]  Wen-Nung Lie,et al.  Fast intra mode decision and fast CU size decision for depth video coding in 3D-HEVC , 2019, Signal Process. Image Commun..

[24]  Stéphane Coulombe,et al.  Fast HEVC Intra Mode Decision Based on Edge Detection and SATD Costs Classification , 2015, 2015 Data Compression Conference.

[25]  Gerald Schaefer,et al.  UCID: an uncompressed color image database , 2003, IS&T/SPIE Electronic Imaging.

[26]  Muhammad Usman Karim Khan,et al.  An adaptive complexity reduction scheme with fast prediction unit decision for HEVC intra encoding , 2013, 2013 IEEE International Conference on Image Processing.

[27]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[28]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Debin Zhao,et al.  Gradient based fast mode decision algorithm for intra prediction in HEVC , 2012, 2012 2nd International Conference on Consumer Electronics, Communications and Networks (CECNet).

[30]  Sookyung Ryu,et al.  Machine Learning-Based Fast Angular Prediction Mode Decision Technique in Video Coding , 2018, IEEE Transactions on Image Processing.

[31]  Nan Hu,et al.  Fast intra mode decision for HEVC based on Transparent Composite Model , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[32]  Jörn Ostermann,et al.  Deep learning-based intra prediction mode decision for HEVC , 2016, 2016 Picture Coding Symposium (PCS).

[33]  Zhenzhong Chen,et al.  A fast mode decision algorithm for HEVC intra prediction , 2016, 2016 Visual Communications and Image Processing (VCIP).

[34]  Nan Hu,et al.  Fast Mode Selection for HEVC Intra-Frame Coding With Entropy Coding Refinement Based on a Transparent Composite Model , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[36]  Tian-Sheuan Chang,et al.  Fast intra prediction algorithm and design for high efficiency video coding , 2016, 2016 IEEE International Symposium on Circuits and Systems (ISCAS).

[37]  Eirikur Agustsson,et al.  NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[38]  Tao Zhang,et al.  Fast Intra-Mode and CU Size Decision for HEVC , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[39]  Zhan Ma,et al.  Fast Mode and Partition Decision Using Machine Learning for Intra-Frame Coding in HEVC Screen Content Coding Extension , 2016, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[40]  Kaisa Miettinen,et al.  Nonlinear multiobjective optimization , 1998, International series in operations research and management science.

[41]  Jaeho Lee,et al.  Fast PU Skip and Split Termination Algorithm for HEVC Intra Prediction , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[42]  Debin Zhao,et al.  Fast intra-encoding algorithm for High Efficiency Video Coding , 2014, Signal Process. Image Commun..

[43]  Qingfu Zhang,et al.  MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition , 2007, IEEE Transactions on Evolutionary Computation.

[44]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[45]  Warnakulasuriya Anil Chandana Fernando,et al.  Efficient coding unit size selection based on texture analysis for HEVC intra prediction , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[46]  Charles L. Lawson,et al.  Basic Linear Algebra Subprograms for Fortran Usage , 1979, TOMS.

[47]  K. R. Rao,et al.  Adaptive CU Mode Selection in HEVC Intra Prediction: A Deep Learning Approach , 2019, Circuits Syst. Signal Process..

[48]  Byeungwoo Jeon,et al.  Adaptive keypoint-based CU depth decision for HEVC intra coding , 2016, 2016 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[49]  Zhenyu Liu,et al.  CU Partition Mode Decision for HEVC Hardwired Intra Encoder Using Convolution Neural Network , 2016, IEEE Transactions on Image Processing.

[50]  Myung Hoon Sunwoo,et al.  Hierarchical fast mode decision algorithm for intra prediction in HEVC , 2015, 2015 IEEE International Symposium on Circuits and Systems (ISCAS).

[51]  Ping An,et al.  Fast CU size decision and mode decision algorithm for HEVC intra coding , 2013, IEEE Transactions on Consumer Electronics.

[52]  Wen-Hsiao Peng,et al.  HEVC/H.265 coding unit split decision using deep reinforcement learning , 2017, 2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS).