Reinforcement learning based coding unit early termination algorithm for high efficiency video coding

Abstract In this paper, we propose a Reinforcement Learning (RL) based Coding Unit (CU) early termination algorithm for High Efficiency Video Coding (HEVC). RL is utilized to learn a CU early termination classifier independent of depths for low complexity video coding. Firstly, we model the process of CU decision as a Markov Decision Process (MDP) according to the Markov property of CU decision. Secondly, based on the MDP, a CU early termination classifier independent of depths is learned from trajectories of CU decision across different depths with the end-to-end actor-critic RL algorithm. Finally, a CU decision early termination algorithm is introduced with the learned classifier, so as to reduce computational complexity of CU decision. We implement the proposed scheme with different neural network structures. Two different neural network structures are utilized in the implementation of RL based video encoder, which are evaluated to reduce video coding complexity by 34.34 % and 43.33 % . With regard to Bj o ntegaard delta peak signal-to-noise ratio and Bj o ntegaard delta bit rate, the results are −0.033 dB and 0.85 % , −0.099 dB and 2.56 % respectively on average under low delay B main configuration, when compared with the HEVC test model version 16.5.

[1]  Zhan Ma,et al.  Fast CU partition decision using machine learning for screen content compression , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[2]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Jennie Si,et al.  Online learning control by association and reinforcement , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[4]  Hongliang Li,et al.  MRF-Based Fast HEVC Inter CU Decision With the Variance of Absolute Differences , 2014, IEEE Transactions on Multimedia.

[5]  Eduardo Peixoto,et al.  Inter-Prediction Optimizations for Video Coding Using Adaptive Coding Unit Visiting Order , 2016, IEEE Transactions on Multimedia.

[6]  Zhan Ma,et al.  Fast Mode and Partition Decision Using Machine Learning for Intra-Frame Coding in HEVC Screen Content Coding Extension , 2016, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[7]  Zongju Peng,et al.  Binary and Multi-Class Learning Based Low Complexity Optimization for HEVC Encoding , 2017, IEEE Transactions on Broadcasting.

[8]  Zhi Liu,et al.  Effective CU Size Decision for HEVC Intracoding , 2014, IEEE Transactions on Image Processing.

[9]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[10]  Zulin Wang,et al.  Reducing Complexity of HEVC: A Deep Learning Approach , 2017, IEEE Transactions on Image Processing.

[11]  Jongho Kim,et al.  Adaptive Coding Unit early termination algorithm for HEVC , 2012, 2012 IEEE International Conference on Consumer Electronics (ICCE).

[12]  Jie Chen,et al.  Fast coding unit size selection for HEVC based on Bayesian decision rule , 2012, 2012 Picture Coding Symposium.

[13]  Gangyi Jiang,et al.  Machine learning based fast H.264/AVC to HEVC transcoding exploiting block partition similarity , 2016, J. Vis. Commun. Image Represent..

[14]  Rae-Hong Park,et al.  Fast CU Partitioning Algorithm for HEVC Using an Online-Learning-Based Bayesian Decision Rule , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[16]  Gangyi Jiang,et al.  Effective Data Driven Coding Unit Size Decision Approaches for HEVC INTRA Coding , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Madhukar Budagavi,et al.  Improving Intra Prediction in High-Efficiency Video Coding , 2016, IEEE Transactions on Image Processing.

[18]  Hyun Wook Park,et al.  A Fast Mode Decision Method in HEVC Using Adaptive Ordering of Modes , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Mei Yu,et al.  Statistical Early Termination Model for Fast Mode Decision and Reference Frame Selection in Multiview Video Coding , 2012, IEEE Transactions on Broadcasting.

[20]  Zhenyu Liu,et al.  CU Partition Mode Decision for HEVC Hardwired Intra Encoder Using Convolution Neural Network , 2016, IEEE Transactions on Image Processing.

[21]  Yong-Jo Ahn,et al.  Fast mode decision and early termination based on perceptual visual quality for HEVC encoders , 2017, Journal of Real-Time Image Processing.

[22]  Gangyi Jiang,et al.  Low Complexity HEVC INTRA Coding for High-Quality Mobile Video Communication , 2015, IEEE Transactions on Industrial Informatics.

[23]  Xu Wang,et al.  Fuzzy SVM-Based Coding Unit Decision in HEVC , 2018, IEEE Transactions on Broadcasting.

[24]  Wen-Hsiao Peng,et al.  HEVC/H.265 coding unit split decision using deep reinforcement learning , 2017, 2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS).

[25]  NebutaFestival,et al.  Fast HEVC Encoding Decisions Using Data Mining , 2022 .

[27]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[28]  Antonio Ortega,et al.  Optimal trellis-based buffered compression and fast approximations , 1994, IEEE Trans. Image Process..

[29]  Harvey J. Everett Generalized Lagrange Multiplier Method for Solving Problems of Optimum Allocation of Resources , 1963 .

[30]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[31]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[32]  Long Xu,et al.  Machine Learning-Based Coding Unit Depth Decisions for Flexible Complexity Allocation in High Efficiency Video Coding , 2015, IEEE Transactions on Image Processing.

[33]  Heiko Schwarz,et al.  Reinforcement learning for video encoder control in HEVC , 2017, 2017 International Conference on Systems, Signals and Image Processing (IWSSIP).