Perceptual-Based HEVC Intra Coding Optimization Using Deep Convolution Networks

In this paper, we propose a novel perceptual-based intra coding optimization algorithm for the High Efficiency Video Coding (HEVC) using deep convolution networks (DCNs). According to the saliency map, the algorithm can intelligently adjust bit rate allocation between the salient and non-salient regions of the video. The proposed strategy mainly consists of two techniques, saliency map extraction, and intelligent bit rate allocation. First, we train a DCN model to generate the saliency map that highlights semantically salient regions. Compared with the texture-based region of interest (ROI) extraction techniques, our model is more consistent with the human visual system (HVS). Second, based on the saliency map, a modified rate-distortion optimization (RDO) method is designed to adaptively adjust bit rate allocation. As a result, the quality of the salient regions will be improved by allocating more bits while allocating fewer bit rates for the non-salient regions. The experimental results demonstrate that our approach can deal with multiple types of video to enhance the visual experience. For conventional videos, the proposed method achieves 0.64-dB PSNR improvement for the salient regions and saves 3.02% bit rate on average compared with HM16.7. Moreover, for conversational videos, the proposed method can significantly reduce the bit rate by 8.65% without dropping the quality of important regions.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Zulin Wang,et al.  A ROI-based bit allocation scheme for HEVC towards perceptual conversational video coding , 2013, 2013 Sixth International Conference on Advanced Computational Intelligence (ICACI).

[3]  Min-Su Cheon,et al.  Improved Video Compression Efficiency Through Flexible Unit Representation and Corresponding Extension of Coding Tools , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Jingning Han,et al.  A New Rate Control Scheme For Video Coding Based On Region Of Interest , 2017, IEEE Access.

[5]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Kemal Ugur,et al.  Intra Coding of the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Kebin Jia,et al.  Medical Ultrasound Video Coding with H.265/HEVC Based on ROI Extraction , 2016, PloS one.

[8]  David Yee,et al.  Medical image compression based on region of interest using better portable graphics (BPG) , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[9]  Shengxi Li,et al.  Region-of-Interest Based Conversational HEVC Coding with Hierarchical Perception Model of Face , 2014, IEEE Journal of Selected Topics in Signal Processing.

[10]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  David Zhang,et al.  Learning Convolutional Networks for Content-Weighted Image Compression , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]  King Ngi Ngan,et al.  Perceptual sensitivity-based rate control method for high efficiency video coding , 2015, Multimedia Tools and Applications.

[13]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[14]  He Special Section on the Joint Call for Proposals on High Efficiency Video Coding ( HEVC ) Standardization , 2011 .

[15]  James A. Storer,et al.  Semantic Perceptual Image Compression Using Deep Convolution Networks , 2016, 2017 Data Compression Conference (DCC).

[16]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Jing Chen,et al.  Perceptual feature guided rate distortion optimization for high efficiency video coding , 2017, Multidimens. Syst. Signal Process..

[18]  Mohamed-Chaker Larabi,et al.  Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization , 2018, IEEE Access.

[19]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[20]  Wen Gao,et al.  Surveillance video coding with quadtree partition based ROI extraction , 2013, 2013 Picture Coding Symposium (PCS).

[21]  Goswami Piyali,et al.  Low complexity in-loop skin tone detection for ROI coding in the HEVC encoder , 2016 .

[22]  Antonio Ortega,et al.  Rate-distortion methods for image and video compression , 1998, IEEE Signal Process. Mag..

[23]  Béatrice Pesquet-Popescu,et al.  ROI-based rate control using tiles for an HEVC encoded video stream over a lossy network , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[24]  Gary J. Sullivan,et al.  High efficiency video coding: the next frontier in video compression [Standards in a Nutshell] , 2013, IEEE Signal Processing Magazine.

[25]  Jeong-Hoon Park,et al.  Block Partitioning Structure in the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  Francesc Aulí Llinàs,et al.  JPEG2000 ROI coding through component priority for digital mammography , 2011, Comput. Vis. Image Underst..

[27]  Miguel Hernández-Cabronero,et al.  Graph-Based Rate Control in Pathology Imaging With Lossless Region of Interest Coding , 2018, IEEE Transactions on Medical Imaging.

[28]  Antti Hallapuro,et al.  Comparative Rate-Distortion-Complexity Analysis of HEVC and AVC Video Codecs , 2012, IEEE Transactions on Circuits and Systems for Video Technology.