Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding

Recently, convolutional neural network (CNN) has attracted tremendous attention and has achieved great success in many image processing tasks. In this paper, we focus on CNN technology combined with image restoration to facilitate video coding performance and propose the content-aware CNN based in-loop filtering for high-efficiency video coding (HEVC). In particular, we quantitatively analyze the structure of the proposed CNN model from multiple dimensions to make the model interpretable and optimal for CNN-based loop filtering. More specifically, each coding tree unit (CTU) is treated as an independent region for processing, such that the proposed content-aware multimodel filtering mechanism is realized by the restoration of different regions with different CNN models under the guidance of the discriminative network. To adapt the image content, the discriminative neural network is learned to analyze the content characteristics of each region for the adaptive selection of the deep learning model. The CTU level control is also enabled in the sense of rate-distortion optimization. To learn the CNN model, an iterative training method is proposed by simultaneously labeling filter categories at the CTU level and fine-tuning the CNN model parameters. The CNN based in-loop filter is implemented after sample adaptive offset in HEVC, and extensive experiments show that the proposed approach significantly improves the coding performance and achieves up to 10.0% bit-rate reduction. On average, 4.1%, 6.0%, 4.7%, and 6.0% bit-rate reduction can be obtained under all intra, low delay, low delay P, and random access configurations, respectively.

[1]  D. Slepian Linear Least-Squares Filtering of Distorted Images , 1967 .

[2]  Thomas Kailath,et al.  A view of three decades of linear filtering theory , 1974, IEEE Trans. Inf. Theory.

[3]  Bhaskar Ramamurthi,et al.  Nonlinear space-variant postprocessing of block coded images , 1986, IEEE Trans. Acoust. Speech Signal Process..

[4]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .

[5]  Ming-Ting Sun,et al.  Video bridging based on H.261 standard , 1994, IEEE Trans. Circuits Syst. Video Technol..

[6]  Faouzi Kossentini,et al.  H.263+: video coding at low bit rates , 1998, IEEE Trans. Circuits Syst. Video Technol..

[7]  Jong Beom Ra,et al.  A deblocking filter with two separate modes in block-based video coding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[8]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[9]  Gerald Schaefer,et al.  UCID: an uncompressed color image database , 2003, IS&T/SPIE Electronic Imaging.

[10]  Jani Lainema,et al.  Adaptive deblocking filter , 2003, IEEE Trans. Circuits Syst. Video Technol..

[11]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[12]  Feng Wu,et al.  Overview of AVS video standard , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[13]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[14]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[15]  Charless C. Fowlkes,et al.  Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Minhua Zhou,et al.  HEVC Deblocking Filter , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Wen Gao,et al.  Adaptive loop filter with temporal prediction , 2012, 2012 Picture Coding Symposium.

[18]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19]  Chia-Yang Tsai,et al.  Sample Adaptive Offset in the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Thomas Sikora,et al.  Adaptive Global Motion Temporal Filtering for High Efficiency Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Takashi Watanabe,et al.  Adaptive Loop Filtering for Video Coding , 2013, IEEE Journal of Selected Topics in Signal Processing.

[23]  Wen Gao,et al.  Compression Artifact Reduction by Overlapped-Block Transform Coefficient Estimation With Block Similarity , 2013, IEEE Transactions on Image Processing.

[24]  Wen Gao,et al.  Group-Based Sparse Representation for Image Restoration , 2014, IEEE Transactions on Image Processing.

[25]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[26]  Xiaoou Tang,et al.  Compression Artifacts Reduction by a Deep Convolutional Network , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[27]  Wen Gao,et al.  AVS2 ? Making Video Coding Smarter [Standards in a Nutshell] , 2015, IEEE Signal Processing Magazine.

[28]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[29]  Wen Gao,et al.  Video Compression Artifact Reduction via Spatio-Temporal Multi-Hypothesis Prediction , 2015, IEEE Transactions on Image Processing.

[30]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[31]  Dong-Gyu Sim,et al.  Parallelized deblocking filtering of HEVC decoders based on complexity estimation , 2015, Journal of Real-Time Image Processing.

[32]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[34]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Wen Gao,et al.  Nonlocal In-Loop Filter: The Way Toward Next-Generation Video Coding? , 2016, IEEE MultiMedia.

[36]  Marta Karczewicz,et al.  Geometry transformation-based adaptive in-loop filter , 2016, 2016 Picture Coding Symposium (PCS).

[37]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[38]  Munchurl Kim,et al.  CNN-based in-loop filtering for coding efficiency improvement , 2016, 2016 IEEE 12th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP).

[39]  Zhenyu Liu,et al.  CU Partition Mode Decision for HEVC Hardwired Intra Encoder Using Convolution Neural Network , 2016, IEEE Transactions on Image Processing.

[40]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[41]  Wen Gao,et al.  Low-Rank Decomposition-Based Restoration of Compressed Images via Adaptive Noise Estimation , 2016, IEEE Transactions on Image Processing.

[42]  Kyoung Mu Lee,et al.  Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  M. Ramezanpour,et al.  An Efficient Deblocking Filter Algorithm for Reduction of Blocking Artifacts in HEVC Standard , 2016 .

[44]  Lei Zhang,et al.  Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising , 2016, IEEE Transactions on Image Processing.

[45]  Wen Gao,et al.  Low-Rank-Based Nonlocal Adaptive Loop Filter for High-Efficiency Video Compression , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[46]  Lei Zhang,et al.  Image Restoration: From Sparse and Low-Rank Priors to Deep Priors [Lecture Notes] , 2017, IEEE Signal Processing Magazine.

[47]  Fabien Racapé,et al.  Adaptive Clipping in JEM , 2017, 2017 Data Compression Conference (DCC).

[48]  Valero Laparra,et al.  End-to-end Optimized Image Compression , 2016, ICLR.

[49]  Dong Liu,et al.  A convolutional neural network approach for half-pel interpolation in video coding , 2017, 2017 IEEE International Symposium on Circuits and Systems (ISCAS).

[50]  Dong Liu,et al.  A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding , 2016, MMM.

[51]  Wen Gao,et al.  High-Efficiency Image Coding via Near-Optimal Filtering , 2017, IEEE Signal Processing Letters.

[52]  Xinfeng Zhang,et al.  Spatial-temporal residue network based in-loop filter for video coding , 2017, 2017 IEEE Visual Communications and Image Processing (VCIP).

[53]  Michael Unser,et al.  Deep Convolutional Neural Network for Inverse Problems in Imaging , 2016, IEEE Transactions on Image Processing.

[54]  Zulin Wang,et al.  Decoder-side HEVC quality enhancement with scalable convolutional neural network , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[55]  Wen Gao,et al.  Neural Network Based Inter Prediction for HEVC , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[56]  Qionghai Dai,et al.  Residual Highway Convolutional Neural Networks for in-loop Filtering in HEVC , 2018, IEEE Transactions on Image Processing.

[57]  Dong Liu,et al.  One-for-All: Grouped Variation Network-Based Fractional Interpolation in Video Coding , 2019, IEEE Transactions on Image Processing.

[58]  Xinfeng Zhang,et al.  Light Field Image Compression Using Generative Adversarial Network-Based View Synthesis , 2019, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.