Deep learning based HEVC in-loop filter and noise reduction

Abstract The lossy compression techniques at low bit rate often create ringing and contouring effects on the output images and introduce various blurring and distortion at block bounders. To overcome those compression artifacts different neural network based post-processing techniques have been experimented with over the last few years. The traditional loop-filter methods in the HEVC frame-work support two post-processing operations namely a de-blocking filter followed by a sample adaptive offset (SAO) filter. These operations usually introduce extra signaling bits and become overhead to the network with high-resolution video processing. In this study, we came up with a new deep learning-based algorithm for SAO filtering operations and substantiated the merits of the proposed method. We introduced a variable filter size sub-layered dense CNN (SDCNN) to improve the denoising operation and incorporated large stride deconvolution layers for further computation improvement. We demonstrate that our deconvolution model can effectively be trained by leveraging the high-frequency edge features learned in a shallow network using residual learning and data augmentation techniques. Extensive experiments show that our approach outperformed other state-of-the-art approaches in terms of SSIM, Bjontegaard delta bit-rate (BD-BR), BD-PSNR measurements on the standard video test set and achieves an average of 8.73 % bit rate saving compared to HEVC baseline.

[1]  Thomas Sikora,et al.  Adaptive Global Motion Temporal Filtering for High Efficiency Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[3]  Dong Liu,et al.  A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding , 2016, MMM.

[4]  Munchurl Kim,et al.  CNN-based in-loop filtering for coding efficiency improvement , 2016, 2016 IEEE 12th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP).

[5]  Xiaoyun Zhang,et al.  DVC: An End-To-End Deep Video Compression Framework , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Kyoung Mu Lee,et al.  Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[8]  Xiaoou Tang,et al.  Deep Convolution Networks for Compression Artifacts Reduction , 2016, ArXiv.

[9]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[10]  Kai Zeng,et al.  SSIM-inspired two-pass rate control for High Efficiency Video Coding , 2015, 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP).

[11]  Gary J. Sullivan,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Zhuowen Tu,et al.  Holistically-Nested Edge Detection , 2015, ICCV.

[14]  Xinfeng Zhang,et al.  Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding , 2019, IEEE Transactions on Image Processing.

[15]  Wen Gao,et al.  Low-Rank-Based Nonlocal Adaptive Loop Filter for High-Efficiency Video Compression , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Chia-Yang Tsai,et al.  Sample Adaptive Offset in the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Yong Fang,et al.  A Switchable Deep Learning Approach for In-Loop Filtering in Video Coding , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Qionghai Dai,et al.  Residual Highway Convolutional Neural Networks for in-loop Filtering in HEVC , 2018, IEEE Transactions on Image Processing.

[19]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[20]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[21]  Christopher Conly,et al.  Deep Learning Based HEVC In-Loop Filtering for Decoder Quality Enhancement , 2018, 2018 Picture Coding Symposium (PCS).

[22]  Charless C. Fowlkes,et al.  Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  H.de J. Ochoa-Domínguez,et al.  Descripción del nuevo estándar de video H.264 y comparación de su eficiencia de codificación con otros estándares , 2007 .

[24]  Chen Chen,et al.  A New HEVC In-Loop Filter Based on Multi-channel Long-Short-Term Dependency Residual Networks , 2018, 2018 Data Compression Conference.

[25]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[26]  L. Gool,et al.  Learning for Video Compression With Hierarchical Quality and Recurrent Enhancement , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  K. R. Rao,et al.  The H.264 Video Coding Standard , 2014, IEEE Potentials.

[28]  Nitish Srivastava,et al.  Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.

[29]  Minhua Zhou,et al.  HEVC Deblocking Filter , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[30]  Wen Gao,et al.  Parallel In-Loop Filtering in HEVC Encoder on GPU , 2018, IEEE Transactions on Consumer Electronics.

[31]  Jani Lainema,et al.  Adaptive deblocking filter , 2003, IEEE Trans. Circuits Syst. Video Technol..

[32]  Wen Gao,et al.  Nonlocal In-Loop Filter: The Way Toward Next-Generation Video Coding? , 2016, IEEE MultiMedia.

[33]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[34]  Christopher Conly,et al.  Fast Mode Decision In Hevc Intra Prediction, Using Region Wise CNN Feature Classification , 2018, 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[35]  Takashi Watanabe,et al.  Adaptive Loop Filtering for Video Coding , 2013, IEEE Journal of Selected Topics in Signal Processing.

[36]  Xiaoou Tang,et al.  Learning a Deep Convolutional Network for Image Super-Resolution , 2014, ECCV.

[37]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[38]  Sungjei Kim,et al.  Multi-modal/multi-scale convolutional neural network based in-loop filter design for next generation video codec , 2017, 2017 IEEE International Conference on Image Processing (ICIP).