Residual in Residual Based Convolutional Neural Network In-loop Filter for AVS3

Deep learning based video coding tools development has been an emerging topic recently. In this paper, we propose a novel deep convolutional neural network (CNN) based in-loop filter algorithm for the third generation of Audio Video Coding Standard (AVS3). Specifically, we first introduce a residual block based CNN model with global identity connection for the luminance in-loop filter to replace conventional rule-based algorithms in AVS3. Subsequently, the reconstructed luminance channel is deployed as textural and structural guidance for chrominance filtering. The corresponding syntax elements are also designed for the CNN based in-loop filtering. In addition, we build a large scale database for the learning based in-loop filtering algorithm. Experimental results show that our method achieves on average 7.5%, 16.9% and 18.6% BD-rate reduction under all intra (AI) configuration on common test sequences. In particular, the performance for 4K videos is 6.4%, 15.5% and 17.5% respectively. Moreover, under random access (RA) configuration, the proposed method brings 3.3%, 14.4%, and 13.6% BD-rate reduction separately.

[1]  Siwei Ma,et al.  Framework of AVS2-video coding , 2013, 2013 IEEE International Conference on Image Processing.

[2]  Eirikur Agustsson,et al.  NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3]  Bin Li,et al.  Fully Connected Network-Based Intra Prediction for Image Coding , 2018, IEEE Transactions on Image Processing.

[4]  Wen Gao,et al.  The second generation IEEE 1857 video coding standard , 2015, 2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP).

[5]  Qionghai Dai,et al.  Residual Highway Convolutional Neural Networks for in-loop Filtering in HEVC , 2018, IEEE Transactions on Image Processing.

[6]  Minhua Zhou,et al.  HEVC Deblocking Filter , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Xinfeng Zhang,et al.  Spatial-temporal residue network based in-loop filter for video coding , 2017, 2017 IEEE Visual Communications and Image Processing (VCIP).

[8]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[9]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Xiaoyun Zhang,et al.  Enhancing HEVC Compressed Videos with a Partition-Masked Convolutional Neural Network , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[11]  Wen Gao,et al.  Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame Generation , 2019, IEEE Transactions on Image Processing.

[12]  Dong Liu,et al.  A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding , 2016, MMM.

[13]  Lu Yu,et al.  An In-Loop Filter Based on Low-Complexity CNN using Residuals in Intra Video Coding , 2019, 2019 IEEE International Symposium on Circuits and Systems (ISCAS).

[14]  Xinfeng Zhang,et al.  Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding , 2019, IEEE Transactions on Image Processing.

[15]  Shiqi Wang,et al.  History-Based Motion Vector Prediction for Future Video Coding , 2019, 2019 IEEE International Conference on Multimedia and Expo (ICME).

[16]  Chia-Yang Tsai,et al.  Sample Adaptive Offset in the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Takashi Watanabe,et al.  Adaptive Loop Filtering for Video Coding , 2013, IEEE Journal of Selected Topics in Signal Processing.

[19]  Xinfeng Zhang,et al.  Enhanced Bi-Prediction With Convolutional Neural Network for High-Efficiency Video Coding , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Xinfeng Zhang,et al.  Image and Video Compression With Neural Networks: A Review , 2019, IEEE Transactions on Circuits and Systems for Video Technology.