Generative Adversarial Network-Based Intra Prediction for Video Coding

In this paper, a novel intra prediction method is proposed to improve the video coding performance, in which the generative adversarial network (GAN) is adopted to intelligently remove the spatial redundancy with the inference process. The proposed GAN-based method improves the prediction by exploiting more information and generating more flexible prediction patterns. In particular, the intra prediction is modeled as an inpainting task, which is accomplished with the GAN model to fill in the missing part by conditioning on the available reconstructed pixels. As such, the learned GAN model is incorporated into both video encoder and decoder, and the rate-distortion optimization is performed for the competition between GAN-based intra prediction and traditional angular-based intra prediction to achieve better coding performance. The proposed scheme is implemented into the high-efficiency video coding test model (HM 16.17) and the versatile video coding test model (VTM 1.1). The experimental results show that the proposed algorithm can achieve 6.6%, 7.5%, and 7.5% under HM 16.17 and 6.75%, 7.63%, and 7.65% under VTM 1.1 bit rate savings on average for luma and chroma components in the intra coding scenario.

[1]  Alexei A. Efros,et al.  Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Madhukar Budagavi,et al.  Improving Intra Prediction in High-Efficiency Video Coding , 2016, IEEE Transactions on Image Processing.

[3]  Ankur Saxena,et al.  DCT/DST-Based Transform Coding for Intra Prediction in Image/Video Coding , 2013, IEEE Transactions on Image Processing.

[4]  Sekhar Mandal,et al.  A Group-Based Image Inpainting Using Patch Refinement in MRF Framework , 2018, IEEE Transactions on Image Processing.

[5]  Kemal Ugur,et al.  Intra Coding of the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Wen-Hsiao Peng,et al.  Intra Line Copy for HEVC Screen Content Intra-Picture Prediction , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Marta Mrak,et al.  Frequency-Domain Intra Prediction Analysis and Processing for High-Quality Video Coding , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Aidong Men,et al.  Intra prediction with enhanced inpainting method and vector predictor for HEVC , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Bin Li,et al.  Fully Connected Network-Based Intra Prediction for Image Coding , 2018, IEEE Transactions on Image Processing.

[10]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[11]  Heng-Ming Tai,et al.  Color-Direction Patch-Sparsity-Based Image Inpainting Using Multidirection Features , 2015, IEEE Transactions on Image Processing.

[12]  Tao Zhang,et al.  Signal Dependent Transform Based on SVD for HEVC Intracoding , 2017, IEEE Transactions on Multimedia.

[13]  Tao Zhang,et al.  Hybrid Intraprediction Based on Local and Nonlocal Correlations , 2018, IEEE Transactions on Multimedia.

[14]  Bing Zeng,et al.  DC Coefficient Estimation of Intra-Predicted Residuals in HEVC , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Chao Yang,et al.  Image Inpainting using Block-wise Procedural Training with Annealed Adversarial Counterpart , 2018, ArXiv.

[17]  Fatih Kamisli Intra prediction based on Markov process modeling of images , 2013, ICIP.

[18]  Thomas S. Huang,et al.  Free-Form Image Inpainting With Gated Convolution , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[19]  Jeong-Hoon Park,et al.  Block Partitioning Structure in the HEVC Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Oscar C. Au,et al.  Adaptive Block Coding Order for Intra Prediction in HEVC , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  D. Marpe,et al.  Neural network based intra prediction for video coding , 2018, Optical Engineering + Applications.

[22]  Moncef Gabbouj,et al.  A New Block-Based Method for HEVC Intra Coding , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Chih-Yang Lin,et al.  Predictive Texture Synthesis-Based Intra Coding Scheme for Advanced Video Coding , 2015, IEEE Transactions on Multimedia.

[24]  Yongdong Zhang,et al.  High Efficiency Video Coding: High Efficiency Video Coding , 2014 .

[25]  Jianle Chen,et al.  Enhanced Cross-Component Linear Model for Chroma Intra-Prediction in Video Coding , 2018, IEEE Transactions on Image Processing.

[26]  Gözde B. Ünal,et al.  Patch-Based Image Inpainting with Generative Adversarial Networks , 2018, ArXiv.

[27]  Gerald Schaefer,et al.  UCID: an uncompressed color image database , 2003, IS&T/SPIE Electronic Imaging.

[28]  Fatih Kamisli,et al.  Lossless Image and Intra-Frame Compression With Integer-to-Integer DST , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[30]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[31]  Hiroshi Ishikawa,et al.  Globally and locally consistent image completion , 2017, ACM Trans. Graph..

[32]  Bin Li,et al.  Efficient Multiple-Line-Based Intra Prediction for HEVC , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Xiangzhi Bai,et al.  Patch-Sparsity-Based Image Inpainting Through a Facet Deduced Directional Derivative , 2019, IEEE Transactions on Circuits and Systems for Video Technology.