Learning a Code-Space Predictor by Exploiting Intra-Image-Dependencies

We introduce a neural network based image coding model that utilizes a code-space predictor to reduce code length by modelling dependencies within the code. Inspired by the prediction mechanism of inpainting, we learn a spatial predictor in the code space to efficiently deal with spatial dependencies. It is jointly trained with the codec to estimate a probability distribution from adjacent code symbols. The resulting code stores information related to the image and the prediction of neighbours. To improve its optimization, we adapt the Generalized Divisive Normalization into a sparse variant. The resulting model outperforms other prediction based methods. We show that when integrated with the recently proposed hyperprior model, our approach obtains state-of-the-art performance for CNN-based image codecs on the MS-SSIM scale.

[1]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[2]  David Minnen,et al.  Variational image compression with a scale hyperprior , 2018, ICLR.

[3]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Valero Laparra,et al.  End-to-end Optimized Image Compression , 2016, ICLR.

[5]  Lucas Theis,et al.  Lossy Image Compression with Compressive Autoencoders , 2017, ICLR.

[6]  David Zhang,et al.  Learning Convolutional Networks for Content-Weighted Image Compression , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Valero Laparra,et al.  Density Modeling of Images using a Generalized Normalization Transformation , 2015, ICLR.

[8]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[9]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[10]  Luca Benini,et al.  Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations , 2017, NIPS.

[11]  Luc Van Gool,et al.  Conditional Probability Models for Deep Image Compression , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]  Lubomir D. Bourdev,et al.  Real-Time Adaptive Image Compression , 2017, ICML.

[13]  Zhou Wang,et al.  Multi-scale structural similarity for image quality assessment , 2003 .

[14]  Touradj Ebrahimi,et al.  JPEG2000: The upcoming still image compression standard , 2001, Pattern Recognit. Lett..

[15]  David Minnen,et al.  Full Resolution Image Compression with Recurrent Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[17]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  David Minnen,et al.  Variable Rate Image Compression with Recurrent Neural Networks , 2015, ICLR.

[20]  Vivek K. Goyal,et al.  Theoretical foundations of transform coding , 2001, IEEE Signal Process. Mag..

[21]  Vladlen Koltun,et al.  Learning to Inpaint for Image Compression , 2017, NIPS.

[22]  David Minnen,et al.  Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.