A Co-Prediction-Based Compression Scheme for Correlated Images

Deep learning has achieved a preliminary success in image compression due to the ability to learn the nonlinear spaces with compact features that training samples belong to. Unfortunately, it is not straightforward for the network based image compression methods to code multiple highly related images. In this paper, we propose a co-prediction based image compression (CPIC) which uses the multi-stream autoencoders to collaboratively code the multiple highly correlated images by enforcing the co-reference constraint on the multi-stream features. Patch samples fed into the multi-stream autoencoder, are generated through corresponding patch matching under permutation, which helps the autoencoder to learn the relationship among corresponding patches from the correlated images. Each stream network consists of encoder, decoder, importance map network and binarizer. In order to guide the allocation of local bit rate of the binary features, the important map network is employed to guarantee the compactness of learned features. A proxy function is used to make the binary operation for the code layer of the autoencoder differentiable. Finally, the network optimization is formulated as a rate distortion optimization. Experimental results prove that the proposed compression method outperforms JPEG2000 up to 1.5 dB in terms of PSNR.

[1]  Matthias Bethge,et al.  Generative Image Modeling Using Spatial LSTMs , 2015, NIPS.

[2]  Cheng-Yuan Liou,et al.  Autoencoder for words , 2014, Neurocomputing.

[3]  Valero Laparra,et al.  End-to-end Optimized Image Compression , 2016, ICLR.

[4]  Lucas Theis,et al.  Lossy Image Compression with Compressive Autoencoders , 2017, ICLR.

[5]  David Zhang,et al.  Learning Convolutional Networks for Content-Weighted Image Compression , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6]  Y. Oishi,et al.  Digital image compression using directional sub-block DCT , 2000, WCC 2000 - ICCT 2000. 2000 International Conference on Communication Technology Proceedings (Cat. No.00EX420).

[7]  David Minnen,et al.  Joint Autoregressive and Hierarchical Priors for Learned Image Compression , 2018, NeurIPS.

[8]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[9]  Wuzhen Shi,et al.  An End-to-End Compression Framework Based on Convolutional Neural Networks , 2017, 2017 Data Compression Conference (DCC).

[10]  Frank Harary,et al.  Graph Theory , 2016 .

[11]  David Minnen,et al.  Full Resolution Image Compression with Recurrent Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Richard G. Baraniuk,et al.  Democracy in Action: Quantization, Saturation, and Compressive Sensing , 2011 .

[13]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[14]  Daniel Rueckert,et al.  Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Yann Ollivier,et al.  Auto-encoders: reconstruction versus compression , 2014, ArXiv.

[16]  Olgica Milenkovic,et al.  A comparative study of quantized compressive sensing schemes , 2009, 2009 IEEE International Symposium on Information Theory.

[17]  Daan Wierstra,et al.  Towards Conceptual Compression , 2016, NIPS.

[18]  Mohammed Ghanbari,et al.  Standard Codecs: Image Compression to Advanced Video Coding , 2003 .

[19]  Cheng-Yuan Liou,et al.  Modeling word perception using the Elman network , 2008, Neurocomputing.

[20]  Koray Kavukcuoglu,et al.  Pixel Recurrent Neural Networks , 2016, ICML.

[21]  Li Dong,et al.  Adaptive downsampling to improve image compression at low bit rates , 2006, IEEE Transactions on Image Processing.

[22]  Heiko Schwarz,et al.  Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[23]  Jiwen Lu,et al.  Learning Cascaded Deep Auto-Encoder Networks for Face Alignment , 2016, IEEE Transactions on Multimedia.

[24]  Feng Wu,et al.  Efficient Multiple-Description Image Coding Using Directional Lifting-Based Transform , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[26]  Michael G. Strintzis,et al.  Lossless image compression based on optimal prediction, adaptive lifting, and conditional arithmetic coding , 2001, IEEE Trans. Image Process..

[27]  Luc Van Gool,et al.  Conditional Probability Models for Deep Image Compression , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[28]  Weisi Lin,et al.  Efficient Image Deblocking Based on Postfiltering in Shifted Windows , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  David Minnen,et al.  Variational image compression with a scale hyperprior , 2018, ICLR.

[31]  Feng Wu,et al.  Adaptive Directional Lifting-Based Wavelet Transform for Image Coding , 2007, IEEE Transactions on Image Processing.

[32]  Xiangjun Zhang,et al.  Low Bit-Rate Image Compression via Adaptive Down-Sampling and Constrained Least Squares Upconversion , 2009, IEEE Transactions on Image Processing.

[33]  Ephraim Feig,et al.  Image compression using spatial prediction , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[34]  Anastasios N. Venetsanopoulos,et al.  A JPEG-based interpolative image coding scheme , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[35]  Lei Zhang,et al.  Sparse Representation Based Image Interpolation With Nonlocal Autoregressive Modeling , 2013, IEEE Transactions on Image Processing.

[36]  Richard G. Baraniuk,et al.  Regime Change: Bit-Depth Versus Measurement-Rate in Compressive Sensing , 2011, IEEE Transactions on Signal Processing.

[37]  David Minnen,et al.  Variable Rate Image Compression with Recurrent Neural Networks , 2015, ICLR.

[38]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[39]  Dong Liu,et al.  Image Compression With Edge-Based Inpainting , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[40]  Raziel Haimi-Cohen,et al.  Compressive measurements generated by structurally random matrices: Asymptotic normality and quantization , 2016, Signal Process..

[41]  Xianming Liu,et al.  Data-Driven Soft Decoding of Compressed Images in Dual Transform-Pixel Domain , 2016, IEEE Transactions on Image Processing.

[42]  Guangming Shi,et al.  Binned Progressive Quantization for Compressive Sensing , 2012, IEEE Transactions on Image Processing.

[43]  Wen Gao,et al.  Compression Artifact Reduction by Overlapped-Block Transform Coefficient Estimation With Block Similarity , 2013, IEEE Transactions on Image Processing.