论文信息 - Perceptual Image Compression using Relativistic Average Least Squares GANs

Perceptual Image Compression using Relativistic Average Least Squares GANs

In this work, we provide a detailed description on our submitted methods ANTxNN and ANTxNN_SSIM to Workshop and Challenge on Learned Image Compression (CLIC) 2021. We propose to incorporate Relativistic average Least Squares GANs (RaLSGANs) into Rate-Distortion Optimization for end-to-end training, to achieve perceptual image compression. We also compare two types of discriminator networks and visualize their reconstructed images. Experimental results have validated our method optimized by RaLSGANs can achieve higher subjective quality compared to PSNR, MS-SSIM or LPIPS-optimized models.

[1] Valero Laparra,et al. End-to-end Optimized Image Compression , 2016, ICLR.

[2] Jin Soo Choi,et al. Towards the Perceptual Quality Enhancement of Low Bit-rate Compressed Images , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3] Eirikur Agustsson,et al. Nonlinear Transform Coding , 2020, IEEE Journal of Selected Topics in Signal Processing.

[4] Jiro Katto,et al. Learned Image Compression With Discretized Gaussian Mixture Likelihoods and Attention Modules , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Lucas Theis,et al. Lossy Image Compression with Compressive Autoencoders , 2017, ICLR.

[6] Qian Zhang,et al. Variable-Rate Multi-Frequency Image Compression using Modulated Generalized Octave Convolution , 2020, 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP).

[7] Eirikur Agustsson,et al. Universally Quantized Neural Compression , 2020, NeurIPS.

[8] Alexei A. Efros,et al. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9] David Minnen,et al. Channel-Wise Autoregressive Entropy Models for Learned Image Compression , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[10] Zhibo Chen,et al. Causal Contextual Prediction for Learned Image Compression , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[11] P. Alam. ‘S’ , 2021, Composites Engineering: An A–Z Guide.

[12] David Zhang,et al. Learning Convolutional Networks for Content-Weighted Image Compression , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14] Jungwon Lee,et al. Variable Rate Deep Image Compression With a Conditional Autoencoder , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[15] Eirikur Agustsson,et al. NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[16] Zhisheng Zhong,et al. Ultra Low Bitrate Learned Image Compression by Selective Detail Decoding , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[17] Majid Rabbani,et al. An overview of the JPEG 2000 still image compression standard , 2002, Signal Process. Image Commun..

[18] Gregory K. Wallace,et al. The JPEG still picture compression standard , 1991, CACM.

[19] Donghyun Kim,et al. A Training Method for Image Compression Networks to Improve Perceptual Quality of Reconstructions , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[20] David Minnen,et al. Joint Autoregressive and Hierarchical Priors for Learned Image Compression , 2018, NeurIPS.

[21] Jiro Katto,et al. Deep Residual Learning for Image Compression , 2019, CVPR Workshops.

[22] Luc Van Gool,et al. Generative Adversarial Networks for Extreme Learned Image Compression , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[23] David Minnen,et al. Variable Rate Image Compression with Recurrent Neural Networks , 2015, ICLR.

[24] Yochai Blau,et al. Rethinking Lossy Compression: The Rate-Distortion-Perception Tradeoff , 2019, ICML.

[25] Alexia Jolicoeur-Martineau,et al. The relativistic discriminator: a key element missing from standard GAN , 2018, ICLR.

[26] Jooyoung Lee,et al. An End-to-End Joint Learning Scheme of Image Compression and Quality Enhancement with Improved Entropy Minimization. , 2019 .

[27] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[28] Gary J. Sullivan,et al. Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[29] Kiyoharu Aizawa,et al. Channel-Level Variable Quantization Network for Deep Image Compression , 2020, IJCAI.

[30] Zhou Wang,et al. Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[31] Jiro Katto,et al. Deep Convolutional AutoEncoder-based Lossy Image Compression , 2018, 2018 Picture Coding Symposium (PCS).

[32] Lubomir D. Bourdev,et al. Real-Time Adaptive Image Compression , 2017, ICML.

[33] Jiro Katto,et al. Learning Image and Video Compression Through Spatial-Temporal Energy Compaction , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Luca Benini,et al. Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations , 2017, NIPS.

[35] David Minnen,et al. Variational image compression with a scale hyperprior , 2018, ICLR.

[36] Eirikur Agustsson,et al. High-Fidelity Generative Image Compression , 2020, NeurIPS.