Learned Image Compression With Separate Hyperprior Decoders

Learned image compression techniques have advanced considerably in recent years. In this paper, we find that the performance bottleneck lies in the use of a single hyperprior decoder, in which case the ternary Gaussian model collapses to a binary one. To solve this, we propose using three hyperprior decoders to separate the decoding of the mixed parameters in discrete Gaussian mixture likelihoods, which yields more accurate parameter estimation. Experimental results demonstrate that the proposed method, when optimized for MS-SSIM, achieves an average BD-rate reduction of 3.36% compared with the state-of-the-art approach. The proposed method adds negligible coding time and FLOPs.
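To make the role of the mixed parameters concrete, the following is a minimal sketch of a discretized Gaussian mixture likelihood with three components, where the weights, means, and scales each come from their own function. The three `decode_*` functions are hypothetical toy stand-ins for the separate hyperprior decoders, not the networks used in the paper, and the scalar hyper-latent `z` is an assumption made for illustration only.

```python
import math


def std_normal_cdf(x):
    """CDF of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def softmax(logits):
    """Numerically stable softmax so the mixture weights sum to 1."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical stand-ins for three separate hyperprior decoders.
# Each maps the (toy, scalar) hyper-latent z to one parameter group.
def decode_weights(z):
    return softmax([0.5 * z, 0.0, -0.5 * z])


def decode_means(z):
    return [z - 1.0, z, z + 1.0]


def decode_scales(z):
    # exp keeps every scale strictly positive
    return [math.exp(0.1 * z), math.exp(0.2 * z), math.exp(0.3 * z)]


def gmm_likelihood(y, weights, means, scales):
    """Probability mass of the integer symbol y under the mixture.

    Each component integrates a Gaussian over [y - 0.5, y + 0.5].
    """
    total = 0.0
    for w, mu, s in zip(weights, means, scales):
        upper = std_normal_cdf((y + 0.5 - mu) / s)
        lower = std_normal_cdf((y - 0.5 - mu) / s)
        total += w * (upper - lower)
    return total


def estimated_bits(y, z):
    """Estimated code length of y in bits, given the hyper-latent z."""
    p = gmm_likelihood(y, decode_weights(z), decode_means(z), decode_scales(z))
    return -math.log2(max(p, 1e-9))  # lower bound avoids log(0)
```

In this sketch, a sharper match between the decoded parameters and the true distribution of `y` lowers `estimated_bits`, which is the quantity an entropy coder would approximately spend on the symbol.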
