论文信息 - An End-to-End Compression Framework Based on Convolutional Neural Networks

An End-to-End Compression Framework Based on Convolutional Neural Networks

Deep learning, e.g., convolutional neural networks (CNNs), has achieved great success in image processing and computer vision especially in high-level vision applications, such as recognition and understanding. However, it is rarely used to solve low-level vision problems such as image compression studied in this paper. Here, we move forward a step and propose a novel compression framework based on CNNs. To achieve high-quality image compression at low bit rates, two CNNs are seamlessly integrated into an end-to-end compression framework. The first CNN, named compact convolutional neural network (ComCNN), learns an optimal compact representation from an input image, which preserves the structural information and is then encoded using an image codec (e.g., JPEG, JPEG2000, or BPG). The second CNN, named reconstruction convolutional neural network (RecCNN), is used to reconstruct the decoded image with high quality in the decoding end. To make two CNNs effectively collaborate, we develop a unified end-to-end learning algorithm to simultaneously learn ComCNN and RecCNN, which facilitates the accurate reconstruction of the decoded image using RecCNN. Such a design also makes the proposed compression framework compatible with existing image coding standards. Experimental results validate that the proposed compression framework greatly outperforms several compression frameworks that use existing image coding standards with the state-of-the-art deblocking or denoising post-processing methods.

[1] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2] Michael K. Ng,et al. Reducing Artifacts in JPEG Decompression Via a Learned Dictionary , 2014, IEEE Transactions on Signal Processing.

[3] Deqing Sun,et al. Postprocessing of Low Bit-Rate Block DCT Coded Images Based on a Fields of Experts Prior , 2007, IEEE Transactions on Image Processing.

[4] Lucas Theis,et al. Lossy Image Compression with Compressive Autoencoders , 2017, ICLR.

[5] Jong Beom Ra,et al. Post-Processing for Blocking Artifact Reduction Based on Inter-Block Correlation , 2014, IEEE Transactions on Multimedia.

[6] Wen Gao,et al. Compression Artifact Reduction by Overlapped-Block Transform Coefficient Estimation With Block Similarity , 2013, IEEE Transactions on Image Processing.

[7] Shu-Jhen Fan-Jiang,et al. Self-learning-based post-processing for image/video deblocking via sparse representation , 2014, J. Vis. Commun. Image Represent..

[8] David Zhang,et al. Learning Convolutional Networks for Content-Weighted Image Compression , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9] Gregory K. Wallace,et al. The JPEG still picture compression standard , 1992 .

[10] Qing Ling,et al. D3: Deep Dual-Domain Based Fast Restoration of JPEG-Compressed Images , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Mohammed Ghanbari,et al. Standard Codecs: Image Compression to Advanced Video Coding , 2003 .

[12] Wen Gao,et al. CONCOLOR: Constrained Non-Convex Low-Rank Model for Image Deblocking , 2016, IEEE Transactions on Image Processing.

[13] Xiaolin Wu,et al. Data-Driven Soft Decoding of Compressed Images in Dual Transform-Pixel Domain. , 2016, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[14] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[15] Hongyang Chao,et al. Building Dual-Domain Representations for Compression Artifacts Reduction , 2016, ECCV.

[16] Matthias Bethge,et al. Generative Image Modeling Using Spatial LSTMs , 2015, NIPS.

[17] Wai-kuen Cham,et al. Image postprocessing by Non-local Kuan's filter , 2011, J. Vis. Commun. Image Represent..

[18] Yunjin Chen,et al. Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[20] Kyoung Mu Lee,et al. Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21] Xiaoou Tang,et al. Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22] Alessandro Foi,et al. Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , 2007, IEEE Transactions on Image Processing.

[23] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[24] Andrea Vedaldi,et al. MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.

[25] David Minnen,et al. Full Resolution Image Compression with Recurrent Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Xiaoou Tang,et al. Compression Artifacts Reduction by a Deep Convolutional Network , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[27] Lei Zhang,et al. Weighted Nuclear Norm Minimization with Application to Image Denoising , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[28] Wen Gao,et al. Reducing Blocking Artifacts in Compressed Images via Transform-Domain Non-local Coefficients Estimation , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[29] Kyoung Mu Lee,et al. Deeply-Recursive Convolutional Network for Image Super-Resolution , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[31] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[32] David Minnen,et al. Variable Rate Image Compression with Recurrent Neural Networks , 2015, ICLR.

[33] Karen O. Egiazarian,et al. Pointwise Shape-Adaptive DCT for High-Quality Denoising and Deblocking of Grayscale and Color Images , 2007, IEEE Transactions on Image Processing.

[34] Eduardo A. B. da Silva,et al. A generic post-deblocking filter for block based image compression algorithms , 2012, Signal Process. Image Commun..

[35] Koray Kavukcuoglu,et al. Pixel Recurrent Neural Networks , 2016, ICML.

[36] Valero Laparra,et al. End-to-end Optimized Image Compression , 2016, ICLR.

[37] Jun Zhou,et al. Adaptive non-local means filtering for image deblocking , 2011, 2011 4th International Congress on Image and Signal Processing.

[38] Weisi Lin,et al. Efficient Image Deblocking Based on Postfiltering in Shifted Windows , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[39] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40] Jie Ren,et al. Image Blocking Artifacts Reduction via Patch Clustering and Low-Rank Minimization , 2013, 2013 Data Compression Conference.