Soft-to-Hard Vector Quantization for End-to-End Learned Compression of Images and Neural Networks

In this work we present a new approach to learning compressible representations in deep architectures with an end-to-end training strategy. Our method is based on soft (continuous) relaxations of quantization and entropy, which we anneal to their discrete counterparts over the course of training. We showcase this method on two challenging applications: image compression and neural network compression. While these tasks have typically been approached with different methods, our soft-to-hard quantization approach yields state-of-the-art results on both.
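To make the soft-to-hard relaxation concrete, the sketch below soft-assigns each symbol to a set of quantization centers via a softmax over negative squared distances, controlled by an annealing parameter sigma; as sigma grows, the soft assignments approach one-hot vectors and the output converges to hard nearest-center quantization. This is a minimal illustrative sketch of the relaxation described above, not the paper's implementation, and the names (soft_quantize, centers, sigma) are assumptions chosen for clarity.

```python
import numpy as np

def soft_quantize(z, centers, sigma):
    """Softly assign each scalar in z to the given quantization centers.

    As sigma -> infinity, the softmax weights become one-hot and the
    output approaches hard nearest-center (vector) quantization.
    """
    # Squared distances between each symbol and each center,
    # shape (len(z), len(centers)).
    d2 = (z[:, None] - centers[None, :]) ** 2
    # Softmax over centers, sharpened by the annealing parameter sigma.
    logits = -sigma * d2
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    w = np.exp(logits)
    w /= w.sum(axis=1, keepdims=True)
    # Soft-quantized values: a convex combination of the centers,
    # which stays differentiable in both z and centers.
    return w @ centers

# Annealing sigma drives the soft output toward hard quantization.
centers = np.array([-1.0, 0.0, 1.0])
z = np.array([0.3, -0.7, 0.9])
for sigma in [1.0, 10.0, 1000.0]:
    print(sigma, soft_quantize(z, centers, sigma))
```

Because the soft assignment is differentiable, gradients can flow through the quantizer during training, while the annealed hard limit recovers the discrete symbols needed for entropy coding at test time.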
