DropPruning for Model Compression

Deep neural networks (DNNs) have achieved great success on a variety of challenging tasks. However, most successful DNNs are structurally complex, which leads to large storage requirements and heavy floating-point computation. This paper proposes a novel technique, named Drop Pruning, that compresses a DNN by pruning weights from a dense, high-accuracy baseline model without accuracy loss. Drop Pruning follows the standard iterative prune-retrain procedure but applies a \emph{drop} strategy at each pruning step: \emph{drop out} stochastically deletes some unimportant weights, and \emph{drop in} stochastically recovers some previously pruned weights. \emph{Drop out} and \emph{drop in} address two drawbacks of traditional pruning methods: purely local importance judgments and an irreversible pruning process, respectively. Suitably chosen \emph{drop} probabilities steadily decrease the model size during pruning and drive it toward the target sparsity. Drop Pruning shares some of the spirit of dropout, of stochastic algorithms in integer optimization, and of the Dense-Sparse-Dense training technique, and it can significantly reduce overfitting while compressing the model. Experimental results demonstrate that Drop Pruning achieves state-of-the-art performance on several benchmark pruning tasks, about ${11.1\times}$ compression of VGG-16 on CIFAR-10 and ${14.3\times}$ compression of LeNet-5 on MNIST without accuracy loss, which may provide new insights into model compression.
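
To make the \emph{drop} strategy concrete, below is a minimal sketch of a single Drop Pruning step on one layer. It assumes magnitude-based importance, a fixed candidate fraction, and fixed drop probabilities; the names `drop_pruning_step`, `p_out`, `p_in`, and `candidate_frac` are illustrative and are not taken from the paper, whose exact importance criterion and probability schedule may differ.

```python
import numpy as np

def drop_pruning_step(weights, mask, p_out=0.1, p_in=0.05,
                      candidate_frac=0.3, rng=None):
    """One stochastic 'drop out' / 'drop in' update of a layer's pruning mask.

    weights        : np.ndarray, current weight tensor of the layer
    mask           : np.ndarray, binary mask (1 = kept weight, 0 = pruned weight)
    p_out          : probability of pruning a kept, low-magnitude candidate weight
    p_in           : probability of recovering a currently pruned weight
    candidate_frac : fraction of kept weights (by magnitude) eligible for drop out
    """
    rng = np.random.default_rng() if rng is None else rng
    magnitude = np.abs(weights)
    kept = mask.astype(bool)
    pruned = ~kept
    new_mask = mask.copy()

    # 'Drop out': among currently kept weights, the lowest-magnitude
    # candidate_frac fraction are candidates; each candidate is pruned
    # independently with probability p_out (stochastic, not greedy, pruning).
    if kept.any():
        threshold = np.quantile(magnitude[kept], candidate_frac)
        candidates = kept & (magnitude <= threshold)
        new_mask[candidates & (rng.random(weights.shape) < p_out)] = 0

    # 'Drop in': each weight that was pruned before this step is recovered
    # independently with probability p_in, so pruning is not irreversible.
    new_mask[pruned & (rng.random(weights.shape) < p_in)] = 1

    return weights * new_mask, new_mask


# Example usage on a random layer, starting from a dense mask
# (retraining between steps is omitted in this sketch).
w = np.random.randn(256, 128)
m = np.ones_like(w)
for _ in range(10):
    w, m = drop_pruning_step(w, m, p_out=0.2, p_in=0.05)
print("sparsity:", 1.0 - m.mean())
```

In this sketch, choosing `p_out` larger than `p_in` removes more weights per step than it recovers on average, so repeated prune-retrain iterations push the layer toward the target sparsity while still allowing mistakenly pruned weights to return.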
