Composition of Saliency Metrics for Pruning with a Myopic Oracle

The cost of Convolutional Neural Network (CNN) inference can be reduced by pruning weights from a trained network, eliminating computations while preserving predictive accuracy up to some threshold. While many heuristic saliency metrics have been proposed to guide this process, the quality of the pruning decisions made by any one metric is highly context-sensitive: a metric that makes excellent pruning decisions for one network may make poor decisions for another. Traditionally, a single heuristic saliency metric is used for the entire pruning process. We show how to compose a set of these saliency metrics to form a much more robust (albeit still heuristic) saliency metric. The key idea is to exploit the cases where each base metric does well, and to avoid the cases where it does poorly by switching to a different metric. In an experimental evaluation of channel pruning on several popular CNNs trained on the CIFAR-10 and CIFAR-100 datasets, we show that the composite saliency metrics derived by our method consistently outperform all of their individual constituent metrics.
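To make the composition concrete, the sketch below shows one way a myopic oracle can arbitrate between base saliency metrics: at each pruning step it tentatively applies each metric's proposed channel removal, measures the resulting loss on a small calibration batch, and commits to whichever proposal hurts least. The toy two-layer network, the two base metrics (L1 weight norm and mean activation), and all helper names here are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy two-layer network: y = relu(x @ W1) @ W2.
# A "channel" is a hidden unit: column i of W1 together with row i of W2.
W1 = rng.normal(size=(16, 32))
W2 = rng.normal(size=(32, 4))
calib_x = rng.normal(size=(64, 16))                 # small calibration batch
calib_y = np.maximum(calib_x @ W1, 0.0) @ W2        # targets from the unpruned net

def loss(mask):
    """MSE on the calibration batch with hidden channels masked out."""
    hidden = np.maximum(calib_x @ W1, 0.0) * mask   # zeroed entries = pruned channels
    return float(np.mean((hidden @ W2 - calib_y) ** 2))

# Two heuristic base saliency metrics; each returns the index of the
# live channel it considers least important.
def l1_saliency(mask):
    norms = np.abs(W1).sum(axis=0) * mask
    norms[mask == 0] = np.inf                       # never re-propose pruned channels
    return int(np.argmin(norms))

def mean_activation_saliency(mask):
    acts = np.maximum(calib_x @ W1, 0.0).mean(axis=0) * mask
    acts[mask == 0] = np.inf
    return int(np.argmin(acts))

base_metrics = [l1_saliency, mean_activation_saliency]

mask = np.ones(32)
for step in range(24):                              # prune 24 of 32 channels
    # Myopic oracle: evaluate each metric's proposed removal and commit to
    # whichever one degrades the calibration loss the least.
    best_loss, best_channel = np.inf, None
    for metric in base_metrics:
        c = metric(mask)
        trial = mask.copy()
        trial[c] = 0.0
        trial_loss = loss(trial)
        if trial_loss < best_loss:
            best_loss, best_channel = trial_loss, c
    mask[best_channel] = 0.0
    print(f"step {step:2d}: pruned channel {best_channel:2d}, "
          f"calibration loss {best_loss:.4f}")
```

Because the oracle only looks one pruning step ahead (hence "myopic"), its per-step cost is one calibration-batch forward pass per base metric, which is what allows it to cheaply switch away from a metric whenever that metric starts making poor decisions.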
