Multi-Scale Dense Convolutional Networks for Efficient Prediction

We introduce a new convolutional neural network architecture that adapts dynamically to computational resource limits at test time. The network combines progressively growing multi-scale convolutions with dense connectivity, which makes it possible to train multiple classifiers at intermediate layers of the network. We evaluate our approach in two settings: (1) anytime classification, where the network’s prediction for a test example is progressively refined, so that a prediction can be output at any time; and (2) budgeted batch classification, where a fixed computational budget is available for classifying a set of examples and can be spent unevenly across “easier” and “harder” inputs. Experiments on three image-classification datasets demonstrate that the proposed framework substantially improves the state of the art in both settings.
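The core mechanism described above is a network with classifiers attached at intermediate depths, so that inference can stop as soon as an early exit is confident enough. The following is a minimal PyTorch sketch of that early-exit idea, not the authors' implementation: all layer sizes, module names, and the confidence threshold are illustrative assumptions.

```python
# A minimal sketch (not the authors' implementation) of intermediate
# classifiers with confidence-based early exits, assuming PyTorch.
# All layer sizes, names, and the 0.9 threshold are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F


class EarlyExitCNN(nn.Module):
    """Small CNN backbone with a classifier attached after every stage."""

    def __init__(self, num_classes: int = 10):
        super().__init__()
        # Three feature stages of increasing depth and decreasing resolution.
        self.stage1 = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.stage2 = nn.Sequential(nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.stage3 = nn.Sequential(nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1))
        # One classifier per stage: earlier exits are cheaper but less accurate.
        self.head1 = nn.Linear(16, num_classes)
        self.head2 = nn.Linear(32, num_classes)
        self.head3 = nn.Linear(64, num_classes)

    def exits(self, x):
        """Lazily yield logits exit-by-exit, so deeper stages are only
        computed when an earlier exit was not confident enough."""
        f = self.stage1(x)
        yield self.head1(F.adaptive_avg_pool2d(f, 1).flatten(1))
        f = self.stage2(f)
        yield self.head2(F.adaptive_avg_pool2d(f, 1).flatten(1))
        f = self.stage3(f)
        yield self.head3(f.flatten(1))

    def forward(self, x):
        # For training with deep supervision, return every exit's logits
        # and sum the per-exit losses.
        return list(self.exits(x))


@torch.no_grad()
def budgeted_predict(model, x, threshold=0.9):
    """Classify a single example, stopping at the first exit whose top
    softmax score clears `threshold`; hard inputs fall through to the
    deepest classifier."""
    for logits in model.exits(x):
        conf, pred = F.softmax(logits, dim=1).max(dim=1)
        if conf.item() >= threshold:
            break
    return pred.item(), conf.item()


if __name__ == "__main__":
    model = EarlyExitCNN().eval()
    image = torch.randn(1, 3, 32, 32)  # one CIFAR-sized input
    label, confidence = budgeted_predict(model, image)
    print(f"predicted class {label} with confidence {confidence:.2f}")
```

Because the exits are yielded lazily, a confident early exit really does skip the deeper stages, which is what lets a fixed batch budget be spent unevenly across easy and hard inputs.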
