论文信息 - Accelerate convolutional neural networks for binary classification via cascading cost-sensitive feature

Accelerate convolutional neural networks for binary classification via cascading cost-sensitive feature

Convolutional Neural Networks (CNNs) have delivered impressive state-of-the-art performances for many vision tasks, while the computation costs of these networks during test-time are notorious. Empirical results have discovered that CNNs have learned the redundant representations both within and across different layers. When CNNs are applied for binary classification, we investigate a method to exploit this redundancy across layers, and construct a cascade of classifiers which explicitly balances classification accuracy and hierarchical feature extraction costs. Our method cost-sensitively selects feature points across several layers from trained networks and embeds non-expensive yet discriminative features into a cascade. Experiments on binary classification demonstrate that our framework leads to drastic test-time improvements, e.g., possible 47.2x speedup for TRECVID upper body detection, 2.82x speedup for Pascal VOC2007 People detection, 3.72x for INRIA Person detection with less than 0.5% drop in accuracies of the original networks.

[1] Christophe Garcia,et al. Simplifying ConvNets for Fast Learning , 2012, ICANN.

[2] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3] Alan L. Yuille,et al. The Concave-Convex Procedure , 2003, Neural Computation.

[4] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.

[5] Berin Martini,et al. Large-Scale FPGA-based Convolutional Networks , 2011 .

[6] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[7] Andrew Zisserman,et al. Speeding up Convolutional Neural Networks with Low Rank Expansions , 2014, BMVC.

[8] Jürgen Schmidhuber,et al. Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Vincent Vanhoucke,et al. Improving the speed of neural networks on CPUs , 2011 .

[10] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[11] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[12] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[13] Joan Bruna,et al. Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation , 2014, NIPS.

[14] Kilian Q. Weinberger,et al. Classifier Cascade for Minimizing Feature Evaluation Cost , 2012, AISTATS.

[15] Paul A. Viola,et al. Robust Real-time Object Detection , 2001 .

[16] Paul A. Viola,et al. Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[17] Misha Denil,et al. Predicting Parameters in Deep Learning , 2014 .