Pedestrian detection based on multi-convolutional features by feature maps pruning

Convolutional neural network (CNN) has developed such a large network size in last few years, so reducing the storage requirement without hurting its accuracy becomes necessary. In this paper, in order to reduce the number of high dimensional feature maps in shallow layers, we propose a feature map selection method, which cuts the feature map number by correlation coefficient between kernels and finishes detection by HOG+SVM method. Firstly, we extract feature maps of shallow layers from trained CNN. Then, we merge strongly relevant feature maps and choose all maps among weakly relevant feature maps by analyzing correlation coefficient of kernels. Finally, we extract HOG features of the chosen feature maps and use SVM to complete the training and classification. The experimental results show that the proposed method can effectively prune high dimensional feature maps and stabilize or even advance the performance in pedestrian detection.

[1]  Song Zheng,et al.  An Improved Moving Object Detection Algorithm Based on Frame Difference and Edge Detection , 2007, Fourth International Conference on Image and Graphics (ICIG 2007).

[2]  Xiaogang Wang,et al.  Learning Mutual Visibility Relationship for Pedestrian Detection with a Deep Model , 2016, International Journal of Computer Vision.

[3]  Pushmeet Kohli,et al.  On Detection of Multiple Object Instances Using Hough Transforms , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Xiaogang Wang,et al.  Joint Deep Learning for Pedestrian Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[7]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[8]  Alain Rakotomamonjy,et al.  A Pedestrian Detector Using Histograms of Oriented Gradients and a Support Vector Machine Classifier , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[9]  Gerald Penn,et al.  Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[12]  Tao Wang,et al.  End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[13]  Zheng Liu,et al.  Pedestrian detection in thermal images using adaptive fuzzy C-means clustering and convolutional neural networks , 2015, 2015 14th IAPR International Conference on Machine Vision Applications (MVA).

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Fatih Murat Porikli,et al.  Pedestrian Detection via Classification on Riemannian Manifolds , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Ming Yang,et al.  Compressing Deep Convolutional Networks using Vector Quantization , 2014, ArXiv.

[18]  Yu-Jin Zhang,et al.  Fast Human Detection by Boosting Histograms of Oriented Gradients , 2007, Fourth International Conference on Image and Graphics (ICIG 2007).

[19]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[20]  Yixin Chen,et al.  Compressing Neural Networks with the Hashing Trick , 2015, ICML.

[21]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[22]  Mei-Chen Yeh,et al.  Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[23]  Xiaogang Wang,et al.  DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection , 2014, ArXiv.