Pedestrian Detection Using ACF Based Fast R-CNN

Accurate and efficient performance is an important requirement for pedestrian detection. In this paper, we propose a novel detection framework named as ACF Based Fast R-CNN (ABF-CNN). The ABF-CNN consists of a ACF proposal generation part and a Fast R-CNN detection network. The motivation to use the Aggregated Channel Features (ACF) is due to its real-time efficiency and effective performance. To achieve high accuracy, we further propose to make use of the deep learning method Fast-RCNN. Furthermore, in order to solve the problem that CNN based methods have difficulty in hard negative mining, we propose to integrate Online Hard Example Mining (OHEM) training strategy into our detection framework. By thoroughly analyzing and optimizing each step of pedestrian detection pipeline, we develop an accurate detection framework with low computational complexity. The experimental results demonstrate that our framework achieves state-of-the-art performance on Caltech pedestrian dataset with 17% miss rate.

[1]  Meng Wang,et al.  Scene-Specific Pedestrian Detection for Static Video Surveillance , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[5]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[6]  Abhinav Gupta,et al.  Training Region-Based Object Detectors with Online Hard Example Mining , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Xiaogang Wang,et al.  DeepReID: Deep Filter Pairing Neural Network for Person Re-identification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  David Ribeiro,et al.  A real-time Deep Learning pedestrian detector for robot navigation , 2017, 2017 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC).

[10]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Ran Xu,et al.  Cascaded L1-norm Minimization Learning (CLML) classifier for human detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[13]  Huimin Ma,et al.  3D Object Proposals for Accurate Object Class Detection , 2015, NIPS.