Pedestrian Detection Using Regional Proposal Network with Feature Fusion

Pedestrian detection, which has broad application prospects in video security, robotics and self-driving vehicles etc., is one of the most important research fields in computer vision. Recently, deep learning methods, e.g., Region Proposal Network (RPN), have achieved major performance improvements in pedestrian detection. In order to further utilize the deep pedestrian features of RPN, this paper proposes a novel regional proposal network model based on feature fusion (RPN_FeaFus) for pedestrian detection. RPN_FeaFus adopts an asymmetric dual-path deep model, constructed by VGGNet and ZFNet, to extract pedestrian features in different levels, which are further combined through PCA dimension reduction and feature stacking to provide more discriminant representation. Then, the low-dimensional fusion features are adopted to detect the region proposals and train the classifier. Experimental results on three widely used pedestrian detection databases, i.e, Caltech database, Daimler database and TUD database, illuminate that RPN_FeaFus gains obvious performance improvements over its baseline RPN_BF, which is also competitive with the state-of-the-art methods.

[1]  Paulo Vinicius Koerich Borges,et al.  Pedestrian Detection Based on Blob Motion Statistics , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[4]  Bo Du,et al.  Target Detection Based on Random Forest Metric Learning , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[5]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[6]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[7]  Trevor Hastie,et al.  Additive Logistic Regression : a Statistical , 1998 .

[8]  Liang Lin,et al.  Is Faster R-CNN Doing Well for Pedestrian Detection? , 2016, ECCV.

[9]  S. Sathiya Keerthi,et al.  Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[10]  Bernt Schiele,et al.  CityPersons: A Diverse Dataset for Pedestrian Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Xiaogang Wang,et al.  Switchable Deep Network for Pedestrian Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Xiao Xiang Zhu,et al.  Hyperspectral and LiDAR Data Fusion Using Extinction Profiles and Deep Convolutional Neural Network , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[13]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Graham W. Taylor,et al.  Adaptive deconvolutional networks for mid and high level feature learning , 2011, 2011 International Conference on Computer Vision.

[15]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Pietro Perona,et al.  Quickly Boosting Decision Trees - Pruning Underachieving Features Early , 2013, ICML.

[17]  Erik D. Goodman,et al.  Integrating a statistical background- foreground extraction algorithm and SVM classifier for pedestrian detection and tracking , 2013, Integr. Comput. Aided Eng..

[18]  Dariu Gavrila,et al.  A new benchmark for stereo-based pedestrian detection , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[19]  Dariu Gavrila,et al.  Multi-cue pedestrian classification with partial occlusion handling , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.