Research on Low-Resolution Pedestrian Detection Algorithms based on R-CNN with Targeted Pooling and Proposal

We present an effective low-resolution pedestrian detection using targeted pooling and Region Proposal Network (RPN) in the Faster R-CNN. Our method firstly rearranges the anchor from the RPN exploiting an optimal hyper-parameter setting called "Elaborate Setup". Secondly, it refines the granularity in the pooling operation from the ROI pooling layer. The experimental results demonstrate that the proposed RPN together with fine-grained pooling, which we call LRPD-R-CNN is able to achieve high average precision and robust performance on the VOC 2007 dataset. This method has great potential in commercial values and wide application prospect in the field of computer vision, security and intelligent city.

[1]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[3]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Chin-Hui Lee,et al.  A maximal figure-of-merit learning approach to maximizing mean average precision with deep neural network based classifiers , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Hervé Glotin,et al.  Pedestrian Detection Based on Fast R-CNN and Batch Normalization , 2017, ICIC.

[6]  Zhaoqiang Xia,et al.  Multiorientation scene text detection via coarse-to-fine supervision-based convolutional networks , 2018, J. Electronic Imaging.

[7]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[8]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Abdenour Hadid,et al.  Unsupervised deep hashing for large-scale visual search , 2016, 2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA).

[10]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Joon Hee Han,et al.  Local Decorrelation For Improved Detection , 2014, ArXiv.

[12]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[14]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  David A. McAllester,et al.  Cascade object detection with deformable part models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.