A Method of Pedestrian Fine-grained Attribute Detection and Recognition

The detection and recognition of pedestrians is one of the hottest research issues. This paper presents a method that employs the fusion of convolutional neural network (CNN) models based on multi-task learning for multi-attributes, aiming to solve the problem of a low degree of accuracy in detecting and recognizing pedestrians fine-grained attributes under complex circumstances. First, the model implements double detection for data preprocessing. Specifically, this paper uses the CNN model twice to detect pedestrians under complex circumstances. In the first detection, it conducts the coarse-grained detection for the whole pedestrian; in the second detection, based on the first, it conducts fine-grained detection of and then recognizes pedestrian subcomponents. Further, to address the problem of a low rate of correct recognition for the fine-grained detection of the attributes of pedestrian subcomponents, we use the concept of the fusion of multi-task learning for multi-attributes, select the best recognized result for every attribute according to the fused results of different CNN classification models by the arrogance voting method, and then use a customized decision function to achieve a more accurate recognition of the coarse-grained attributes of pedestrians. Experimental results show that the proposed method achieved better performance than other recognition methods of pedestrians attributes.

[1]  Jean-Philippe Thiran,et al.  Combining LiDAR space clustering and convolutional neural networks for pedestrian detection , 2017, 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[2]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Abhinav Gupta,et al.  A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Song-Chun Zhu,et al.  Attribute And-Or Grammar for Joint Parsing of Human Attributes, Part and Pose , 2016, ArXiv.

[5]  Yury Vizilter,et al.  Pedestrian detection in video surveillance using fully convolutional YOLO neural network , 2017, Optical Metrology.

[6]  Simone Calderara,et al.  Generative adversarial models for people attribute recognition in surveillance , 2017, 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[7]  Lin Liu,et al.  Highly Occluded Face Detection: An Improved R-FCN Approach , 2017, ICONIP.

[8]  Kaiqi Huang,et al.  A Richly Annotated Dataset for Pedestrian Attribute Recognition , 2016, ArXiv.

[9]  Jian Li,et al.  Hierarchical pedestrian attribute recognition based on adaptive region localization , 2017, 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[10]  Shengcai Liao,et al.  Multi-label convolutional neural network based pedestrian attribute classification , 2017, Image Vis. Comput..

[11]  Feihu Sun,et al.  Target detection-oriented model design for unmanned underwater vehicles , 2017, OCEANS 2017 - Aberdeen.

[12]  Xiaogang Wang,et al.  A Deep Sum-Product Architecture for Robust Facial Attributes Analysis , 2013, 2013 IEEE International Conference on Computer Vision.

[13]  Xiaolin Hu,et al.  Recurrent convolutional neural network for object recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Xiaogang Wang,et al.  Pedestrian detection aided by deep learning semantic tasks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Liu We Multi-Pose Pedestrian Detection Based on Posterior HOG Feature , 2015 .

[17]  Ricardo Matsumura de Araújo,et al.  On the Performance of GoogLeNet and AlexNet Applied to Sketches , 2016, AAAI.

[18]  Shaogang Gong,et al.  Person Re-identification by Attributes , 2012, BMVC.

[19]  Pedro Encarnação,et al.  Moving Path Following for Unmanned Aerial Vehicles With Applications to Single and Multiple Target Tracking Problems , 2016, IEEE Transactions on Robotics.

[20]  Chang Liu,et al.  Model predictive control-based target search and tracking using autonomous mobile robot with limited sensing domain , 2017, 2017 American Control Conference (ACC).

[21]  Xiaogang Wang,et al.  Deeply learned attributes for crowded scene understanding , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Ulf Bodin,et al.  Reusable road condition information system for traffic safety and targeted maintenance , 2017 .

[23]  Liang Zheng,et al.  Improving Person Re-identification by Attribute and Identity Learning , 2017, Pattern Recognit..

[24]  Kaiqi Huang,et al.  Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization , 2016, BMVC.

[25]  Curt H. Davis,et al.  Fusion of Deep Convolutional Neural Networks for Land Cover Classification of High-Resolution Imagery , 2017, IEEE Geoscience and Remote Sensing Letters.