Enhancing Person Re-Identification Based on Recurrent Feature Aggregation Network

This paper proposes a method for video-based person re-identification. Motivated by the capacity of Recurrent Feature Aggregation Network (RFA-Net) that allows to aggregate the image-level features over time-steps and to yield a sequence level feature. Our work improves the person re- identification performance by (1) proposing to use Gaussian of Gaussian (GOG) - a more powerful feature at image level instead of LBP-Color as in the original work and (2) applying metric learning techniques for person matching. To analyze the role of image feature and metric learning, we have performed extensive experiments on two public benchmark datasets including PRID 2011 and iLIDS- VID. The proposed solution obtains impressive results on the PRID 2011 dataset. The matching rate at rank 1 is increased by 16.8% compared to that of original method. For the iLID-SVID dataset, the matching rate at rank 1 of proposed method is increased 1.1% whereas these values at the latter ranks are higher than those of the original method by approximately 4%. In additional, the mean and standard deviation of the rank where the correct matching is found of the proposed method on the PRID 2011 and the iLIDS-VID are 2.32, 4.33 and 4.61, 7.67 while those of the original work are 3.50, 5.42 and 8.03, 17.52, respectively. These obtained results emphasize the robustness of the proposed method compared to the original one.

[1]  Yi Yang,et al.  Person Re-identification: Past, Present and Future , 2016, ArXiv.

[2]  Jin Wang,et al.  Temporally aligned pooling representation for video-based person re-identification , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[3]  Horst Bischof,et al.  Person Re-identification by Descriptive and Discriminative Classification , 2011, SCIA.

[4]  Shengcai Liao,et al.  Person re-identification by Local Maximal Occurrence representation and metric learning , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Xiang Li,et al.  Top-Push Video-Based Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Bingbing Ni,et al.  Person Re-identification via Recurrent Feature Aggregation , 2016, ECCV.

[7]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[8]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Zheng Liu,et al.  A fast adaptive spatio-temporal 3D feature for video-based person re-identification , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[10]  Bingpeng Ma,et al.  A Spatio-Temporal Appearance Representation for Video-Based Pedestrian Re-Identification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[11]  Qi Tian,et al.  Video-Based Person Re-identification by Deep Feature Guided Pooling , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[12]  Shaogang Gong,et al.  Person Re-identification by Video Ranking , 2014, ECCV.

[13]  Takahiro Okabe,et al.  Hierarchical Gaussian Descriptor for Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Jesús Martínez del Rincón,et al.  Recurrent Convolutional Network for Video-Based Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).