Saliency-based Deep Multi-level Semantic Feature Fusion for Person Re-identification

Person re-identification (Re-ID) is a challenging problem due to external environmental disturbances and significant intra-class appearance variations. Discriminative person features should cover global and partial representations, and accordingly can describe high-level and middle-level semantic information of persons. To achieve this goal, a saliency-based deep multi-level semantic feature representation and fusion algorithm is proposed. Firstly, a feature fusion scheme at middle layer of our deep network is presented to effectively fuse global CNN features. Secondly, a part-based feature extraction method is designed to extract high-level semantic features. In addition, a parameter-free multi-scale saliency-based enhancement algorithm is proposed to compute patch-level saliency scores to enhance the distinctiveness of part-based features. Experimental results on three public datasets, namely, Market-1501, DukeMTMC-reID, and CHUK03, demonstrate the effectiveness of the proposed method compared with state-of-the-art Re-ID approaches.

[1]  Kaiqi Huang,et al.  Learning Deep Context-Aware Features over Body and Latent Parts for Person Re-identification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Q. Tian,et al.  GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval , 2017, ACM Multimedia.

[3]  Hantao Yao,et al.  Deep Representation Learning With Part Loss for Person Re-Identification , 2017, IEEE Transactions on Image Processing.

[4]  Qi Tian,et al.  Beyond Part Models: Person Retrieval with Refined Part Pooling , 2017, ECCV.

[5]  Heng Tao Shen,et al.  Person Re-identification Using Two-Stage Convolutional Neural Network , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[6]  Yi Yang,et al.  Pedestrian Alignment Network for Large-scale Person Re-Identification , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Liang Zheng,et al.  Re-ranking Person Re-identification with k-Reciprocal Encoding , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Frédéric Jurie,et al.  PCCA: A new approach for distance learning from sparse pairwise constraints , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Xiaogang Wang,et al.  Person Re-identification by Salience Matching , 2013, 2013 IEEE International Conference on Computer Vision.

[10]  Xu Lan,et al.  Deep Reinforcement Learning Attention Selection For Person Re-Identification , 2017, BMVC.

[11]  Shaogang Gong,et al.  Person Re-identification by Deep Learning Multi-scale Representations , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[12]  Errui Ding,et al.  Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition , 2018, ECCV.

[13]  Yifan Sun,et al.  SVDNet for Pedestrian Retrieval , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[14]  Tao Xiang,et al.  The Devil is in the Middle: Exploiting Mid-level Representations for Cross-Domain Instance Matching , 2017, ArXiv.

[15]  Shaogang Gong,et al.  Harmonious Attention Network for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.